Research On Depth Map Super-Resolution Algorithm Based On Guided Network Learning

Posted on:2024-01-29

Degree:Master

Type:Thesis

Country:China

Candidate:Y Zhang

Full Text:PDF

GTID:2568307136998399

Subject:Electronic and communication engineering

Abstract/Summary:

PDF Full Text Request

In the process of depth information acquisition,due to the limitations of consumer-grade cameras themselves and various external factors,the captured depth maps often suffer from poor quality and low resolution.Depth map super-resolution aims to reconstruct high-resolution depth images from low-resolution depth maps to improve the effectiveness of depth information in natural scene vision tasks.Traditional single depth map super-resolution techniques are very limited in their ability to recover fine structure and edge information.Models that use high-resolution color images to guide depth maps for super-resolution effectively improve the situation;however,such methods are limited by multimodal feature extraction and RGB texture over-transfer,and the recovered high-resolution depth maps cannot meet the needs of increasingly accurate techniques.In addition,reliability and real-time performance are also challenging tasks in complex scenarios such as human-computer interaction and autonomous driving.To further improve the depth map super-resolution performance to facilitate wider practical applications,this thesis proposes a depth map super-resolution algorithm based on a guided DCT multi-cascade pyramidal neural network and a guided spatial calibration transformer network.The details are as follows:(1)In response to the limitations of algorithm performance in color-guided depth map superresolution technology,such as cross-modal feature extraction and interaction,RGB texture replication.A depth map super-resolution network based on a guided DCT multi-level pyramid neural network is proposed in this thesis.The network forces learning of the difference information between highresolution RGB guide maps and low-resolution depth maps,and approximates the Ground Truth as the input of the network.By gradually extracting guide map features and depth map features through hierarchical sub-networks,the structural information of high-resolution color images is effectively guided for the super-resolution reconstruction of depth maps.The residual maps obtained from the reconstruction are again added to the depth maps for backpropagation correction to restore more edge details.In addition,considering the performance of discrete cosine transform in reconstructing multichannel high-resolution depth features,DCT is integrated as a module in each layer of the multi-level pyramid subnetwork.Guided by RGB features in the multi-channel feature domain,it can learn to obtain depth map features to improve the flexibility of the super-resolution network.(2)In response to the demand for algorithm reliability and real-time performance in complex scenarios such as human-machine interaction and autonomous driving,this thesis introduces a Transformer module with excellent model capacity and constructs a transformer based backbone network to serialize the input guidance and depth maps into multiple image blocks and integrate them into an encoding decoding architecture for comprehensive and multi-level feature extraction and interaction.It not only effectively reduces the generation of losses such as detail loss and noise introduction during depth map upsampling,but also reduces the model’s reliance on prior knowledge,and greatly improves the performance and efficiency of super-resolution reconstruction.At the back end of the network,this method also connects a spatial calibration module to refine high-frequency structural information and deep spatial attention to calibrate the attention representation of the transformer,further generating high-resolution depth maps with clearer boundaries.The proposed guided network learning-based depth map super-resolution algorithm generates sharper depth graphs based on a simplified framework.Experiments show that the depth map superresolution algorithm in this thesis has good robustness and adaptability on multiple data sets.

Keywords/Search Tags:

Depth image super-resolution, RGB guided image, multi-level pyramid, discrete cosine transform, Transformer backbone network, spatial calibration module

PDF Full Text Request

Related items

1	Fast Multi-focus Image Fusion Based On Region Division
2	Mutil-focus Image Fusion Based On Fully Convolution Neural Network
3	Learning-Based Image Super-Resolution
4	Research On Image Matching Algorithm Based On Siamese Network
5	Research On Image Super-resolution Algorithms Through Interpolation And Neighbor Embedding
6	Research On Super-resolution Reconstruction Algorithm Of Compressed Image
7	Research On Meta-transfer Learning For Multi-scale Image Super-resolution Algorithm
8	Study On The Reconstruction Of Spatial And Temporal Super Resolution Image Based On Bandelet Transform
9	Design And Research Of Discrete Cosine Transform IP Croe
10	Research On Image Super-Resolution Algorithm Based On Deep Learning