Monodepth Prediction Based On Deep Learning

Posted on:2021-08-16

Degree:Master

Type:Thesis

Country:China

Candidate:J Miao

Full Text:PDF

GTID:2518306476953379

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Monocular depth prediction is a research topic that has received much attention in computer vision,and has wide application such as autonomous driving,VR games,and film and television production.In recent years,deep learning has made breakthrough progress in computer vision.However,at present,there are still many problems to be solved in this field.For example,the process of using radar laser to collect depth data is costly and is greatly affected by weather,illumination and other objective factors;the method of depth information based on sparse depth map recovery has edge depth Continuous problems.Stereo images are used for training to improve the prediction accuracy of the depth prediction network.Researches in methods such as deep network structure,stereo matching,and disparity map optimization are essential.The specific work is as follows:1.The DenseNet convolution module is used to repalce the original network modules in the monocular depth prediction framework,which deepens the number of network layers and improve information transformation.Experiments show that the depth prediction framework based on DenseNet modules has less local errors and higher accuracy.2.Cost aggregation based on reconstruction to strengthen the binocular matching constraints is proposed in cost function.Post-processing optimization is also performed on the disparity map outputs.In this thesis,image reconstruction techniques are used at both RGB and disparities to compare the similarity between the reconstructed image and the opposite image of the sampled one.Cost aggregation strengthen the matching constraints of stereo training.At the same time,the smoothing and post-processing method are used to optimize the generated dense disparities,to make disparities smooth without losing the edge of the object.3.The depth prediction model based on KITTI datasets is integrated into a 2D to 3D system,Monocular videos are decomposed into image sequences and uses DIBR to synthesize virtual viewpoint images.Then generated images combines with the original ones to generate 3D images.Evaluates of the model from multiple angles can increase the practicality of the method.Experiments show that the proposed monodepth prediction framework based on DenseNet and cost aggregation based on reconstruction can effectively improve the monocular depth estimation accuracy.Compared with other depth prediction methods,the method proposed in this thesis has a smaller average error rate and the resulting depth map is smoother.

Keywords/Search Tags:

Monodepth Prediction, DenseNet, Stereo Matching, Virtual Viewpoint Synthesis

PDF Full Text Request

Related items

1	Depth Map Acquisition Based On Fast Image Matching And Virtual Viewpoint Rendering
2	Depth Extraction And Virtual View Synthesis From Multi-view Stereo Images
3	Key Technology Research For Embedded Free Viewpoint Video System
4	Stereo Matching And Virtual Viewpoint Algorithm Based On Binocular Stereo Vision System
5	Extraction Corresponding Feature And Image Synthesis Of New Viewpoint
6	Research On Virtual Viewpoint Synthesis Technology Based On Depth-image-based Rendering
7	SAD Stereo Matching Algorithm Based On Binocular Viewpoint
8	Study And Realization Of Stereo Matching Based Depth Extraction And Virtual View Synthesis In Dynamic Scenarios
9	Virtual Viewpoint Synthesis System Based On Multiple-Panoramas
10	Research On Depth Estimatipn And Video Coding Technologies In Free-Viewpoint Video