Research On Scene Depth And Structure Prediction Method

Posted on:2021-02-13

Degree:Master

Type:Thesis

Country:China

Candidate:H Yao

Full Text:PDF

GTID:2428330626460364

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Deep prediction and structural reconstruction are a hot research topic in the field of computer vision and robotics.The construction of depth maps is an important prerequisite for three-dimensional reconstruction.Traditional methods mainly rely on determining the depth of fixed points for manual annotation or binocular positioning according to the change of camera position to predict depth,but such methods are time-consuming and labor-intensive on the one hand,and it is also limited by factors such as camera position,positioning method,probability of distribution,and other factors.It is difficult to guarantee the accuracy,which makes it difficult to complete the subsequent 3D reconstruction of the predicted depth map.This paper solves this problem by introducing a multi-task module-based deep learning method This problem specifically solves how to achieve the depth prediction of monocular images,combined with semantic segmentation,surface vector prediction,and three tasks to train together,enhance the feature sharing type,and improve the accuracy and structure of depth prediction.This paper proposes a multi-task model-based monocular image depth prediction network for scene images,which can simultaneously train three tasks of learning deep prediction,semantic segmentation,and surface vector estimation,including a common feature extraction module and a cross-task feature fusion module.Extract common features while ensuring the independence of each feature,improving the structure of each task while ensuring the accuracy of prediction.In addition,for the reconstruction of 3D structures,in order to solve the structural redundancy caused by multi-views,to prove that perspective prediction plays a role in promoting the ability of three-dimensional feature extraction and reconstruction quality,this paper proposes a point cloud reconstruction network based on perspective planning.This method based on deep learning,a three-dimensional point cloud reconstruction network for multi-view images is designed.The network includes a 3D geometric semantic feature extraction module,a three-dimensional point cloud reconstruction module and a perspective planning module.The entire network is a unified training architecture.The algorithm is used It can predict the three-dimensional point cloud structure of the image well when the angle of view information is small,and the prediction planning of the angle of view can also increase the efficiency,so as not to cause the problem of excessive calculation.

Keywords/Search Tags:

Computer Vision, Monocular Depth Estimation, Multi-task Model, 3D Point Cloud Reconstruction, Multi-view planning

PDF Full Text Request

Related items

1	Research On 3D Point Cloud Reconstruction Algorithm Based On Monocular Vision Multi-View Geometry
2	Research On 3D Reconstruction Algorithm Based On Multi-view Stereo
3	Multi-view 3D Point Cloud Reconstruction Algorithm Based On DNN
4	Research And Implementation Of Multi-view 3D Reconstruction Algorithm Based On Depth Estimation
5	Research On Improvement Of Multi-view Stereo Matching Network Based On Deep Learning
6	Research On 3D Human Body Point Cloud Real-Time Reconstruction System Based On Multi-View Depth Perception
7	Research On Monocular Depth Estimation Based On Deep Learning
8	Research On 3D Point Cloud Reconstruction Algorithm Based On Depth Image Prediction
9	Research On Depth Estimation Network And 3D Reconstruction Method Based On Multi-view
10	Multi-view 3D Reconstruction Using Monocular Microscopic Images