Research On End-To-End Binocular Stereo Matching Algorithm Based On Deep Learning

Posted on:2022-04-30

Degree:Master

Type:Thesis

Country:China

Candidate:X H Zhang

Full Text:PDF

GTID:2518306536490994

Subject:Control Science and Engineering

Abstract/Summary:

Binocular stereo vision has the characteristics of easy expansion,low cost and strong scene adaptability in acquiring scene depth information,so it is widely used in many frontier fields such as unmanned driving and augmented reality.Compared with traditional methods,the method based on convolutional neural network can significantly improve the accuracy and operating efficiency of stereo matching.The algorithm in this paper solves the current problems of binocular stereo matching from the following aspects.The current stereo matching work ignores the problem that the aggregation network needs more multi-scale context information for feature similarity learning.This paper proposes a 3D convolutional aggregation network which integrates the unique idea of hole space pyramid pooling,and optimizes the processing effect of stereo matching in the inherent ill conditioned region.There is an imbalance between the number of pixels in the non-occlusion area(simple samples)and the number of pixels in the occlusion area(difficult samples)of the stereo image data set.The loss function Balanced L1 Loss is introduced to replace the original Smooth L1 Loss of the benchmark model to perform end-to-end supervised training of the model,prompting the model to strengthen the learning of pixels in the unoccluded area that occupy most of the stereo image data set.Aiming at the defect that most of the current cost construction methods lack sufficient similarity measurement features for subsequent aggregation networks,a group-by-group AD^2 cost construction method is proposed.The group-by-group AD^2 cost can clearly determine the similarity between the corresponding element values of the left and right unary features,and provide sufficient feature similarity guidance for the subsequent aggregation network.In addition,a 3D convolutional aggregation network based on feature reuse is constructed to suppress the phenomenon of image feature loss,which solves the problem of image feature loss caused by the feature map in the down-sampling stage of different hourglass modules,which makes it difficult for the model to perform more accurate disparity estimation.

Keywords/Search Tags:

Binocular stereo matching, Multi-scale context information, Sample imbalance, Cost construction, Feature reuse

Related items

1	Research On End-to-End Binocular Stereo Matching Method Based On Deep Learning
2	Research On Binocular Stereo Vision Based On Multi-scale And Convolutional Neural Network
3	Research On Stereo Matching Algorithm And Application Based On Binocular Vision
4	Research On Optimization Of End-to-end Binocular Stereo Matching Algorithm Based On Convolutional Neural Network
5	Research On Binocular Stereo Matching Calculation Technology Based On Densely Connected Convolutional Neural Networks
6	Research On Real-time Binocular Stereo Matching Algorithm Based On Attention And Multi-scale Information
7	Research On Binocular Vision Matching Algorithm And Its Key Technologie
8	Research On Key Technique Of Stereo Matching Based On Binocular Vision
9	Research Of Stereo Matching Based On Deep Learning
10	Research On Stereo Matching Algorithm Based On Binocular Vision