Deep Learning-based Algorithms For Visual Odometry

Posted on:2023-02-12

Degree:Master

Type:Thesis

Country:China

Candidate:S B Liang

Full Text:PDF

GTID:2568306815461864

Subject:Electronics and Communications Engineering

Abstract/Summary:

PDF Full Text Request

With the development of computer vision and the emergence of various high-performance vision sensors,visual odometry has been widely used in robots,autonomous vehicles,and other fields.Monocular visual odometry is the process of determining the position and orientation of a camera by analyzing a sequence of input images from a monocular camera and has the advantages of being low cost,simple to configure and computationally efficient.monocular visual odometry suffers from scale ambiguity due to the lack of depth information in the monocular camera.Deep Neural Networks(DNN)have been widely used in computer vision tasks,so it is important to study DNN-based monocular visual odometry.Firstly,a lightweight algorithm for local feature detection and description of images is designed to address the problems of large network model parameters and high ha rdware resource consumption of DNN-based feature extraction and description algorithms.In order to obtain multi-scale features of images,a backbone network based on dilated convolution and basic residual blocks is designed;the combination of attention m echanism and jump connection enables the fusion of low-level semantic information with high-level semantic information of the network while extracting better features.The experimental results show that with a model size of only 1MB and much smaller than similar methods,the mean matching accuracy(MMA)values are improved by 2.3%,2.4%,1.9%,1.4% and 1.1% respectively on the HPatches dataset compared to the results of the R2D2 algorithm when the threshold values are in the range of 2to 6 pixels.Secondly,the proposed feature detection and description network were integrated with the geometry-based odometry method to construct a hybrid method visual odometry and verify the algorithm in comparison on sequences 09 and 10 of the KITI Odometry dataset.Compared to the VISO2 algorithm,the proposed hybrid VO algorithm has a 55% and 53% reduction in translation error respectively.Finally,an end-to-end visual odometry calculation method is proposed to address the scale ambiguity problem in monocular visual odometry.A depth estimation network and a bit-pose estimation network are jointly trained in an unsupervised framework,and they are constrained to predict scale-consistent results.The algorithm is compared with the current mainstream endto-end VO and geometry-based algorithms such as ORB-SLAM2 to demonstrate the effectiveness of the algorithm.

Keywords/Search Tags:

Visual odometry, Deep learning, Transformer, Depth estimation, Attention mechanism

PDF Full Text Request

Related items

1	Research On Visual Odometry Based On Optical Flow And Depth Estimatio
2	Research Of Visual Odometry Technology Based On Unsupervised Deep Learning
3	Research On Visual Odometry Technology Based On Deep Learning
4	Research On Monocular Depth Estimation Algorithm Based On Deep Learning
5	Research On Monocular Visual Localization Algorithm Based On Joint Self-Supervised Learning
6	Image Depth Estimation Based On Deep Learning And Its Application
7	Research On Visual Odometry Based On Deep Learning
8	Research On Visual Odometry Method Based On Self-Supervised Deep Learning
9	Research On Visual Odometry Algorithm Based On Deep Learning
10	Research On Visual Inertial Odometry Algorithm Based On Deep Learning