Relative Camera Pose Estimation Using Deep Networks

Posted on:2019-01-31

Degree:Master

Type:Thesis

Country:China

Candidate:L J Zhang

Full Text:PDF

GTID:2348330545993381

Subject:Control Science and Engineering

Abstract/Summary:

The problem of estimating camera pose from visual information is called visual odometry (VO). Over the last twenty years, VO has been widely used in robot navigation. Most existing VO algorithms are based on geometric constraints and involve a series of tedious steps such as feature extraction, feature matching, and pose estimation. Geometry-based methods are affected by images shot in rainy or foggy weather, and the camera must be recalibrated whenever the platform changes. With the development of deep learning in recent years, some VO algorithms based on deep learning have appeared. These skip the tedious steps of geometry-based methods and obtain the pose in an end-to-end manner. This paper presents two novel methods, based on a convolutional neural network (CNN) and a recurrent convolutional neural network (RCNN), to estimate camera poses. Both are trained and tested on the KITTI VO dataset and show results comparable with several geometry-based VO algorithms. The main contributions of this work are as follows:

1. A method for generating pose labels for images from the KITTI VO dataset, including absolute poses for a single image and relative poses for two adjacent images. The generated poses are fed into the deep networks for training.

2. A method called CNN-VO for estimating the relative camera pose between two images. The input of the network is two adjacent images, and the output is the 6-D relative pose, consisting of a 3-D translation and a 3-D rotation. Image pairs in reversed order are also added to the training process to improve the network's generalization ability.

3. A method called CNN-LSTM-VO for estimating the relative camera poses of an image sequence. The input of the network is a sequence of raw RGB images, and the output is a sequence of relative poses between every two adjacent images. This method benefits from the RNN's strength in processing sequential signals and performs better than the pure CNN method. The training set is also enlarged by adding image sequences in reversed order to get better performance.
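
As an illustration of how relative pose labels might be derived from KITTI absolute poses, the following is a minimal sketch and not the thesis's actual code. It relies on the KITTI odometry ground-truth format (each line holds 12 values, a row-major 3x4 matrix [R|t]). The function names, the choice of ZYX Euler angles for the 3-D rotation, and the way the reversed-order label is formed (by swapping the two frames) are all assumptions made for this example; the abstract does not state which rotation parameterization is used.

```python
import numpy as np


def kitti_line_to_matrix(line):
    """Parse one line of a KITTI odometry pose file (12 floats, row-major 3x4)
    into a 4x4 homogeneous transform."""
    values = np.array(line.split(), dtype=np.float64)
    T = np.eye(4)
    T[:3, :4] = values.reshape(3, 4)
    return T


def relative_pose(T_a, T_b):
    """Transform of frame b expressed in frame a: inv(T_a) @ T_b."""
    return np.linalg.inv(T_a) @ T_b


def rotation_to_euler(R):
    """Convert a rotation matrix to (roll, pitch, yaw), assuming
    R = Rz(yaw) @ Ry(pitch) @ Rx(roll). The convention is an assumption."""
    sy = np.hypot(R[0, 0], R[1, 0])
    if sy > 1e-8:
        roll = np.arctan2(R[2, 1], R[2, 2])
        pitch = np.arctan2(-R[2, 0], sy)
        yaw = np.arctan2(R[1, 0], R[0, 0])
    else:
        # Near gimbal lock: yaw is not observable, so fix it to zero.
        roll = np.arctan2(-R[1, 2], R[1, 1])
        pitch = np.arctan2(-R[2, 0], sy)
        yaw = 0.0
    return np.array([roll, pitch, yaw])


def pose_to_vector(T):
    """Flatten a 4x4 transform into a 6-D label: 3-D translation + 3-D rotation."""
    return np.concatenate([T[:3, 3], rotation_to_euler(T[:3, :3])])


def make_labels(T_a, T_b):
    """Return the 6-D label for the pair (a, b) and for the reversed pair (b, a)."""
    forward = pose_to_vector(relative_pose(T_a, T_b))
    backward = pose_to_vector(relative_pose(T_b, T_a))
    return forward, backward
```

For two adjacent frames, `make_labels` yields one label for the original order and one for the reversed order, which is one plausible way to realize the reversed-pair augmentation described in contributions 2 and 3.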

Keywords/Search Tags:

Visual odometry, pose estimation, deep learning, convolutional neural networks, recurrent convolutional neural networks

Related items

1	Research On Visual Odometry Based On Deep Learning
2	Head Pose Estimation And 3D Face Reconstruction Using Convolutional Neural Networks
3	Pose Estimation Based On Attention-guided Deep Recurrent Neural Network
4	Research And Design Of Lip Reading Based On 3D Convolution
5	Research On Image Classification Algorithm Based On Circular Convolutional Neural Network
6	Visual Sparse Representation And Deep Ridgelet Networks Based Remote Sensing Image Fusion And Classification
7	Research Of Human Pose Estimation Method Based On Convolutional Neural Network
8	Application Of Deep Recurrent Neural Networks In Speaker Recognition On Mobile Phones
9	Action Recognition Based On Deep Neural Networks With Visual Attention Mechanism
10	Research On Monocular Visual Odometry And Loop Closure Detection Algorithm Based On Deep Learning