Research On Technologies Of View-based 3D Object Recognition

Posted on:2020-02-17

Degree:Master

Type:Thesis

Country:China

Candidate:X C Liu

Full Text:PDF

GTID:2428330590973224

Subject:Computer technology

Abstract/Summary:

3D object recognition is one of the important research directions in the field of object recognition.Especially in recent years,it plays a major role in the fields of robot capture,detection,automatic driving,assembly tasks and medical image analysis.The view-based algorithm is a popular trend recently compared to the shape-based detection method,the advantage of it is that it does not rely on complex 3D features and is assisted with large amount of data and mature advanced network framework,which is simple and efficient.Compared with the recognition of single-view images,multi-view images can complement detail features with each other,which plays a great role in the case of occlusion,shading and other difficult scenes.Based on the multi-view convolutional neural network,this paper compares and analyzes the influence of different perspective selection schemes with the model,and reflects on the multi-view feature fusion mode in the model.This paper proposes a pooling method based on perspective weighting which provides a richer view image feature for subsequent classification networks.Furthermore,in view of the regularity and timing of the multi-view data acquisition process,this paper introduces a recurrent neural network unit based on the convolutional neural network,and uses the recurrent neural network to fuse the historical view image information.At the same time,three different attention modules are designed in the network,so that each perspective extracts more useful details in the spatial dimension and channel dimension.Finally,in order to enable the model to have the ability to actively select the next best view,this paper introduces the reinforcement learning module,using the REINFORCE method with baseline,combined with the SGD algorithm for joint training.And in order to solve the perspective "Boundary effect" and sub-network training imbalance problem,this paper proposes a classification confidence-guided strategy gradient flow enhancement method.At the same time,a regularization term with a positional limit is added to the loss function to avoid selecting a viewing angle.They overlap each other to ensure that the selected perspective is more scattered around the three-dimensional object,thereby learning more global object features.

Keywords/Search Tags:

3D object detection, multi-view image, RNN, reinforcement learning

Related items

1	Research On Object Detection Based On Reinforcement Learning
2	Research On The Multi-view Object Detection With Sparse Representation
3	Research On The Key Technology And Application Of Multi-view Machine Vision Detection
4	Image Dehazing And Object Detection Methods Based On Binocular Reinforcement And Adaptive Features
5	Research On Multi-view Object Class Detection
6	Research Of LiDAR Object Detection Based On Multi-view Data Representation
7	Deep Learning Based 3D Object Reconstruction Using A View Planner
8	Research On Image Representation Learning Method Based On Self-supervised Learning And Its Application
9	Research On Object Detection Algorithm And Application Based On Deep Reinforcement Learning
10	Research On Object Detection And Tracking Based On Deep Learning