Deep Learning 3D Object Detection

Posted on:2021-02-22

Degree:Master

Type:Thesis

Country:China

Candidate:Z X Wang

Full Text:PDF

GTID:2428330611466438

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

3D object detection is one of the more important tasks in computer vision,and it has important applications in automatic driving,robotic arm grasping,augmented reality and so-on.3D object detection on point clouds is more challenging than the 2D object detection on images.The sparse and irregular attributes of the point clouds ask for a higher requirement for the algorithm design.In this thesis,we focus on the algorithm of deep learning 3D object detection.The main contents are as follows:(1)We propose a novel method termed Frustum Conv Net(F-Conv Net)for amodal 3D object detection from point clouds.Leveraging current mature 2D object detectors to provide frustum proposals,we use a novel grouping mechanism � sliding frustums to aggregate local point-wise features.We transform unordered and irregular point clouds into regular feature maps for use of a subsequent fully convolutional neural network(FCN).Our proposed FConv Net achieves competitive results on both the indoor SUNRGBD dataset and the outdoor KITTI dataset,surpassing all methods in the same period.(2)We extend F-Conv Net to 3D object detection under binocular(stereo)vision setting.In this setting,we only use two color camera images as input and don't need any depth data such as Li DAR point clouds as input.Compared with current existing methods,our proposed F-Conv Net shows obvious advantages.(3)In view of the fact that current 3D object detection relies too much on 2D object detection,and 2D detection is difficult to detect occluded objects,we make an improvement on the 3D object detection algorithm Vote Net which does not depend on images.We propose the second extraction and aggregation of point clouds and features to achieve better boundary prediction.Compared with the baseline,we effectively improve the performance of 3D object detection on both SUNRGBD and Scan Net dataset.(4)Due to the weak feature extraction ability of Point Net++ network,we design a semantic-aware 3D object detection by taking the semantic category predicted by submanifold sparse convolution neural network UNet as extra input of each point.Compared with the benchmark method,the performance of 3D object detection on the Scan Net dataset has been greatly improved.(5)We extend object detection to more refined tasks � instance segmentation and panoptic segmentation.We design heuristic fusion algorithms to integrate the results of 3D object detection and semantic segmentation and achieve great performance on both point cloud instance segmentation and panoramic segmentation tasks of Scan Net dataset,outperforming all the published papers in the same period.

Keywords/Search Tags:

3D scene understanding, autonomous driving, convolutional neural network, 3D object detection, point cloud

PDF Full Text Request

Related items

1	Deep Learning-based 3D Object Detection In Point Cloud
2	Indoor Scene Understanding Based On Convolutional Neural Network And 3D Geometric Context Information
3	Research On Point Cloud Based On Object Detection Algorithm
4	Research On Outdoor Scene Understanding Using Deep Convolutional Neural Networks
5	Research On 3D Object Detection Technology Of Laser Point Cloud Based On Convolution Neural Network
6	Real-time Object Detection Based On Cascaded Neural Network
7	Graph Matching Based Scene Understanding And Reconstruction For Point Cloud Scene
8	A Research On Hierarchical Indoor And Outdoor Scene Understanding Method Based On 3D Point Clouds
9	Research On Scene Understanding Technology Of Indoor Service Robot Based On Deep Convolution Neural Networks
10	Research On 3D Object Detection Based On RGB And LIDAR Data