3D Shape Representation And Recognition Via Multi-modal Networks

Posted on:2022-10-04

Degree:Master

Type:Thesis

Country:China

Candidate:Y F Feng

Full Text:PDF

GTID:2568306326973519

Subject:Intelligent Science and Technology

Abstract/Summary:

PDF Full Text Request

3D object representation and recognition are crucial in manufacturing and intelligent transportation systems,which have attracted much attention.However,complex representations and organization of 3D object(such as point cloud,multi-view,voxels,mesh,etc.)increase the task’s difficulty.In the multi-modal representation learning of 3D models,a well-designed multi-modal learning framework solves not only the single-modal learning problem,but also fuses the data representations of different modalities.In this work,we address the above-mentioned problem from three perspectives.First,we propose a Group-view Convolutional Neural Network(GVCNN),which can recognize 3D objects based on each view’s discriminative for multi-view representations.Second,Point-view Network(PVNet)is designed for objection recognition from the joint representations of both point cloud and multi-view,which combine the high-dimensional features of point cloud and multi-view from both local and global perspectives.Third,we proposed Point-view Relation Network(PVRNet),which can automatically match views and point cloud for multi-modality fusion in 3D object recognition.We also provide the visualization results of the three network models,respectively.Experiments on ModelNet40 demonstrate the effectiveness of the proposed GVCNN,PVNet and PVRNet in 3D object classification and retrieval task,respectively.

Keywords/Search Tags:

3D Vision, Multi-view, Point Cloud, Multi-modal, Deep Learning

PDF Full Text Request

Related items

1	Research On Cross-modal Point Cloud Completion Based On Deep Learning
2	Research On Structure Based Multi-modal Data Analysis
3	Study On Multi-view Based 3D Model Retrieval
4	Research On 3D Point Cloud Reconstruction Algorithm Based On Monocular Vision Multi-View Geometry
5	Research On Human 3D Reconstruction System Based On Kinect Multi-view Point Cloud
6	Multi-view Neural Network Learning Approaches For Cross-modal Retrieval And Classification
7	Research On Improvement Of Multi-view Stereo Matching Network Based On Deep Learning
8	Research And Application Of 3D Shape Recognition Based On Multi-modal Feature Fusion
9	Point Cloud Deep Learning Based 3D Object Tracking
10	Research On Multi-view Stereo Reconstruction Under Complex Feature Conditions