Upright Orientation Of 3D Shapes With Convolutional Networks

Posted on:2017-05-22

Degree:Master

Type:Thesis

Country:China

Candidate:Z S Liu

Full Text:PDF

GTID:2308330485453801

Subject:Computational Mathematics

Abstract/Summary:

PDF Full Text Request

Most objects are usually posed in their upright orientations, which makes them easily recognizable. Also, it is the very first step to pose the given 3D shapes in their upright orientations in many graphics tasks, such as matching, retrieval and shape anal-ysis. Moreover, it can be used to generate recognizable object thumbnails, helping the management of 3D shape repositories. Due to various reasons, many models in existing databases are not in their upright orientation.In this paper, we present a learning based method to predict the upright orienta-tion using 3D Convolutional Networks (ConvNets). Given voxel representations of 3D shapes and corresponding orientation vectors, this prediction task can be formulated as a regression problem. Leveraging the learning ability of deep neural networks, general categories of 3D shapes can be handled without making any assumptions such as sym-metry or parallelism. Besides mesh models, the proposed method can deal with shapes represented in other types that can be voxelized, such as implicit surfaces and point clouds.Compared with the ConvNets based approach, existing methods are limited by their predefined rules. Nevertheless, this observation is not applicable to all shapes. Thus learning based methods are appreciated to deal with general objects. Although the idea of data-driven has been adopted in existing approaches, the learning procedure is based on the hand-crafted features such as stability, visibility and parallelism, which are not suitable for general 3D shapes. By contrast, neural networks work in the style of end-to-end learning. High-level knowledges can be captured from raw data, without relying on object’s regularity such as explicit symmetry.However, a single ConvNet does not work well for all types of shapes. The key challenge is that each shape category exhibits particular characteristic on the upright orientation. This is referred to as interference effect which will lead to poor generaliza-tion. In other words, different strategies should be taken to handle diverse categories. Thus a divide-and-conquer scheme is used in our system. Each shape is first classified by a network and then fed into one of the orientation regression networks that are trained on each of the categories. Furthermore, a distance based clustering method is proposed to reduce the number of networks and a novel test-time augmentation procedure is used to improve the accuracy.The efficiency and effectiveness of this approach are demonstrated by extensive experiments. Our system achieved the accuracy of more than 90% on the test data and showed the generalization capability of inferring upright orientations for shapes not belonging to the training categories. Also experimental results showed that our system is able to handle several cases that other methods fail. Moreover, estimation for each shape took no more than 0.15 s on average, which is much faster than existing approaches, thus applicable to robotics tasks in which immediate feedback is required.

Keywords/Search Tags:

Upright Orientation, Data-Driven Shape Analysis, Voxelization, Convo- lutional Networks

PDF Full Text Request

Related items

1	Research On ToF Depth Map Denoising Based On Deep Neural Networks
2	Research On Voxelization And 3D Data Processing For True Three-dimension Display
3	Feature Augmentation And Model Build For Acoustic Scene Classification With Multiple Devices
4	Research On Low-Illumination Image Enhancement And Super-Resolution Reconstruction Algorithms
5	Three-Dimensional Volume Data Generation And Three-Dimensional Buffer Analysis
6	Research On Data-driven 3D Shape Generation
7	3D Shape Feature Extraction And Matching Based On Voxelization
8	Data-driven Techniques For Human Shape Completion
9	Semantics Driven 3D Shape Analysis And Modeling
10	Volume-based 3D Shape Retrieval And Morphing