Non-rigid Structure From Motion Based On Unsupervised Neural Network Model

Posted on:2024-09-09

Degree:Master

Type:Thesis

Country:China

Candidate:X Y Peng

Full Text:PDF

GTID:2568307115992809

Subject:Information and Communication Engineering

Abstract/Summary:

Non-rigid Structure from Motion (NRSfM) with a monocular camera is an active research topic in computer vision. Because the observed object deforms non-rigidly throughout the observation, the problem is inherently under-constrained. The reprojection constraint alone does not yield a unique, accurate solution, so additional constraints are required. Existing NRSfM methods handle 3D reconstruction well for simple motion, but still fall short for complex motion or for large numbers of feature points. In addition, because large, high-precision NRSfM datasets are lacking, it is more reasonable to train the network in an unsupervised manner, and the reconstruction results can then be compared with existing methods. To address these problems, this thesis proposes two NRSfM solutions based on unsupervised neural network models, applied to sparse and dense datasets of complex motion respectively. The main contributions are as follows:

1. For sparse datasets of complex motion, taking invariance and closure as the theoretical basis, this thesis proposes a sparse 3D motion reconstruction algorithm built on a self-supervised network and a Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP). Following the invariance principle that reconstructions from 2D observations of the same 3D structure should be similar under any viewpoint, the thesis applies graph convolution to 3D motion reconstruction for the first time and proposes a self-supervised network based on graph convolution and a Transformer encoder. Following the closure principle that the probability distributions of 2D projections should be similar, a 2D structure discriminator is added to this self-supervised network to form the WGAN-GP architecture. Extensive experiments analyze the importance of the adjacency matrix as prior knowledge and the effectiveness of the 2D structure discriminator.

2. For dense datasets, the thesis proposes the Reconstruction and Optimization Neural Network (RONN). RONN estimates depth instead of solving for the 3D structure directly, which reduces the theoretical amount of computation. RONN is implemented mainly with convolutional neural networks and has three modules, for fusion, reconstruction, and optimization respectively. The loss function consists mainly of two terms: a temporal smoothing loss and a Procrustes-alignment loss. The Minimum Singular Value Ratio (MSR) is used to weight both terms.

Experimental results show that, in the sparse case, the proposed model based on the self-supervised network and WGAN-GP achieves superior performance on the benchmark datasets and the CMU MOCAP dataset. RONN reconstructs well in the dense case and also performs well on sparse datasets.
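The two loss terms named for RONN can be illustrated with a minimal NumPy sketch. It is not the thesis's implementation: the function names, the array layout (frames x points x 3), the use of mean squared differences, and the rigid (rotation plus translation, no scaling) alignment are all assumptions made for illustration. The MSR weighting is omitted because its definition is not given in the abstract.

```python
import numpy as np


def procrustes_align(source, target):
    """Rigidly align `source` (P x 3) to `target` (P x 3) in the least-squares sense.

    Assumption: rotation + translation only (Kabsch algorithm), no scaling.
    """
    mu_s = source.mean(axis=0)
    mu_t = target.mean(axis=0)
    s = source - mu_s
    t = target - mu_t
    # SVD of the 3 x 3 cross-covariance gives the optimal orthogonal matrix.
    u, _, vt = np.linalg.svd(s.T @ t)
    # Flip the last axis if needed so the result is a proper rotation (det = +1).
    d = np.sign(np.linalg.det(u @ vt))
    rotation = u @ np.diag([1.0, 1.0, d]) @ vt
    return s @ rotation + mu_t


def procrustes_alignment_loss(source, target):
    """Mean squared point distance after aligning `source` to `target`."""
    aligned = procrustes_align(source, target)
    return float(np.mean(np.sum((aligned - target) ** 2, axis=-1)))


def temporal_smoothing_loss(shapes):
    """Mean squared displacement between consecutive frames of `shapes` (F x P x 3)."""
    diffs = shapes[1:] - shapes[:-1]
    return float(np.mean(np.sum(diffs ** 2, axis=-1)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    shape = rng.normal(size=(10, 3))
    theta = 0.7
    rot = np.array([
        [np.cos(theta), -np.sin(theta), 0.0],
        [np.sin(theta), np.cos(theta), 0.0],
        [0.0, 0.0, 1.0],
    ])
    # A rotated and shifted copy of the same shape should align almost exactly.
    moved = shape @ rot + np.array([1.0, -2.0, 0.5])
    print(procrustes_alignment_loss(moved, shape))
    # A sequence that never changes has zero temporal smoothing loss.
    print(temporal_smoothing_loss(np.stack([shape] * 4)))
```

In this sketch the alignment removes rigid motion before shapes are compared, so the loss penalizes only differences in shape, while the smoothing term penalizes frame-to-frame jumps in the reconstructed sequence.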

Keywords/Search Tags:

Non-rigid Body, Three-dimensional Reconstruction, Graph Convolution, Generative Adversarial Network, Minimum Singular Value Ratio

Related items

1	Research And Application Of Image-oriented 3D Reconstruction Of Non-rigid Body
2	Research And Implementation Of Defense Algorithm Against Adversarial Attack For Graph Convolution Neural Network
3	3D Non-rigid Human Body Reconstruction Based On Kinect Depth Sequence
4	Three-dimensional Reconstruction Of Non-rigid Target Based On RGBD Camera
5	Research On The Static Reconstruction Method Of Three-dimensional Human Body Based On Java 3D
6	The Research Of 3D Non-rigid Body Reconstruction In Trajectory Space Based On Probability Model
7	Using Structured Light Means For The Rapid 3D Reconstruction Of Non-rigid Objects
8	Research Of Face Image Inpainting Algorithm Based On Edge Guidance And Multi-Scale Dense Convolution
9	Research And System Implementation Of 3D Holographic Human Body Reconstruction Based On RGB-D Camera
10	Localization Of A Rigid Body With A Calibration Emitter In The Presence Of Rigid Body Sensor Uncertainties