Human Pose Synthesis Based On Two-stream Deep Neural Network

Posted on:2023-07-16

Degree:Master

Type:Thesis

Country:China

Candidate:J Zhang

Full Text:PDF

GTID:2558306914973199

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

In recent years,the application of virtual reality and human-computer interaction has risen sharply,and artificial intelligence algorithms for human joint posture have become an important topic in the field of computer vision.Human pose synthesis is a task of extracting the human motion pattern by observing the human motion sequences,and deriving the realistic poses of other actions.In the current research of human pose synthesis using deep neural network,how to design a reasonable network structure to meet the fusion of multi-modal spatial-temporal coupling features and how to accurately model the dynamic pattern and static pattern of different joint motions have become key issues.Aiming at the above issues,this study is carried out from the aspects of neural network depth,residual connection structure,multi-modal utilization and fusion,etc.The main achievements are as follows:1.A spatial-temporal two-stream network based on temporal consistency(TCNet)is proposed,which solves the problem of spatialtemporal derangement in the fusion process of a two-stream network.Aiming at the problem of weak spatial-temporal coupling in the traditional fusion method of two-stream network,a spatial-temporal coupling reinforcement module is constructed in the fusion process,and the fused feature is re-modeled to strengthen the spatial-temporal co-occurrence modeling of the network.Besides,TCNet effectively combines the advantages of velocity information and position information in short-term and long-term modeling respectively.Experiments show that TCNet has achieved excellent results on H3.6M,CMU-Mocap and 3DPW public datasets.2.A symmetric residual connection structure suitable for human pose synthesis is proposed,which enriches the feature scales in the process of motion pattern modeling and alleviates the gradient disappearance.The designed symmetric residual connection module maximizes the convolutional receptive field.Experiments show that the proposed symmetric residual network(SRNet)improves the utilization of shallow dynamic features,making the generated human pose more accurate by fewer parameters.3.A dynamic pattern-guided multigraph network(DP-MGnet)is proposed to collaboratively model joint dynamic features within similar dynamic pattern,which alleviates the pattern involvement.The network utilized a temporal reinforcement module,which enhances the features of key frames.Experiments shows that the dynamic features of the joints at various scales are extracted,which effectively preserves the motion pattern of the global j oint points,and finally achieves the precise generation of the human pose sequences.

Keywords/Search Tags:

human pose synthesis, spatial-temporal convolution, symmetric residual network, temporal consistency network, graph convolutional network

PDF Full Text Request

Related items

1	Research On Multichannel Speech Enhancement Algorithm Based On Spatial-Temporal Graph Convolutional Network
2	Research On Human Action Recognition Method Based On Adaptive Spatial-temporal Fusion Graph Convolutional Network
3	Research On Gait Recognition Combined With Human Pose And Graph Convolution
4	Pose Spatial-temporal Feature Extraction And Matching Based On Graph Neural Network
5	Research On The Graph Convolutional Models On Spatial-Temporal Networks And Their Applications
6	RSVP Target Detection Based On Temporal-Spatial Convolutional Neural Network
7	Research On Power System Prediction Task Based On Improved Temporal Convolution Network
8	Spatial And Temporal Context Analysis Of Complex Behaviors In Deep Networks
9	Research On 3D Human Pose Estimation Based On Spatiotemporal Semantic Graph Attention
10	Research On Human Action Recognition Algorithm Based On Spatio-temporal Graph Convolutional Network