Human Pose Estimation Method Based On Multi-task Learning

Posted on:2023-04-14

Degree:Master

Type:Thesis

Country:China

Candidate:X H Yu

Full Text:PDF

GTID:2568307145466194

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

With the development of deep learning and convolutional neural networks,the network model structure,the semantic information of feature extraction is stronger,the network is more robust,and it can perform end-to-end feature extraction and prediction,in the context of complex poses,occlusion scenes,and big data,pose estimation based on deep learning The method has become the mainstream method in academia as well as industry.The existing pose estimation task still has a single network prediction function,and the model is time-consuming,which is not conducive to product deployment and multi-tasking;therefore,based on the pose task,it is more important to perform multi-task learning based on joint detection and segmentation tasks,and how to optimize the pose at the same time.It is of great significance to estimate the computational load of the model and realize the fast and good learning of the pose estimation task.The main research contents and achievements of this paper are as follows:A pose estimation method for high-resolution representation is designed,which is based on the model architecture of Mask-RCNN,obtains each target instance,and then maps the detection results of the instance to the high-resolution features of the feature pyramid,and predicts the module through key points.Upsampling improves the feature resolution,and finally the predicted features are encoded into the spatial position of the classification task to obtain the two-dimensional spatial position of the key points for each instance.Comparing the designed high-resolution representation model with the existing multi-layer feature extraction algorithms,it can be seen from the experimental results that this method has higher accuracy than other methods.A pose estimation method based on multi-task learning is designed,which combines target detection,segmentation,and pose estimation into one model for learning;the method is based on the above-mentioned high-resolution representation pose prediction model,and then establishes instance segmentation prediction.Module,instance segmentation module adopts two types of prediction tasks of foreground and background,and obtains the segmentation information of each instance.Experiments show that the multi-task learning pose estimation method designed in this paper has higher performance than the single-task model and is more practical for functional tasks.Finally,based on the above multi-task pose estimation method,a lightweight and brief multi-task pose estimation model is designed.The main purpose of this method is to optimize the multi-task pose estimation model,and consider that the above multi task models are independent and have no information interaction,which is not conducive to promoting information learning among multi tasks.Therefore,this method first fuses the segmentation and attitude prediction module to extract the mutual information between them and reduce the amount of computation.Then,considering the contradiction between the dense foreground of segmentation and the sparse foreground of pose estimation,the segmentation and attitude are output from different stages.Finally,the channel number of pose estimation module is reduced,and the receptive field of attitude estimation is improved by convolution with atrous-convolution.The performance is guaranteed and the task of 2D pose estimation is faster.

Keywords/Search Tags:

Convolutional neural network, deep learning, multi-task learning, model optimization, high resolution representation

PDF Full Text Request

Related items

1	Research On Multi-Task Learning Method Based On Convolution Neural Network
2	Some Optimization Methods For Deep Convolutional Neural Network
3	Multi-Path Convolutional Neural Network With Cooperative Adversarial Learning For Image Super-Resolution
4	Research On Low Resolution Face Recognition Technology Based On Deep Learning
5	Research And Implementation Of Super-Resolution Reconstruction Of Massive Images
6	Research On Image Super-Resolution Reconstruction Method Based On Deep Learning
7	Research On Deep Learning-Based Representation Learning Algorithms
8	Research On Sentence Representation Based On Contrastive Learning And Deep Neural Network
9	Deep Learning For Very High Resolution Remote Sensing Data Analysis
10	Research On Mobile Robot Relocalization With Deep Learing