Human Parsing Based On Deep Learning

Posted on:2021-04-11

Degree:Master

Type:Thesis

Country:China

Candidate:H L Zhao

Full Text:PDF

GTID:2428330605468056

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Human parsing has now become a relatively popular research direction,and the technology has many application scenarios,such as video monitoring,smart home,and human-computer interaction.Human body analysis and semantic segmentation are essentially the same,and belong to the classification problem at the pixel level,so most of the human body analysis models are improved based on the semantic segmentation model.However,many research work ignores the problems of the structured prior information between each part of the human body and the correlation between each pixel of the human body picture,and the experimental results of many studies show that the human body analysis performance is poor in complex scenes where multiple people exist,and it is impossible to outline all individuals in the predicted pictures.For the above problems,this article uses the deep learning method to analyze the human body in the natural state from the perspective of single-person analysis and multi-person analysis.The main work is as follows.Firstly,based on the research status of semantic segmentation and human parsing,the relationship and difference between semantic segmentation model and human parsing model are analyzed,and the problems existing in humam parsing model based on semantic segmentation are summarized and improved methods are proposed.Secondly,3533 human pictures were extracted from PASCAL VOC 2012,and the human tags were fused,and the PASCAL-Person-Parts-5 dataset and PASCAL-Person-Parts-7 dataset were established.The PASCAL-Person-Part-Edges-7 dataset was made using the Eight-Neighborhood edge point extraction algorithm.Thirdly,from the perspective of single-person analysis,it is found that there is structured prior information between various parts of the human body picture,and compared with scene pictures,there is a stronger correlation between the pixels of the human picture,so a human parsing model based on the conditional random fields and edge point penalty mechanism is proposed.Using conditional random fields to consider the influence of adjacent pixels and optimize the prediction results,and the edge point penalty mechanism is used to model the structured prior information of each part of the body.Experiments on the PASCAL-Person-Parts-5 and the Freiburg sitting people datasets prove that the model can improve the performance of human parsing model and improve the outline effect at the edges.Finally,from the perspective of multi-person analysis in complex scenes,it is found that the human body analysis model has a poor analysis effect on complex scenes of multiple people and connot accurately locate all individuals in the pictures.Therefore,in order to improve the positioning process of human parsing model,a human parsing model based on edge detection is proposed.Experiments on the PASCAL-Person-Parts-7 and PASCAL-Person-Parts-Edge-7 datasets prove that the model can indeed strengthen the positioning process,improve the analytical performance of the model,and improve the effect of multiple human parsing.

Keywords/Search Tags:

human parsing, semantic segmentation, edge point penalty mechanism, conditional random field, edge detection

PDF Full Text Request

Related items

1	Based On Edge Perception And Part Relation Reasoning Of Human Parsing
2	Research And Application Of Image Semantic Segmentation Algorithm Based On Conditional Random Field
3	Research On Human Semantic Segmentation Based On Deep Learning
4	Unsupervised Multi-class Segmentation Of SAR Images Using Triplet Markov Fields Models Based On Edge Penalty
5	Research On Infrared Image Semantic Segmentation Technology Based On Deep Learning
6	Researh On 3D Semantic Map Construction Based On Edge Detection
7	Research On The End-to-end Semantic Image Segmentation Algorithm Based On The Conditional Random Field Model
8	Semantic Segmentation Of 3D Point Clouds Based On Deep Learning
9	Research On Road Scene Segmentation Model Based On FCN And Conditional Random Field
10	Research On The Semantic Image Segmentation Based On The Deep Learning And The Conditional Random Fields Model