Research On Human Parsing Algorithm Based On Feature Fusion And Posture Correction

Posted on:2022-07-08

Degree:Master

Type:Thesis

Country:China

Candidate:R L Zhao

Full Text:PDF

GTID:2518306494469124

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Human parsing is usually considered as one of the basic tasks of human-centered visual understanding activities or event recognition and detection.Nonetheless,a large gap seems to exist between what is needed by the real-life applications and what is achievable based on modern computer vision techniques.With the explosion of massive visual media data in daily life,human body parsing task has gradually become a research hotspot in the field of computer vision.This paper will focus on two different tasks in human parsing.The first part focuses on the human parsing algorithm in single instance visual scene;the second part focuses on the perception and parsing algorithm of multi-instance in complex visual scene.The key technologies of this paper mainly focus on multi-level feature fusion of human parsing and case perception and parsing based on human posture.The main research contents and contributions of this paper are as follows:In order to solve the problem of detail missing and semantic confusion in human parsing algorithm,a new double pyramid unit parsing network is proposed.Firstly,a spatial pyramid unit is introduced to capture the multi-scale information of two low-level features,so as to enhance the ability of the model to capture tiny targets and overcome the problem of detail missing in the process of human parsing.Then,a context pyramid unit is used to aggregate the local to global semantic information of two high-level features to eliminate the parsing confusion in the process of human parsing.The method achieves better performance than the existing methods on single person and multi person datasets.At the same time,the speed can reach 41.2 FPS.In the multi-instance scene,aiming at the problem of incomplete parsing caused by irregular human posture(such as back or tilt)and cross occlusion,a multi-person human parsing network based on human posture correction is introduced for the first time.Starting from the inherent characteristics of the human body,firstly,the key points of the human body are used to sense and segment multiple instances,and different instance regions are extracted.Then,the affine transformation is used to correct some human features with irregular posture,so as to obtain a complete and accurate multi person analytic reasoning result.The experimental results show that the average accuracy and the average cross union ratio of the proposed method are71.91% and 59.61% respectively.

Keywords/Search Tags:

Human parsing, Multilevel mapping, Feature fusion, Posture correction

PDF Full Text Request

Related items

1	Research On Human Motion Posture Recognition Based On Multi-feature Fusion
2	Kinect-based Human Posture Recognition And Correction
3	Research On Face Posture Correction Algorithm Based On Multi-feature Fusion Network
4	A Research On IoT Posture Recognition Technology Based On Multi-source Information Fusion
5	Human Pose Detection And Implementation Based On Multimodal Information Fusion
6	Research On Larva Feature Extraction Technology Based On Bending Posture Correction
7	Research On Multilevel Mapping Between Difference-Features And Fusion Strategy For Dual-Modal Infrared Image Fusion
8	Research On Multi-feature Fusion Recognition Of Incorrect Sit Posture
9	Human Motion Detection And Posture Recognition Algorithm Research On Based On Computer Vision
10	The Human Body Posture Recognition Sysytem Based On Multi-sensor