Research On Human Behavior Recognition Based On Kinect And Bag-of-words Model

Posted on:2021-01-21

Degree:Master

Type:Thesis

Country:China

Candidate:S N Li

Full Text:PDF

GTID:2438330626455038

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Human behavior recognition is one of the hot research directions of computer vision.The emergence of low-cost depth sensors,such as Microsoft Kinect,can effectively avoid the influence of light,environment and other factors.It can obtain the depth information and skeleton information in the image,which provides a good data source for human behavior recognition research.Therefore,in the research of human behavior recognition,researchers gradually tend to use three-dimensional skeleton information.As a machine learning method,Bag of words(BoW)model is widely used in human motion recognition based on skeleton joints.At present,the recognition accuracy of this method still needs to be further improved.According to the current research situation,this paper mainly carries out the following work:(1)The recognition accuracy can be influenced by the shape of tester,as well as the distance and angle between tester and depth sensors in the Kinect based human action recognition.Therefore,this paper proposes the adaptive adjustment method and further to process uniformly the rotation angle of the skeleton motion,which can effectively process the depth datasets with different joint number and camera angle.(2)Recently,the methods for constructing feature vectors can be divided into three categories: coordinate,distance and orientation.The distance representation is robust to the position and orientation of human body relative to the camera.The orientation representation is robust to position and human body,shape of human body and the orientation relative to the camera.Based on the above three representations,this paper proposes to construct the action feature vectors by combining spatial descriptors and temporal descriptors,in which the spatial descriptors are constructed by the coordinates of joints,the angle between skeleton vectors,the direction angle and elevation angle of skeleton vectors and the temporal descriptors are constructed the difference of joint coordinates between frames.(3)In the traditional BoW model,the visual vocabulary is constructed with Kmeans algorithm whose performance is subject to the initial cluster centers.This paper improves the visual vocabulary construction method and proposes to construct the candidates set for the data points with larger hub values by using hubness phenomenon of high-dimensional data set.And the visual vocabulary is determined by maximin method.We evaluate the effectiveness of the proposed method on five public datasets,including CAD-60,UTKinect,MSR action 3D,UTD-MHAD and MSRC-12 datasets.The experimental results show that the proposed method performs better than several state-of-the-art algorithms.

Keywords/Search Tags:

Kinect, Bag of words model, visual vocabulary, action features, maximin method

PDF Full Text Request

Related items

1	Research On Aligning Visual Properties And The Corresponging Describing Words Of Images
2	Human Action Recognition By Fusing Video Information And Skeletal Point Data Based On Kinect Platform
3	Research On Indoor Mobile Robot Simultaneous Localization And Mapping Algorithm Using Kinect And Bag-of-words Model
4	Research On Action Perception Method For Service Robot Based On Local Spatiotemporal Features
5	Research On Bag Of Visual Words Based Image Classification
6	Research On Action Evaluation Method Based On Kinect
7	Research On Visual Tracking And Action Recognition Based On Bag-of-Words Model
8	Research On The Algorithms Of Video-Based Human Action Analysis
9	Research On Human Action Recognition And Detection Based On Multilayer Visual Features
10	Discovering And Exploiting The Structural Pattern Of Feature Vector Generated By Bag Of Visual Words Model Based Methods For Human Action Recognition