Study Of Video Human Action Recognition Based On Local Spatio-temporal Features

Posted on:2014-08-20

Degree:Master

Type:Thesis

Country:China

Candidate:T Wang

Full Text:PDF

GTID:2268330425966552

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Nowdays，in the field of computer vision and multimedia systems, the scientificcommunity has given particular attention to the automatic recognition and interpretation ofhuman behaviours. The applications of such research are in the context of classification andretrieval of multimedia contents, humancomputer interaction and surveillance systems.Recent works have been focused on describing human actions using local descriptiontechniques of moving patterns. The description is achieved through algorithms that detect anddescribe features who are named ‘space-time interest points’ in the spatio-temporaldomain.Representation based on local spatio-temporal feature has very good stability even ifsome action is changing of scale or orientation or illumination，therefor，the focus of thispaper researches video human action recognition based on local spatio-temporal feature.Firstly,this paper analyzes the basic traditional principles of video human actionrecognition system. According to various modules in the system,the system is divided intofeature extraction and model matching.In order to reduce the influence of the shortcomings ofcommon feature detector algorithm,this paper puts forward to a new feature extraction method.Then,in the phase of model matching, traditionally,the BoW model has much morereconstruction error due to the more restrictive constraint，this paper puts forward torepresentation before recognition using sparse coding instead of vector quantization,andcombines space pyramid with mapping three orthogonal planes (TOP) under max poolingmodel.At last, this paper designs experiments which divids video database into training videoand testing video by the ratio of7:3,which verifies the effctiveness of this system.In addition,this paper studies video human action recognition based on instance-to-class(I2C) distance.And it puts forward to using Naive Bayes Nearest Neighbor algorithm (NBNN)and Local Naive Bayes Nearest Neighbor algorithm (LNBNN) to classify test videos directly.Through experiment simulation, video human action recognition based on I2C distance has amore obvious performance improvement compared to the traditional SVM classificationwhich is based on instance-to-instance (I2I) distance.

Keywords/Search Tags:

Local spatio-temporal feature, Human action recongnition, Sparse coding, I2Cdistance

PDF Full Text Request

Related items

1	Research On The Local Spatio-Temporal Relationships Based Feature Model For Action Recognition
2	A Study Of Human Action Recognition Based On Spatio-temporal Features
3	Research On Human Action Recognition Based On Sparse Spatio-temporal Features
4	Research On Spatio-Temporal Feature Based Human Action Recognition
5	Human Action Recognition In Video Sequence
6	Research On Feature Extraction And Recognition Of Human Actions In Video Sequences
7	Human Action Recognition Based On Sparsely Coded Spatio-temporal Video Features
8	Constrained Sparse Coding Methods For Human Action Recognition In Video
9	Spatio-Temporal Interest Points (STIP) Based Method Of Recognizing Human Action
10	Local Feature And Generative Model For Human Action Recognition