Research On Human Action Recognition Algorithm Based On Deep Learning

Posted on:2019-03-22

Degree:Master

Type:Thesis

Country:China

Candidate:X J Liu

Full Text:PDF

GTID:2348330545992097

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Action recognition is a research focus point in the computer vision and artificial intelligence field,and is widely used in many other fields,including intelligent surveillance systems,human-computer interaction,and motion analysis.However,human action is recognized as requiring a complicated procedure that includes spatial and temporal information from videos.Thus,effectively extracting video features and classify for action recognition is still considered a challenging problem.In this paper,we mainly studied human action recognition algorithms based on depth learning,which realized the automatic feature extraction and the accurate action classification.First of all,traditional convolution neural network(CNN)can not extract the dynamic feature of human action in video.To sovel this problem,we studied the construction method of spatio-temporal convolution neural network and proposed gray single channel CNN(Gray-CNN)and RGB three channel CNN(3Channel-CNN),which automatically extracted appearance and dynamic features of human action in videos without low-level features.Experimental results show that Gray-CNN is superior to methods based on handcraft features in the Weizmann dataset.Secondly,in order to improve the feature extraction and generalization ability of CNN,we proposed further quaternion spatial-temporal convolutional neural network(QST-CNN)for human action understanding.Then,the input for a QST-CNN utilizes a quaternion expression for an RGB image and the values of the red,green,and blue channels are considered simultaneously as a whole in a spatial convolutional layer,avoiding the loss of spatial relationships.The proposed QST-CNN then considers the dynamical information of adjacent frames in a temporal convolutional layer.Experimental results show that QST-CNN improved generalization ability and recognition rates.Then,human action is a continuous process based on video,confusing actions clips existing in similar actions video.In this paper,long term dependence between video clips were learned by combining the QST-CNN and long short-term memory network(LSTM),which can delete useless information and add associated information from video information by using threshold,distinguishing action classification of similar clips.Experimental results show that the proposed method solved the confusion of similar movements well and improved the action recognition accuracy rate.Finally,because the number of samples of the behavior data sets is limited,it will lead to the overfitting phenomenon of the deep learning network.In this paper,the Dropout method is introduced in fully connected layer,which makes the network connection structure sparse,forcing the network learning to be more robust and avoiding the over fitting of a local feature.The optimal Dropout coefficient is determined through experimental analysis,and the human action recognition task is completed.

Keywords/Search Tags:

Human action recognition, Deep learning, Quaternion, Convolutional neural network, Long short-term memory network

PDF Full Text Request

Related items

1	Research On Human Behavior Recognition Algorithm Based On V-DBN And Spital Double-stream LSTM
2	Human Skeletal Action Recognition Based On Deep Learning
3	Research On Human Action Recognition Method Integrating Visual Attention Mechanism And Deep Learning
4	Research On Human Action Recognition Base On Deep Learning
5	Action Recognition Of Human Skeleton Motion Sequences Based On Deep Learning
6	Research On Action Recognition In Videos
7	Research On Human Action Recognition Method Based On Deep Learning
8	Research And Application Of Human Action Recognition Based On Deep Learning
9	Study On The Action Recognition Based On Multi-layer Recurrent Neural Network
10	A Comparative Study Of Deep Neural Network For Mobile Phone Sensor Activity Recognition