Research On Behavior Recognition Algorithm Based On RGB-D And Deep Learning

Posted on:2020-09-26

Degree:Master

Type:Thesis

Country:China

Candidate:Y Zhang

Full Text:PDF

GTID:2428330590952974

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Human behavior recognition algorithms have great research significance and industrial value in virtual reality,intelligent monitoring and unmanned driving.The traditional behavior recognition algorithms are based on the color pictures,which manually designs the feature extraction to extract the shape and color features,establishes feature descriptor and selects the classifier for classification.It will lead to two problems.Firstly,the color pictures have less information entropy,which makes extracted features not to represent the behavior well.At the same time,the generalization of background occlusion and viewing angle changes is poor.Secondly,the traditional feature extraction algorithms are difficult to design and its behavior recognition rate is not high.The new RGB-D data includes color pictures,depth pictures,and skeleton pictures,which are rich in information entropy.But RGB-D multi-source information fusion is still a difficult problem.In addition,experiments have shown that convolutional neural networks have achieved great success in image classification.Therefore,this paper proposes a human behavior recognition algorithm based on RGB-D and deep learning.In order to solve the problem that traditional algorithms design feature extraction difficultly,Faster RCNN is used to extract features and classification.By analyzing the framework of Faster RCNN,the human behavior recognition rate is improved by data enhancement,deleting a layer of the fully connected layer and Dropout strategy.We use the complementarity between RGB-D information to solve the problem that RGB-D multi-source information fusion is difficult.Specifically,the interested region of color pictures is located by using depth pictures and skeleton pictures,which avoids the interference from unrelated regions.In summary,the human behavior recognition optimization algorithm based on RGB-D and Faster RCNN is proposed.Experimental results show that the average recognition rate of the proposed algorithm on the UTKinect dataset has reached 94.70%,which is better than other algorithms and verifies the advantages of the algorithm.In order to solve the problem of less information entropy in color pictures and the poor generalization in the background occlusion and the viewing angle changes,Two Stream CNN is used to fuse the features of the depth pictures and the skeleton pictures.Because depth pictures and skeleton pictures are robust to background occlusion and viewing angle changes.Two fusion strategies are proposed in the network,which fuses features respectively in the fully connected layer and Softmax layer to study the impact of different multi-source information fusion strategies on behavior recognition.The average recognition rates of the two different fusion strategies on the UTKinect dataset are96.20% and 95.70%,respectively.The average recognition rates on the SBU Kinect dataset are 92.70% and 92.10%,respectively,which are better than other algorithms and verifies the robustness of the algorithm.

Keywords/Search Tags:

RGB-D data, deep learning, multi-source information fusion, SBU Kinect dataset

PDF Full Text Request

Related items

1	Research On Multi-source Information Fusion Recommendation Algorithm Based On Deep Learning
2	Research On Prediction And Decision-making Methods Based On Multi-source Information Fusion
3	Research On Data Fusion Method Based On Deep Learning
4	Research On The Recommendation Algorithm Of Fusion Of Multisource Heterogeneous Data Based On Deep Learning
5	Joint Representation Learning Based On Multi-source Data And Its Management Application
6	Research On Image Semantic Segmentation Based On Deep Learning
7	Research On The Application Of Multi-source Data Fusion And Deep Learning Technology In Stock Market
8	Multi-source Sensor Data Fusion And Its Applications In The Target Detection
9	Research On Key Technologies Of Multi-Source Heterogeneous Data Fusion
10	Research On Stock Trading Based On Deep Reinforcement Learning