Research On Video Classification Method Based On Deep Learning Network

Posted on:2021-05-01

Degree:Master

Type:Thesis

Country:China

Candidate:S Y Peng

Full Text:PDF

GTID:2518306548956399

Subject:Control theory and control engineering

Abstract/Summary:

PDF Full Text Request

To solve the problem of complex recognition and low accuracy in video classification,this paper proposes a three-stream deep learning network framework that combines spatialtemporal-relational feature extraction with feature aggregation and fusion mechanism.We introduce the relational network into a two-stream convolutional neural network,focus on solving the problems such as poor stability and insufficient semantic understanding in the video feature extraction.At the same time,a feature aggregation method based on Vector of Locally Aggregated Descriptors is proposed to aggregate features,which reduces intra-class differences and realize effective utilization of the features.Furthermore,a decision-level fusion mechanism based on improved Softmax logistic regression function is proposed to adopt a three-stream feature extraction framework,which can preserve the information of the images in different subnetworks.So that the network can reflect the information contained in the video more realistic,which significantly reduces the probability of misclassification of actions by a single sub-network,and the video information can be expressed and recognized well.Finally,in addition to verifying the performance of a threestream convolutional neural network on HMDB51 and UCF-101 standard datasets,it’s also proved on a daily student action video dataset,which is collected from actual campus monitoring scenarios by us.It demonstrates that the three-stream convolutional neural network has not only has an accurate classification effect on the action of complex data sets but also applies to the action classification of daily campus students,which provides a strong scientific and technological support for campus security students.

Keywords/Search Tags:

video classification, temporal-spatial-relational feature extraction, three-stream deep learning framework, feature aggregation, decision-level fusion

PDF Full Text Request

Related items

1	Video Spatial-Temporal Features Collaborative Learning And Fusion Methods
2	Research On Soft Sensor Modeling Method Based On Spatial-temporal Feature Fusion Deep Neural Network
3	Research On Video Action Recognition Method Based On Spatial-Temporal Feature Fusion And Deep Learning
4	The Research On Spatial And Temporal Fusion Feature Extraction Algorithm Based On Video
5	Research On Deep Learning Based Video Classification Technologies
6	Feature Representation And Re-identification Of Person
7	The Design And Realization Of Spatial-Temporal Feature Extraction And Recognition Algorithm For Human Action Analysis
8	Research On Video Captioning Methods Based On Visual Text Association And Multimodal Feature Fusion
9	Multi-level Feature Extraction For Image Representation And Its Application
10	Exploiting Spatio-Temporal Fusion And Perception For Video Object Segmentation