Research On Temporal Action Detection In Video

Posted on:2021-05-26

Degree:Master

Type:Thesis

Country:China

Candidate:C X Xiong

Full Text:PDF

GTID:2428330614960368

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

In recent years,with the development of multimedia technology and the rapid popularization of digital equipment,video data on the Internet has widely exploded.How to quickly,accurately,and efficiently analyze the massive and unorganized video data has become a significant issue for researchers.As an important branch of machine learning,deep learning technology has made a great breakthrough in the field of image classification and detection,and researchers deveoted to introducing neural networks into the field of video understanding,which contains various video tasks such as temporal action detection,action recognition,video summary,and object tracking.This dissertation aims to address the temporal action detection task.Specifically,temporal action detection is an important task in the field of computer vision,which not only need to locate the precise action interval of each action instance in a long untrimmed video,but also to identify the action lable.Temporal action detection algorithms have broad application prospects in many fields such as medical monitoring and national security.The difficulty lies in two aspects: on one hand,the action localization is sensitive to temporal timestamps;on the other hand,the duration of the action instances may vary greatly.It requires the models to accurately capture long time series information.Based on the deep learning technology,this thesis proposes a Temporal Proposal Optimization(TPO)network for temporal action detection.First,TPO utilizes CNN(Convolutional Neural Network)module to capture the local temporal information,and adopts BLSTM(Bidirectional Long Short Term Memory)and CTC(Connectionist Temporal Classification)modules to capture global temporal information.Then,TPO jointly uses these two types of temporal information to construct boundary probability curve,local action probability curve and global action probability curve.Then TPO constructs candidate action proposals based on the boundary probability curves,and fuses the two action probability score curves to optimize and rank the candidate action proposals.Finally,TPO adderesses temporal action detection.TPO has two advantages:(1)TPO effectively learns the long-term dependence in videos by introducing BLSTM and CTC,(2)the above-mentioned probability prediction curves extract sufficient temporal proposal candidates that can effectively capture the large time-span changes of action instances.Experiments show that TPO achieves promising performances in boththe tasks of proposal generation and the temporal action detection.

Keywords/Search Tags:

Temporal action detection, Temporal action proposals, Connectionist temporal classification, Convolutional neural network, Bidirectional long short term memory network

PDF Full Text Request

Related items

1	Research On Connectionist Temporal Classification In Speech Recognition
2	Specific Action Detection Algorithm Based On Deep Learning
3	Temporal Convolutional Network Based Temporal Action Detection
4	The Extraction And Application Of Spatial-Temporal Co-occurrence For Facial Action Units
5	Algorithm Of Complex Action Recognition Based On Temporal Proposals
6	Research On Video Action Recognition Based On Improved Long Short-term Memory Network
7	Research On Temporal Action Detection Based On Neural Network
8	Video Action Recognition Based On Multi-Stream Network Architecture
9	Research On Video Action Recognition Algorithm Based On Spatio-Temporal Features With 2D Convolutional Neural Networks Framework
10	Amdo Tibetan Speech Recognition Based On Deep Neural Network