Spatiotemporal Multi-task Network For Video Action Detection

Posted on: 2018-11-23

Degree: Master

Type: Thesis

Country: China

Candidate: Y Liu

GTID: 2348330512499464

Subject: Computer Science and Technology

Abstract/Summary:

Deep learning techniques have recently brought remarkable progress in video action detection. However, for action detection in real-world untrimmed videos, the accuracy of most existing approaches is still far from satisfactory because temporal action localization remains difficult. To tackle this challenge, we propose a spatiotemporal, multi-task, 3D deep convolutional neural network that detects (and temporally localizes) actions in untrimmed videos. First, we introduce a fusion framework that aims to extract video-level spatiotemporal features in the training phase, and we demonstrate the effectiveness of these video-level features by evaluating our model on the video action recognition task. Then, under the fusion framework, we propose a spatiotemporal multi-task network with two sibling output layers, one for action classification and one for temporal localization. To obtain precise temporal locations, we present a novel temporal regression method that refines the proposal window containing an action. Meanwhile, to better exploit the rich motion information in videos, we introduce a novel video representation, interlaced images, as an additional network input stream. As a result, our model outperforms state-of-the-art methods for both action recognition and detection on standard benchmarks.
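
The abstract does not say how the temporal regression is parameterized. As a rough illustration only, the sketch below assumes a center/length offset encoding borrowed from bounding-box regression in object detectors, which may differ from the method in the thesis. The function names `refine_window` and `temporal_iou` and the offset names `d_center` and `d_length` are hypothetical.

```python
import math


def refine_window(start, end, d_center, d_length):
    """Apply predicted offsets to a temporal proposal [start, end].

    Assumed encoding (not taken from the thesis): d_center shifts the
    window center by a fraction of the window length, and d_length
    rescales the window length by exp(d_length).
    """
    length = end - start
    center = start + 0.5 * length
    new_center = center + d_center * length
    new_length = length * math.exp(d_length)
    return (new_center - 0.5 * new_length, new_center + 0.5 * new_length)


def temporal_iou(a, b):
    """Intersection over union of two temporal windows given as (start, end)."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


# A proposal covering seconds 10-20, shifted right by 10% of its length.
proposal = (10.0, 20.0)
refined = refine_window(*proposal, d_center=0.1, d_length=0.0)
ground_truth = (11.0, 21.0)
print(refined, temporal_iou(refined, ground_truth))
```

In a two-head setup like the one described, the classification head would score the action class of each proposal, while the regression head would supply offsets such as these to tighten the window around the action.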

Keywords/Search Tags:

spatiotemporal features, multi-task, deep learning

Related items

1	Research On Algorithm Of Pedestrian Pose Estimation And Re-recognition Based On Deep Learning
2	Research On Image Steganalysis Method Based On Deep Learning
3	Research On Visual Salient Object Detection Via Graph Fusion Of Spatiotemporal Features
4	Text Independent Speaker Recognition Based On Deep Learning Framework
5	Group Sparse Multi-task Learning Method And Its Application
6	Research On Action Recognition Algorithm Based On Spatiotemporal Modeling And Its Application
7	Spatiotemporal Deep Neural Network For Video Salient Object Detection
8	Video Action Recognition Based On Spatiotemporal Grouping And Cooperative Network
9	Gait Recognition Based On Joint Spatiotemporal Features
10	Multi-task Learning Based Classification Algorithm For Balanced Image Data And Unbalanced Temporal Data