Target Domain Adaptation In Visual Tracking

Posted on:2021-03-14

Degree:Doctor

Type:Dissertation

Country:China

Candidate:S Pu

Full Text:PDF

GTID:1368330605981250

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Existing visual tracking algorithms mainly use visual features of targets from independent frames as target representations.Due to the diversity and complexity of tracking videos in the real world,there is a domain gap in the representation capability of visual features(i.e.,visual features present differ-ent representation capability in the source and target domains),which limits the performances of existing trackers.To alleviate the effects of the domain gap problem,existing trackers perform target domain adaptation from two as-pects:enhancing target representations and performing online learning.These trackers combine different types of visual features to enhance target represen-tations but pay less attention to exploit the spatial structural information and temporal coherence in the target domain effectively.These trackers learn the tracking models online in the target domain to facilitate the feature extractors to adapt to target appearances and improve the utilization ability of the decision-making modules to target representations.However,existing online learning strategies often encounter the overfitting problem,which causes the decision-making modules to fall into local attention and the feature extractors to forget the objectness information.In order to perform effective target domain adapta-tion,this thesis proposes four solutions in terms of enhancing target represen-tations and improving existing online learning strategies:(1)This thesis pro-poses to use the spatial structural information to enhance target representations and proposes a structure and appearance preserving network flow algorithm for multi-object tracking.(2)This thesis proposes to incorporate the tempo-ral coherence to enrich target representations and proposes a recurrent memory activation network for visual tracking.(3)This thesis proposes a reciproca-tive learning algorithm to constrain attention.The deep attentive tracker based on this algorithm can avoid the local attention problem which is caused by the overfitting.(4)This thesis proposes an objectness transfer network for visual tracking.This network incorporates objectness transfer learning to avoid the problem of forgetting the objectness information due to the overfitting.The above algorithms are executed on the standard benchmark datasets for perfor-mance evaluations.Extensive experimental results verify the effectiveness and robustness of the above algorithms.This thesis aims to research target domain adaptation in visual tracking,proposes to use spatial structural information and temporal coherence to enhance target representations,proposes to constrain at-tention and transfer objectness to improve the robustness of models over a long temporal span.The proposed algorithms effectively facilitate target domain adaptation,improve tracking performances,and promote the development of start-of-the-art trackers.

Keywords/Search Tags:

Spatial structural information, Temporal coherence, At-tention, Objectness

PDF Full Text Request

Related items

1	Research On Video Spatial - Temporal Resolution Adjustment
2	Efficient Animation Rendering Method Based On Path Spatio-Temporal Coherence
3	Fast Optical Flow Estimation And Spatio-Temporal Coherence Reconstruction
4	Research On Object Detection Method By CNN Based On Temporal-Spatial Information
5	Holographic spatial and temporal coherence methods for optical discrimination
6	Acquisition,Expression And Mining Of Spatial-temporal Information In Tourism Cross-media Big Data
7	The Role Of Temporal And Spatial Information In Intercepting Moving Targets
8	Research On Modeling And Verification Of CPS Based On Spatial-temporal Petri Net
9	The Video Saliency Extraction Technology Based On Temporal And Spatial Contrast
10	The Data Fusion And Synthesis Algorithm For Multi-view Video Signal