
Research On Video-based Person Re-identification

Posted on: 2021-07-16
Degree: Master
Type: Thesis
Country: China
Candidate: Z Yang
Full Text: PDF
GTID: 2518306503491094
Subject: Electronics and Communications Engineering
Abstract/Summary:
In recent years, with the construction of the National Skynet Project and growing public concern for safety, person re-identification has been widely applied in video surveillance, smart security, criminal investigation, and other fields. Driven by these urgent practical needs, person re-identification technology has developed rapidly. Video-based person re-identification exploits richer semantic and motion information than its image-based counterpart and has gradually attracted more research attention. Mapping a pedestrian sequence to a single feature representation is the key problem in video-based person re-identification.

Average pooling and recurrent neural networks are the classic approaches to aggregating frame-level features, but they often struggle with the spatial misalignment caused by occlusion, posture changes, and camera viewpoints. We therefore introduce the Non-local mechanism to adaptively learn the spatiotemporal attention within a sequence. At the same time, we use a feature erasing mechanism to build a local feature learning branch, so that the network attends to local and global features simultaneously, which improves the discriminability of the overall representation. The appearance model based on Non-local attention and feature erasing achieves mAP = 81.9% and rank-1 = 87.0% on the large-scale public MARS dataset, which is comparable to state-of-the-art methods.

In addition, in practical applications, existing methods based on appearance features often perform poorly when pedestrians change clothes. We therefore introduce a human biometric characteristic, gait, as auxiliary information. The proposed network combines appearance and gait features and outperforms either single feature in a variety of scenarios. In particular, on the CL (change clothing) subset of the CASIA-B dataset, rank-1 accuracy is improved significantly to 75.95%, surpassing either single feature by more than 20%. In the fusion network, we make full use of the pedestrian mask: it serves not only as the input to the gait feature extraction network but also as spatial attention in the appearance model to construct a foreground appearance feature branch. Extensive ablation experiments on the Mask-MARS and CASIA-B datasets verify the performance of the proposed fusion network.
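For illustration, below is a minimal sketch of the kind of non-local block that can aggregate frame-level feature maps of a clip into a single sequence-level descriptor. It assumes PyTorch as the framework; the class name `NonLocalBlock`, the channel counts, the reduction ratio, and the final average pooling are illustrative assumptions for this example, not the exact configuration used in the thesis.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NonLocalBlock(nn.Module):
    """Embedded-Gaussian non-local block over all T*H*W positions of a clip."""
    def __init__(self, channels, reduction=2):
        super().__init__()
        inner = channels // reduction
        self.theta = nn.Conv3d(channels, inner, kernel_size=1)   # query embedding
        self.phi = nn.Conv3d(channels, inner, kernel_size=1)     # key embedding
        self.g = nn.Conv3d(channels, inner, kernel_size=1)       # value embedding
        self.out = nn.Conv3d(inner, channels, kernel_size=1)     # restore channel dim

    def forward(self, x):                                # x: (B, C, T, H, W)
        b, c, t, h, w = x.shape
        q = self.theta(x).flatten(2).transpose(1, 2)     # (B, THW, C')
        k = self.phi(x).flatten(2)                       # (B, C', THW)
        v = self.g(x).flatten(2).transpose(1, 2)         # (B, THW, C')
        attn = F.softmax(q @ k, dim=-1)                  # pairwise spatiotemporal attention
        y = (attn @ v).transpose(1, 2).reshape(b, -1, t, h, w)
        return x + self.out(y)                           # residual connection

# Usage: aggregate an 8-frame clip of feature maps into one sequence-level vector.
feats = torch.randn(4, 256, 8, 16, 8)                      # (batch, channels, frames, H, W)
clip_feat = NonLocalBlock(256)(feats).mean(dim=(2, 3, 4))  # (4, 256) descriptor
```

The softmax over all pairwise position similarities plays the role of the adaptively learned spatiotemporal attention described above; the feature-erasing local branch and the mask-guided appearance/gait fusion discussed in the abstract are omitted from this sketch for brevity.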
Keywords/Search Tags: Video-based Person Re-Identification, Non-local Attention, Feature Fusion