
Deep Network For Image-Text Cross-Modal Retrieval

Posted on: 2020-08-17 | Degree: Master | Type: Thesis
Country: China | Candidate: H Y Peng | Full Text: PDF
GTID: 2428330596964240 | Subject: Pattern Recognition and Intelligent Systems
Abstract/Summary:
Image-text retrieval has gained much attention recently. However, there is a large gap between image features and text features, and querying is time-consuming. To reduce this cross-modal feature gap, we propose an image-text attention block that learns the cross-modal relationship via an elaborately designed attention mechanism.

Image-text hashing has received intensive attention in the image-text retrieval task because of its low computation and storage costs. Most previous cross-modal hashing methods focus on extracting correlated binary codes from pairwise labels but largely ignore the semantic categories of cross-modal data. We propose to embed category information into the hash codes. More specifically, we introduce a semantic prediction loss into our framework to enhance the hash codes with category supervision, preventing the hashing network from linking irrelevant features during retrieval. Our cross-modal network applies the cross-modal attention block to efficiently encode rich, relevant features and learn compact hash codes. Extensive experiments on three challenging benchmarks demonstrate that the proposed method significantly improves retrieval results; on IAPR TC-12, it outperforms the state of the art by a large margin, with a 7.2% increase in MAP. Code sketches of the main components are given after this abstract.

To improve efficiency and reduce the computational cost of inference in deep networks, we further propose to compress the networks with CCP channel pruning. Our results on IAPR TC-12 demonstrate that the parameters of AlexNet can be reduced 20x without loss of performance.
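The attention block described above can be made concrete with a short sketch. The following PyTorch code is a minimal illustration in which a text feature attends over image region features; the dimensions, layer names, and single-query design are assumptions for exposition, not the thesis's exact architecture:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class CrossModalAttention(nn.Module):
        """Illustrative image-text attention: a text feature queries
        image region features (all dimensions are placeholders)."""

        def __init__(self, img_dim=2048, txt_dim=512, common_dim=256):
            super().__init__()
            self.q = nn.Linear(txt_dim, common_dim)  # text -> query
            self.k = nn.Linear(img_dim, common_dim)  # image regions -> keys
            self.v = nn.Linear(img_dim, common_dim)  # image regions -> values

        def forward(self, img_feats, txt_feats):
            # img_feats: (B, R, img_dim) region features; txt_feats: (B, txt_dim)
            q = self.q(txt_feats).unsqueeze(1)             # (B, 1, d)
            k, v = self.k(img_feats), self.v(img_feats)    # (B, R, d)
            attn = F.softmax(q @ k.transpose(1, 2) / k.size(-1) ** 0.5, dim=-1)
            return (attn @ v).squeeze(1)  # (B, d) text-attended image feature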
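Similarly, the category-supervised hashing objective can be sketched as a pairwise code-similarity loss plus a semantic prediction (classification) loss on relaxed codes. The formulation below is an assumed illustration: the loss weighting, code length, and use of multi-label BCE are our guesses, not the thesis's exact losses:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class HashHead(nn.Module):
        """Maps fused features to relaxed hash codes and adds a
        semantic prediction branch for category supervision."""

        def __init__(self, feat_dim=256, code_len=64, num_classes=100):
            super().__init__()
            self.hash = nn.Linear(feat_dim, code_len)
            self.cls = nn.Linear(code_len, num_classes)

        def forward(self, feats):
            h = torch.tanh(self.hash(feats))  # relaxed codes in (-1, 1)
            return h, self.cls(h)             # codes + category logits

    def total_loss(h_img, h_txt, logits_img, logits_txt, sim, labels, alpha=1.0):
        # Pairwise loss: inner products of relaxed codes should match
        # the (B, B) cross-modal similarity matrix sim.
        inner = (h_img @ h_txt.t()) / h_img.size(1)
        pair_loss = F.mse_loss(inner, sim)
        # Semantic prediction loss: multi-label BCE on both modalities.
        sem_loss = (F.binary_cross_entropy_with_logits(logits_img, labels)
                    + F.binary_cross_entropy_with_logits(logits_txt, labels))
        return pair_loss + alpha * sem_loss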
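Finally, channel pruning can be illustrated generically. The sketch below keeps the convolution filters with the largest L1 norms; this magnitude criterion is a stand-in for illustration, not the CCP criterion itself, and in a full network the input channels of the following layer would also have to be sliced accordingly:

    import torch
    import torch.nn as nn

    def prune_conv_channels(conv: nn.Conv2d, keep_ratio: float = 0.5) -> nn.Conv2d:
        """Keep the output channels whose filters have the largest L1 norms."""
        norms = conv.weight.detach().abs().sum(dim=(1, 2, 3))  # one norm per filter
        n_keep = max(1, int(conv.out_channels * keep_ratio))
        keep = torch.topk(norms, n_keep).indices.sort().values
        pruned = nn.Conv2d(conv.in_channels, n_keep, conv.kernel_size,
                           stride=conv.stride, padding=conv.padding,
                           bias=conv.bias is not None)
        pruned.weight.data = conv.weight.data[keep].clone()
        if conv.bias is not None:
            pruned.bias.data = conv.bias.data[keep].clone()
        return pruned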
Keywords/Search Tags: Hash code, cross-modal, retrieval, deep learning, feature extraction