Semantics Consistent Adversarial Cross-Modal Retrieval

Posted on:2020-08-28

Degree:Master

Type:Thesis

Country:China

Candidate:R S Xuan

Full Text:PDF

GTID:2428330596980005

Subject:Computer Science and Technology

Abstract/Summary:

Cross-modal retrieval refers to a retrieval method that returns a modal query word and returns other modal data related to its semantics.The biggest difficulty in cross-modal retrieval is the “heterogeneity” problem caused by the different representations and distributions between different modalities,which leads to the fact that the similarities between different modal samples cannot be directly measured.This thesis solves the problem of “heterogeneity” by mapping different modal samples into common subspaces and then measuring the similarity of different modal samples in the common subspace.Specifically,this thesis did the following:For samples with labels,a cross-modal retrieval method based on semantic consistent adversarial network is proposed.Specifically,under the framework of adversarial learning: 1)finding the class center of each semantic class with different semantic tags in different modalities;2)under the same modal,minimizing the Euclidean distance of the same semantic sample with the class center;3)Under different modals,minimizing the Euclidean distance with the same semantic class center;4)for class centers,minimizing the distance between the class center and other modal and samples with the same semantic class.Comprehensive experiment were carried out on the Wikipedia dataset and the NUSWIDE-10 k dataset.The experimental results show the effectiveness of the proposed method in cross-modal retrieval.For parts of samples with labels,a semi-supervised adversarial cross-modal retrieval method based on graph constraints is proposed.Specifically,under the framework of adversarial learning: 1)for unlabeled samples,constructing a graph based on Euclidean distances between samples,expecting common subspace representations of similar samples to be similar;2)learning from labeled samples using traditional adversarial cross-modal retrieval methods;3)the unlabeled sample and the labeled sample learn the common subspace representation together in the anti-learning framework.The experimental results on the Wikipedia dataset and the NUSWIDE-10 k dataset show that the proposed method achieves a comparable result to that of the supervised learning retrieval method.

Keywords/Search Tags:

Cross-modal retrieval, Adversarial learning, Semantic consistency, Graph constraints, Common subspace

Related items

1	Research On Cross-modal Retrieval Algorithm With Semantic Shared Subspace
2	Research On Cross-Modal Retrieval Based On Deep Semantic Analysis
3	Cross-modal Retrieval Research Based On Correlation Analysis And Structure Preserving
4	Category Alignment Adversarial Learning And Fine-Grained Supplementary Feature Learning For Cross-modal Retrieval
5	Study On Deep Cross-modal Retrieval With Preserving Semantic Ranking Structure
6	Semantic Alignment-based Robust Cross-modal Retrieval
7	Research On Cross-modal Hash Retrieval Based On Common Subspace Learning
8	Research On Cross-modal Retrieval For Semantic Consistency Learning
9	Research On Semantic Preserving And Correlation Mining For Cross-modal Hashing
10	Research On Relevance Computation Of Cross-modal Retrieval