Research On Cross-Modal Hashing Based On Deep Learning

Posted on:2021-02-04

Degree:Master

Type:Thesis

Country:China

Candidate:J X Qian

Full Text:PDF

GTID:2428330614465990

Subject:Information security

Abstract/Summary:

Due to the explosive growth of multimedia data on the Internet,there is an increasing demand for people to be able to retrieve multi-modal data quickly and easily.Cross-modal retrieval has become a current research hotspot.In order to meet the requirements of low storage cost and high query speed in practical applications,hashing has attracted much attention in the field of cross-modal retrieval.It maps high-dimensional multi-modal data to a common Hamming space for cross-modal retrieval.This article mainly studies how to use deep neural networks to project multi-modal data to a common Hamming space,while reducing coding errors,mining the semantic information of muti-modal data and narrowing the difference between cross-modal data so that the performance of using image retrieves text or using text retrieves image better.In order to achieve better cross-modal retrieval results,this paper proposes the following three hashing techniques for cross-modal retrieval based on deep learning:1.In order to reduce coding errors,Deep Ranking-based Hashing for Cross-Modal Retrieval(DRH)is proposed.This method uses deep neural networks to integrate feature learning and hash code learning into an end-to-end framework,and uses pair-wise loss to reduce the semantic gap between different modalities.In terms of hash functions,this method uses a ranking-based hash function to reduce encoding errors,while using the Adaboost to reduce redundancy between hash codes.2.In order to preserve richer semantic information of cross-modal data,Deep Triplet-based Hashing for Cross-Modal Retrieval(DTH)is proposed.This method utilizes the triplet labels,which describes the relative relationships among three instances as supervision in order to capture more general semantic correlations between cross-modal instance,and uses regularity to solve the problem of hash code discontinuity and maximize the information contained in the hash code.3.In order to narrow the heterogeneity gap between cross-modal data,Generative Adversarial Networks for Cross-Modal Hashing(GANCMH)is proposed.This method models the joint distribution over the data of different modalities by generative adversarial networks.The intermodality and intra-modality correlation can be explored simultaneously in generative and discriminative models.Both of them beat each other to promote cross-modal correlation learning.In this paper,the cross-modal retrieval experiments are carried out on MIRFlickr-25 K,Wiki and NUS-WIDE-10 K.The results show that the three algorithms mentioned in this paper have certain advantages,validity and feasibility compared with the other five state-of-the-art algorithms under the same experimental settings.

Keywords/Search Tags:

Hashing, Neural Networks, Adaboost, Triplet, Generative Adversarial Networks

Related items

1	Research On Perceptual Hashing Algorithm Based On Generative Adversarial Network
2	Research On Quality Improvement Technology For Generative Adversarial Network
3	Research On Image Retrieval Based On Hashing And Generative Adversarial Networks
4	Study And Application On The Structure Improved Deep Convolutional Generative Adversarial Networks
5	Research On Generative Adversarial Networks-Based Terrain Mapping
6	Research On Dehazing Algorithms Based On Generative Adversarial Networks
7	3D Object Reconstruction With Generative Adversarial Networks
8	Research On Generation Model Of Perceptually Similar Image Classification Adversarial Example
9	Research On Face Sketch Synthesis Algorithm Based On Generative Adversarial Networks
10	Research On Visual Neural Networks And Their Applications In Image Hashing