Research On Image Retrieval Algorithms Based On Deep Learning

Posted on:2022-05-10

Degree:Master

Type:Thesis

Country:China

Candidate:H T Wang

Full Text:PDF

GTID:2518306734979589

Subject:Electronics and Communications Engineering

Abstract/Summary:

PDF Full Text Request

Image retrieval technology studies how to retrieve the most similar image from the massive image data,which has been applied to the e-commerce,the public security,the urban planning and other aspects.This technology has become an important issue in the field of artificial intelligence research.At the same time,with the continuous development of aerospace technology and remote sensing technology,remote sensing image retrieval has become a hot research topic in recent years.Feature extraction and similarity retrieval are the most critical steps of remote sensing image retrieval technology.When massive high-resolution remote sensing data are being processed,the existing methods do not fully consider the multi-channel characteristics of multi-spectral remote sensing images,resulting in insufficient feature extraction.At the same time,they also fail to fully learn the multi-scale information of features and fail to capture more contextual information,resulting in the loss of feature information.In addition,the existing cross-modal retrieval methods do not learn the consistent correspondence between image and audio well.Therefore,this paper takes advantage of deep learning to learn high-level semantic information and the advantages of fast and low memory of hashing algorithm to conduct in-depth research and exploration on single-modal retrieval of remote sensing images and cross-modal retrieval between image and audio respectively.Specifically,the main work content of this paper includes the following three parts:1)We propose an unsupervised variational auto-encoder hashing algorithm based on multi-channel feature fusion(VAEH).Multi-Channel Feature Fusion(MCFF)is employed to acquire the feature information of image,which fully considers the multi-channel properties of the multi-spectral RS image.In order to enhance the discriminability in the local preservation mapping process,variational construction process and automatic encoder are added into the learning process of hashing function,in which the KL distance of the Variational Auto-Encoder(VAE)is used to constrain the hashing code.Experiments on two large public RS image data sets(i.e.SAT-4 and SAT-6)have shown that our VAEH method outperforms state of the arts.2)We propose an unsupervised variational auto-encoder hashing algorithm based on multiple-scale dilated convolution(MECH).A new context feature enhancement based on multi-scale dilated convolution module is introduced,which is composed of multiple dilated convolution with different dilation rate.This module can increase the receptive field of convolution and gain more semantic information from the feature,which can solve the problem that the multi-scale information of the image is not fully learned and more contextual information cannot be captured.The proposed algorithm is also evaluated on SAT-4 and SAT-6 data sets.The experimental results prove that the algorithm has better retrieval accuracy.3)We propose a cross-modal retrieval algorithm based on deep similarity learning(CRDSL),which extracts the feature information of image and audio respectively,and the model is trained based on the deeply similar network.The proposed algorithm can solve the problem of retrieval between image and audio,and it is tested on PASCAL statement dataset and Wikipedia dataset.The experimental results demonstrate the feasibility of the proposed algorithm.

Keywords/Search Tags:

Deep Learning, Hash algorithm, Image Retrieval, Cross-modal Retrieval, Variational Autoencoder

PDF Full Text Request

Related items

1	Research On The Method Of Cross-modal Image And Text Retrieval Based On Deep Learning
2	Deep Network For Image-Text Cross-Modal Retrieval
3	Research On Algorithm Of Deep Convolution Network And Feature Fusion For Cross Modal Commodity Retrieval
4	Design And Implementation Of A Cross-modal Retrieval System Based On Deep Hashing
5	Research On Cross-modal Retrieval And Recognition Of Visual And Text
6	Research On Cross-modal Retrieval Of Images And Texts Based On Deep Hashing Learning
7	Cross-modal Retrieval Using Deep Neural Network
8	Research On Hierarchical Supervised Cross-modal Image And Text Retrieval Based On Deep Hashing
9	Deep Learning For Cross-Modal Retrieval
10	Research On Deep Hashing Method And Security For Cross-Modal Retrieval