Research And Implementation Of Key Technologies For Large Scale Multi-modal Image Retrieval

Posted on:2018-12-10

Degree:Master

Type:Thesis

Country:China

Candidate:R Yang

Full Text:PDF

GTID:2348330512986435

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the popularity of the Internet and the progress of information technology,compared with the text,people are increasingly inclined to use images to express,transfer and gain information.As a result,the number of images on the Internet presents an explosive growth trend;the application of large-scale image data is facing great challenges.How to retrieve similar images from massive image data quickly and accurately is one of the hot topics in the field of image retrieval.Machine learning is one of the most important tools for dealing with large-scale data.Hashing learning has become a hot topic in recent years because it can reduce the storage space of data and its computational performance.Hashing technique maps high-dimensional data to low-dimensional Hamming space and obtains compact binary hash codes,which can reduce the storage space of data.At the same time,hash codes can keep the similarity between original spatial data.When searching,we can quickly get the similarity between the data by calculating the Hamming distance between hashing codes,thereby retrieval efficiency can be increased.In this paper,a Discrete Multi-view Hashing(DMVH)method based on Spectral Hashing for multimodal image retrieval is proposed.This method can improve the retrieval performance by using multi-modal feature-rich information.Firstly,the multimodal features(such as GIST,SIFT)are extracted and preprocessed to make the multimodal features consistent.Then,a new method to construct the similarity matrix is proposed,which not only preserves the local similarity of data,but also preserves the semantic similarity between the data.While preserving the similarity between data,the mapping matrix is used to map the data of the high-dimensional space to the low-dimensional space to obtain the hash code.As the hashing code is discrete,it is difficult to optimize the method directly.Based on this,two auxiliary variables are introduced,so that in the process of optimization,the discrete conditions are not relaxed to obtain higher quality hash codes.In this paper,the validation experiments compared with several advanced hashing methods are performed on three public datasets.Experimental results show that the proposed method outperforms the hashing method we compared with.Finally,based on the method proposed in this paper,a multi-modal image retrieval system based on discrete hashing is designed and implemented.The system is mainly to provide users with "image search image" function.After features are extracted,the user uses the learned hash function to map image to hash code.And then just calculate Hamming distance between the learned hash code and hash code in the database.The image corresponding to the smallest Hamming distance will be returned to the user as search result.

Keywords/Search Tags:

Hashing, Multi-modal, Image Retrieval

PDF Full Text Request

Related items

1	Research On Multi-modal Hashing Methods For Efficient Multimedia Retrieval
2	Research And Implementation Of Key Technologies For Large Scale Multi-modal Image Retrieval
3	Research On Multi-modal Multi-label Hashing Methods For Large Scale Data Search
4	Cross-modal Retrieval And Annotation Based On Hashing Learning Method
5	Research On Semantic-Guided Hashing Algorithms For Multimedia Data Retrieval
6	Heterogeneous Graph Hashing For Cross-Modal Audio-Image Retrieval
7	Image-text Cross-modal Retrieval Based On Deep Hashing Method
8	Research On Cross-modal Retrieval Method Based On Deep Semantic Hashing
9	Research On Cross-Modal Retrieval Method Based On Asymmetric Hashing
10	Instance-Aware Image Retrieval Technology Based On Multi-Task CNN