Research Of Content-based Image Copy Detection

Posted on:2016-03-29

Degree:Master

Type:Thesis

Country:China

Candidate:L Luo

Full Text:PDF

GTID:2308330467982273

Subject:Computer technology

Abstract/Summary:

With the development of the Internet and the spread of image editing software, more and more copies of images appear online and spread quickly, which leads to a series of problems such as infringement, counterfeiting, and redundant database storage. Near-duplicate image detection is a key topic in image research. Its goal is to detect near-duplicate images in the queried image library, in other words, the images that are highly similar to the query images. Near-duplicate images are produced by operations such as changing the image size or contrast, rotating, cropping, inserting text, and adding noise. Near-duplicate image detection has wide applications, including image copyright protection, image forgery detection, video copy detection, and image query.

The difficulty of near-duplicate image detection lies in extracting and matching image features more efficiently. To address the shortcomings of low efficiency and accuracy, a new detection algorithm based on MSER, SURF, and the spatial pyramid model is proposed. First, the MSER and SURF features of the images are extracted. Second, all the features are clustered by the k-means algorithm to form a visual dictionary. Finally, the spatial pyramid model is used to integrate spatial information into the images' feature information; as a result, the recall and precision of near-duplicate image detection are improved. The experimental results show that this algorithm is feasible for large-scale near-duplicate image detection.

The traditional bag-of-words (BoW) model uses the k-means algorithm to cluster the images' features, which leads to synonymy and ambiguity in the visual vocabulary. The redundant visual vocabulary is also not suited to dynamic expansion, so we use a Gaussian mixture model (GMM) based on the k-means algorithm to cluster the images' features in order to generate a more reliable visual vocabulary. Then, to obtain the spatial information of the targets in the image scene and the discriminative information of latent topics, we use probabilistic latent semantic analysis (PLSA) to integrate more image information into the BoW model and thereby increase the accuracy of copy detection. The experimental results show that this algorithm is feasible.
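
The first pipeline described above (cluster local descriptors into a visual dictionary, then build spatial pyramid histograms) can be sketched roughly as follows. This is a minimal illustration, not the thesis implementation. It assumes the MSER/SURF descriptors and their keypoint positions have already been extracted, and it leaves out the per-level weights normally used in spatial pyramid matching. The function names (kmeans, nearest, spatial_pyramid, histogram_intersection) are hypothetical, and comparing two images by histogram intersection is an assumption, since the abstract does not state the similarity measure.

```python
import random

def nearest(p, centroids):
    """Index of the centroid closest to descriptor p (squared Euclidean)."""
    return min(range(len(centroids)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])))

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; the k centroids act as the visual words."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def spatial_pyramid(keypoints, descriptors, centroids, width, height, levels=2):
    """Concatenate per-cell word histograms over 1x1, 2x2, ... grids.

    Assumption: levels are unweighted and the result is L1-normalised.
    """
    k = len(centroids)
    words = [nearest(d, centroids) for d in descriptors]
    feature = []
    for level in range(levels):
        cells = 2 ** level
        hist = [[0] * k for _ in range(cells * cells)]
        for (x, y), w in zip(keypoints, words):
            cx = min(int(x * cells / width), cells - 1)
            cy = min(int(y * cells / height), cells - 1)
            hist[cy * cells + cx][w] += 1
        for cell in hist:
            feature.extend(cell)
    total = sum(feature) or 1
    return [v / total for v in feature]

def histogram_intersection(a, b):
    """Similarity in [0, 1] for two L1-normalised pyramid features."""
    return sum(min(x, y) for x, y in zip(a, b))
```

With a dictionary of k words and two levels, each image is described by a vector of length 5k (one 1x1 cell plus four 2x2 cells). A candidate could then be flagged as a near-duplicate when its intersection with the query exceeds a chosen threshold.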

Keywords/Search Tags:

near-duplicate image detection, visual vocabulary, Bag of Words, Spatial Pyramid Model, Gaussian Mixture Models, Probabilistic Latent Semantic Analysis

Related items

1	Research On Near-Duplicate Image Detection And Its Application
2	Research On Image Semantic Representation And Metric Learning Technologies
3	Semantic-based Image Multiclass Annotation
4	Audio Scene Recognition Based On Probabilistic Latent Semantic Analysis
5	Research On Local Semantic Concept Representation Based Image Scene Classification Technology
6	PLSA Model Based Detection Of Porn Pictures
7	Researches On Linear, Kernel Gaussian Models And Their Mixtures
8	Video Semantic Detection Method Based On Gaussian Mixture Model Visual Feature
9	Improvement Of Bag-of-visual Words Model And Its Application In Image Classification
10	Research On Middle Semantic Representation Based Image Scene Classification