Improvement Of Bag-of-visual Words Model And Its Application In Image Classification

Posted on:2018-11-10

Degree:Master

Type:Thesis

Country:China

Candidate:H Yang

Full Text:PDF

GTID:2348330536480346

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Image classification technology is one of the most important and challenging research topics in computer vision,which has been widely used in many fields,such as image retrieval,video retrieval,medical application et al.In recent years,many scholars have made a deep research on image classification technology,and the Bag-Of-Visual words(BOV)model is one of the most successful and widely used image classification models.However,there are still some shortcomings in the traditional BOV model,this paper will improve it from the following aspects:1.Concerning the problem that the scale of visual dictionary is too large and the discrimination ability of visual dictionary is poor in the BOV model,a Weighted-Maximal Relevance-Minimal-Semantic similarity(W-WR-WS)criterion was proposed to optimize visual dictionary.Firstly,the Scale Invariant Feature Transform(SIFT)features of images were extracted,and the K-Means algorithm was used to generate a original visual dictionary.Secondly,the correlation between visual words and image categories and semantic similarity among visual words were calculated,and a weighted parameter was introduced to measure the importance of the correlation and the semantic similarity in image classification.Finally,based on the weighing result,the visual words which correlation with image categories was weak and semantic similarity with among visual words was high were removed,which achieved the purpose of optimizing the visual dictionary.The experimental results show that using the optimized visual dictionary to image classification can improve the performance of image classification.2.In order to solve the problems that the lack of the spatial distribution information of the local features and the poor semantic property of image classification in the BOV model,an image classification method based on Probability Latent Semantic Analysis(PLSA)and visual phrases was proposed.Firstly,the visual dictionary was optimized by using the W-MR-MS criterion,and the visual phrases were established on the basis of the optimized visual dictionary which increased the spatial distribution information of local image features.Then,a new semantic visual dictionary that combined with visual phrases and visual words in the optimized visual dictionary was constructed.Finally,PLSA model was used to dig out more semantic latent themes based on the semantic visual dictionary.The experimental results show that the combination of visual phrases and PLSA modelcan improve the performance of image classification.

Keywords/Search Tags:

Image classification, Bag-Of-Visual words model, Feature extraction, Probability Latent Semantic Analysis(PLSA), Visual phrases

PDF Full Text Request

Related items

1	PLSA Model Based Detection Of Porn Pictures
2	The Improvement Of Bag-of-visual-words Model And Its Application Research In Images Classification
3	Research On Middle Semantic Representation Based Image Scene Classification
4	Research Of Middle-level Semantic Based Image Scene Classification Algorithm
5	Research On Local Semantic Concept Representation Based Image Scene Classification Technology
6	Visual Dictionary Based Study On Plsa Classifier
7	Research On Image Semantic Representation And Metric Learning Technologies
8	Research On Semantic Model Based Image Classification Method
9	Research On Bag Of Visual Words Based Image Classification
10	Research On Scene Classification Technologies With The Local Region Description Feature And Probabilistic Latent Semantic Analysis Model