Image Representation On Set In Image Classification

Posted on:2017-06-21

Degree:Master

Type:Thesis

Country:China

Candidate:P P Peng

Full Text:PDF

GTID:2348330518472255

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

The essential problem of image classification is to identify objects or targets in the image,which requires accurate description of the visual information of the image. Local information on the background details, lighting and other external conditions has good robustness and it become the mainstream of feature representation, especially Scale Invariant Feature Transform and various improved algorithm based on Scale Invariant Feature Transform appears. However, the number of local features of different images is often different, not suitable for subsequent operation such as classification and retrieval directly on local features,so we need a unified set of representation in the local features of the image collection.The set representation use a certain method to operate all the local feature points extracted from the image, forming a vector to represent the image. The main work and contributions in this paper are as follows:First, in this paper, we introduce details of three methods of set representation, named Bag of Words,Efficient Match Kernels and Vector of Locally Aggregated Descriptors. And a large number of experiments are done based on these methods on selected database, verify the three set representation classification performance.Second,we test the applicability of the Vector of Locally Aggregated Descriptors latest proposed with different clustering algorithms and clustering center numbers. According to the various of the numbers of cluster centers selected way and assignment of local features, we select K-means, Affinity Propagation and Gaussian Mixture Model, which are three kinds of clustering algorithm.Finally, we propose an improved method of Vector of Locally Aggregated Descriptors. In this paper, we study the normalization and pooling on the effect and validity of Vector of Locally Aggregated Descriptors. We choose power-law and L2 norm in the Normalized method, and use sum pooling, average pooling and Generalized Max Pooling in the pooling method.PPMI, Caltech-101 and Scene-15 are three databases about action, objects and scenes respectively. We verify the effectiveness of the above methods on the three databases.

Keywords/Search Tags:

Image categorization, Set representation, Bag of Words, Efficient Match Kernels, Vector of Locally Aggregated Descriptors

PDF Full Text Request

Related items

1	Research On Image Retrieval Algorithms Based On Separable Visual Vocabulary And Multi-feature Representation
2	Compact Aggregated Descriptors For Mobile Visual Search
3	On Study Of The Feature Extraction Of Human Behavior
4	Research On Generative Adversarial Network Of Facial Image Generation Based On Self-encoder Structure
5	Research And Application On Bag Of Features Based On Spatial Fisher Kernel Framework
6	Research On Image Sentence Annotation Based On Feature Learning And Tag Refinement
7	Design And Implementation Of Short Video Classification System
8	The Design And Implementation Of Content Management System Based On CBIR
9	Research On Locally Features Aggregating And Indexing Algorithm In Large-scale Image Retrieval
10	Research On Key Technologies Of Video-based Human Face And Action Recognition