Research On Spectral Clustering Algorithm And Its Application

Posted on:2016-07-01

Degree:Master

Type:Thesis

Country:China

Candidate:X H Zhao

Full Text:PDF

GTID:2348330542476095

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Spectral graph theory is the foundation of Spectral clustering algorithm,which can be used as a segmentation method for graph theory,because of its strong mathematical theory basis and broad applicability,the spectral clustering is the hot topic in the field of international pattern recognition currently.Spectral clustering data set can be changed from the original space conversion through the Gauss kernel function features,after the conversion the data in the original space will be linearly separable,in the obtained low dimensional linear space,the traditional clustering method such as k-means can be used.However,the cluster results of spectral clustering algorithm directly depends on the similarity function measure,in the existing spectral clustering algorithm,often using Gauss kernel function as the similarity measure function,but the measure methods have defects in the aspect of representing the spatial characteristics of data,which can not reflect the distribution characteristics of complex data space.Except the above,in the process of the implementation of the algorithm requires not only to store all the data in the similarity matrix resulting high time consumption,but also the space overhead of matrix eigenvalue decomposition is relatively high,the applicationis of large-scale data sets can not to obtain the promotion in the future.In view of the above problems,the research contents of this paper include the following aspects:Fist,to choose a new similarity measure method,we improved the manifold distance similarity which will be able to reflect the data the local and global properties in the complex space,and the robust method as a weighting parameter form the new similarity measure,The improved manifold distance measure can fully reflect the similarity of the multi-scale clustering data between pairs of points,for identify the various data sets have the applicability in broad sense.Then,based on the new similarity measure proposed a new spectral clustering.The algorithm can be used to void the Sensitive issues because of the Gauss Kernel function parameters,and the robustness of the parameter take full account of the neighborhood information effectively prevent noise data resulting error clustering and with good stability.Through with the Gauss kernel function,the common manifold distance similarity measurement comparison,the experimental results display that,the proposed algorithm has a better performance.Finally,improved the application of spectral clustering algorithm on large scale data set,this paper proposes the DBSCAN-based approximate algorithm which is based on the analysis of existing approximate measures and combined the thought of DBSCAN algorithm,proposed the approximate spectral clustering based on DBSCAN,The algorithm avoids the problem of noise sensitive points,and reduces the time complexity.Compared with other approximate algorithms,the algorithm can maintain higher clustering accuracy in most data sets.

Keywords/Search Tags:

spectral clustering, similarity function, manifold distance, large-scale data sets, approximate spectral clustering

PDF Full Text Request

Related items

1	Research On Spectral Clustering Of Large Scale Complex Data
2	Research And Application Of Spectral Clustering Algorithm Based On Manifold Distance Kernel
3	Research And Application Of Clustering Algorithms For Large Scale Data Sets
4	Study On Clustering For Large Data Sets And Its Applications
5	Design And Implementation Of Spectral Clustering Algorithm For Large Scale Data
6	Research And Implementation Of Parallel Clustering Algorithm Based On Approximate Spectrum Hadoop MapReduce
7	Research On Spectral Clustering Methods For Large Scale Datasets
8	Research On Two Types Of Spectral Clustering Algorithms Based On Sparse Similarity Matrix
9	Research Of Hybrid Manifold Clustering Algorithms Based On Spectral Clustering
10	Research And Application Of Several Clustering Algorithms For Mixed Attribute Data