Clustering Analysis Study Based On Kernel Function

Posted on:2007-10-18

Degree:Master

Type:Thesis

Country:China

Candidate:X L Li

Full Text:PDF

GTID:2178360182960961

Subject:Management Science and Engineering

Abstract/Summary:

PDF Full Text Request

Data mining is an important subject of information field. Data mining technique is widely used in many industries now. Clustering is one of the core technique of data mining. K-means clustering, which is widely used in presently clustering algorithms, is a concise and practical algorithm, but it didn't optimize the features of the data samples and so the result of the clustering is not satisfying if the boundaries of the samples is nonlinear or the samples doesn't subject to Gaussian distribution. The kernel method improves the optimization of the example features and transforms the nonlinear learning problems to linear learning problems by mapping the samples from input space to feature space. By introducing the kernel methods the clustering algorithm can obtain better performance.Aiming on absence of robust performance of presently kernel clustering based on hard partition and limitation of defining the fuzzy parameter in fuzzy clustering algorithm, this thesis presents a new clustering algorithmâ€”kernel k-aggregate clustering algorithm in virtue of aggregate function approximating maximum function, which is both soft clustering and avoidance of choosing fuzzy parameter. Next, focus on amount of categorical valued and mixed valued data in practical use and the limitation of k-prototypes and fuzzy k-prototypes clustering dealing with categorical valued and mixed valued data, this thesis extends the algorithm to categorical valued and mixed valued data by combining categorical value attributes decomposing method with kernel k-aggregate clustering algorithm so as to make the algorithm more widely applicable.At the beginning, this thesis introduces some background information about clustering and relative theory about kernel function. Secondly, the thesis presents presenting kernel k-aggregate clustering algorithm based on explaining the limitation of existing clustering and proves the effectiveness by data experiment using Matlab. The author suggests this algorithm is more stable, robust and accurate in clustering. At last, the author apply the algorithm to segmentation of a certain hairdressing health center and the clustering analysis provides them implication and evidence of different marketing strategies for different segmental markets.

Keywords/Search Tags:

Data mining, Clustering, Kernel function, Aggregate algorithm, Mixed valued attribute

PDF Full Text Request

Related items

1	Research And Application Of Clustering Analysis Algorithm Based On Mixed Attribute Data
2	Research On Clustering Algorithm Of Mixed Data
3	Research On Mixed Kernel Function Based On Support Vector Machine
4	Research On Mixed Attribute Clustering Technology Based On Cluster Center Selection Strategy
5	The Research Of Ant-Based Clustering Algorithm For Data Sets With Mixed Attribute
6	Research And Application Of Several Clustering Algorithms For Mixed Attribute Data
7	Scmi-superviscd K-means Clustering Algorithm In Data Mining
8	The Research Of Fuzzy Clustering For Categorical And Mixed Valued Data
9	Research On Clustering Algorithm For Mixed-type Data Based On K-modes Algorithm
10	Research On Clustering Ensemble Of Mixed Data And Clustering Algorithm Of Mixed Data Streams