Research On An Improved Clustering Algorithm Of K_means

Posted on:2011-03-05

Degree:Master

Type:Thesis

Country:China

Candidate:Z G Liu

Full Text:PDF

GTID:2178330332960375

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the rapid development of network and the steady progress of information technology, a variety of data expand rapidly. How to find useful information and to classify them has been becoming increasingly urgent. The appearance of data mining technologies makes it possible to process large amounts of data. Clustering is one of important data mining technology and has been becoming a research hotspot.In this paper, two improving points of the k_means clustering algorithms are proposed to solve the limitation of its sensitivity to outliers and the selection of initial points. Firstly, after outlier detection problem of the k_means algorithm is studied deeply, a grid-based data pre-processing algorithm is proposed. This method detects outliers by dividing the dataset grid. Secondly, on the base of analyzing the selection of initial points, an algorithm of initial point selection is proposed. This method directly chooses the initial point based on the grid-base data's pre-processing, so that the initial point is more reasonable and closer to the actual cluster center. Finally, during the process of k_means clustering, those two algorithms are used to deal the outlier and to select the initial point and the improved k_means algorithm is given.The improved k_means algorithm is verified and analyzed by experiments. Experimental results show that the improved k_means algorithm improves the accuracy of clustering in a certain extent.

Keywords/Search Tags:

Clustering, K_means algorithm, Network, Average point

PDF Full Text Request

Related items

1	Research On K_means Clustering Algorithm Based On MapReduce
2	Research And Application Of K_means Algorithm And Swarm Intelligence Algorithm(PSO)Fusion
3	The Study Of The Weighted Average Density Self-adaptive Clustering Algorithm Based On Grid And Its Application
4	Research On Improvement Of PageRank And K_means Algorithm In Web Data Mining
5	The Research Of Clustering Algorithm Based On Cumulative Average Density
6	Study In Nature Image Retrieval Based On Image Semantic
7	Clustering Algorithm Based On Slepian-Wolf Theorem For Wireless Sensor Network And Improvement
8	Research On Simplified Algorithm Of 3D Point Cloud Data Based On Grey Wolf Optimized K-means Clustering Algorithm
9	The Research And Improvement Of DV-Hop Algorithm In Wireless Sensor Network
10	Fuzzy C-means And K-means Clustering Algorithm And Its Parallel