The Reaserch Of Clustering Techlogies In Data Mining

Posted on:2012-03-04

Degree:Master

Type:Thesis

Country:China

Candidate:F Zhang

Full Text:PDF

GTID:2218330362960363

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Accompanied with the development of Internet, an era of information explosion has already come. It is a new chanllenge to find out the truly helpful information from the vast data ocean. Data mining is a technology that emergies under such a background and is now a very important research area. The target of data mining is to extract useful knowledge for the users in an understandable data structure. It is related with many other areas such as database, data management, modeling and inference, assessment of complexity, vision technology, online updating, etc. Clustering, which is a process that cluster different abstract data into groups based on the similarity among them, is the essenial subject of data mining, and is now applied broadly in mathematics, statistics, biology and economics.This paper analyzes and systematically introduces the broadly used clustering technologies. Based on that, two improved algorithms are proposed:The first is the modified k-means algorithm taking in use of two kinds of improvements, the initial centroids selecting and outlier points deleting policies. The modification efficiently removes the shortcoming of non-controllability caused by random initial centroids selection, adapts the traditonal k-means algorithm into the senario of overlapped clustering.The second is NOV-SOM algorithm, which is a modification to SOM algorithm that alternates the units with function module and extends the latter algorithm into the application of non-vectorized data clustering.In the end, groups of contrast experiments have been conducted. The results show that the two improved algoritms effectively augmented both the precision and efficiency of corresponding traditional algotithms.

Keywords/Search Tags:

Data Mining, Cluster Analysis, Genetic Algorithm, K-means algorithm, Kohonen neural network

PDF Full Text Request

Related items

1	Optimized K-Means Clustering Analysis Based On Genetic Algorithm
2	Research On Parallel K-means Algorithm Based On Genetic Algorithm
3	Research Of K-Means Clustering In Data Mining Based On Genetic Algorithm
4	Data Mining Technology And Its Application In The Supermarket In Crm
5	Research On Cluster Analysis Based On Optimized Genetic Algorithm
6	Research In Data Mining Method Based On Genetic Algorithms
7	Cluster Analysis In Data Mining And Its Control In Applied Research
8	Research And Application In Classifier Of Kohonen-ELM Neural Network Model
9	Research On K-Means Algorithm And Its Integration With Intelligent Algorithms
10	Research And Application Of K-means Algorithm In Data Mining Technology Based On Genetic Algorithm