Dynamic Parameters Of A Density-based Unit Clustering Algorithm

Posted on:2006-02-11

Degree:Master

Type:Thesis

Country:China

Candidate:H Q Bao

Full Text:PDF

GTID:2208360155966446

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

In this article, concepts, techniques and algorithms about clustering will be discussed. We give a dynamic parameter solution for parameter selection problem in density based clustering algorithm.Among the various algorithms put forward, a main class of them are based on "distance" , whether it is in the sense of traditional Eculid distance or others. "K-means" and "k-medoids" are two of this kind. However, these algorithms are inefficient when dealing with large data sets and data sets of high dimension. Further more, the number of clusters they can find usually depends on users' input. But this task is often a very tough one for the user.In this article we give a solution for this parameter selection problem,called a dynamic parameter computing algorithm. The algorithm in this article differs much with above ones and it takes a totally different approach, which we call a grid and density based algorithm. It can automatically find out subspaces containing interesting patterns we want and discover all clusters in that subspace. Besides, it performs well when dealing with high dimensional data and has good scalability when the size of the data sets increases. As results, clusters found are presented to users in DNF expressions.

Keywords/Search Tags:

Data mining, Cluster, density, dynamic parameter., DNF.

PDF Full Text Request

Related items

1	A Clustering Algorithm Based On Density With Its Application In The Customer Cluster In The Field Of Telecom
2	The Outliuer Detection Algorithm Based On Cluster Outlier Factor And Unique Closet Neighbor Set
3	Large-scale Scientific Data Mining Density Clustering Algorithm
4	Improvement Of Density-based Algorithm In Cluster Analysis
5	Research On Data Stream Clustering Algorithm Based On Double-layer Grid And Density
6	Intelligent Based On The Web Log Mining Site
7	The Research And Application Of Data Mining Based On Grid-Density
8	Research On Dynamic Parameter Report System Supporting Data Mining
9	Based Density Data Stream Cluster Mining Algorithm
10	Data Mining Technology In Smt Production Decisions