Improvement Of Density-based Algorithm In Cluster Analysis

Posted on:2014-11-06

Degree:Master

Type:Thesis

Country:China

Candidate:Z Z Lin

Full Text:PDF

GTID:2298330434472506

Subject:Computer software and theory

Abstract/Summary:

Cluster analysis is the task of grouping a set of objects so that objects in the same group are more similar to each other than to those in other groups. It is a main task of exploratory data mining and a common technique for statistical data analysis, used in many fields including statistics, machine learning, pattern recognition, information retrieval, and bioinformatics. Numerous clustering algorithms have been proposed so far; density-based clustering is one of the powerful methods, able to detect arbitrarily shaped clusters in the data space. Existing density-based clustering algorithms such as DBSCAN and DENCLUE are not suited to clusters of different densities because they rely on global parameters. SNN is not very efficient because it has to reconstruct the shared nearest neighbor (sNN) graph from the k nearest neighbor (kNN) similarity matrix. In this paper, we propose a clustering algorithm, DEFAT, which is based on a novel model called Density-Flow. In the Density-Flow model, data objects can share their local density information to derive a global similarity between objects. Based on that, DEFAT can easily separate dense areas from sparse ones, so it can detect clusters of various shapes and sizes and of different densities, even when the clusters overlap. Our experiments on both synthetic and real-world data sets demonstrate that our approach outperforms existing density-based clustering algorithms in both effectiveness and efficiency.
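
To make the SNN step mentioned above concrete, the following is a minimal sketch of how a shared nearest neighbor graph can be derived from k nearest neighbor lists. It is not DEFAT and not taken from the thesis; it assumes one common formulation in which two points are linked only if each appears in the other's kNN list, and the link weight is the number of neighbors they share. The function names `knn_lists` and `snn_graph` and the brute-force neighbor search are illustrative assumptions.

```python
from math import dist


def knn_lists(points, k):
    """Return, for each point, the set of indices of its k nearest neighbours.

    Brute-force O(n^2 log n) search, used here only for illustration.
    """
    neighbours = []
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: dist(p, points[j]))
        neighbours.append(set(others[:k]))
    return neighbours


def snn_graph(neighbours):
    """Build a shared nearest neighbour graph from kNN sets.

    Assumed definition: an edge (i, j) exists only when i and j are in each
    other's kNN sets; its weight is the size of the intersection of those sets.
    """
    graph = {}
    for i, nbrs in enumerate(neighbours):
        for j in nbrs:
            if i < j and i in neighbours[j]:
                graph[(i, j)] = len(nbrs & neighbours[j])
    return graph


# Two small, well-separated groups of 2-D points (made-up data).
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
graph = snn_graph(knn_lists(points, k=2))
for edge, weight in sorted(graph.items()):
    print(edge, weight)
```

Because the second pass walks every kNN list again to build the graph, the extra cost the abstract attributes to SNN is visible even in this small sketch.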

Keywords/Search Tags:

Density-Flow, Similarity, Clustering, Data Mining

Related items

1	Clustering XML Documents Based On Density And Fuzzy Set
2	Study On Clustering For Large Data Sets And Its Applications
3	A Improved Density Peaks Clustering Algorithm
4	Research On Density Peaks Clustering
5	Research On Hierarchical Clustering Algorithm Based On Density Peaks
6	Research On Density Based Clustering Algorithms For Varying Density Data
7	Density Clustering Analysis Algorithm Based On Variable Neighbor And Adaptive Density
8	Clustering Ensemble Method And Application Based On Local Weighting And Inter-class Similarity
9	Study Of Spatial Data Mining Algorithm Based On Density Clustering
10	Research On Improved Clustering Algorithm Base On Density