The Research On Clustering Algorithms Based-on Rough Set Theory

Posted on:2010-01-11

Degree:Master

Type:Thesis

Country:China

Candidate:Y Ceng

Full Text:PDF

GTID:2178360275484422

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

With the development of database technology and the widely use of database management system, people need to manage with more and more data. However, currently very few tools can be used for the analysis of these data. So many people want to predict the future trend of existing data by advanced analysis. Thus, data mining was proposed. Data mining is novel and active and is one of the most forward lines of database and information decision research. The rough set theory, which was introduced by Pawlak Z. in 1982, is a useful tool to deal with vagueness and uncertainty. With rough set theory, decision or classification rules can be deduced during the process of knowledge reduction, with classify ability not decreased. Rough set theory are often used with rule induction, classification and clustering.This paper firstly introduces the basic concepts and improvements and applications of data mining and rough set theory, and the rough set based data mining methods. After draw comparisons of rough sets and clustering algorithms, it presents a novel method to BIRCH, by introducing the concept of upper and lower approximation into the BIRCH. This method analyses an attribution problem of data objects, when data objects simultaneously belong to multiple clusters. The experimental results show that the improved BIRCH has accuracy improved.Outlier mining is an important research of data mining. After analysis and comparison of commonly used methods of outliers, it makes an outlier detection method based on K-means and agglomerative clustering for data stream. The result of the experiment shows that this algorithm is feasible and effective.

Keywords/Search Tags:

Data mining, Rough set theory, Clustering, Outliers

PDF Full Text Request

Related items

1	Study On Rough Set Based Data Mining Methods
2	The Research Of Clustering Based On Rough Set Theory
3	Research On The Outliers Detection Algorithm
4	Research On Application Of Rough Set Theory In Data Mining
5	The Research On Clustering Algorithm Based On Particle Swarm Optimization And Rough Set Theory
6	Network Intrusion Detection Research Based On Rough Set And Outliers Mining
7	Research Of Text Mining Based On Rough Set Theory
8	The Research On Data Mining Algorithms Based-on Rough Set Theory
9	Research And Application Of Data Mining Based On Rough Set In Clustering Discrimination
10	K-distance-based Outliers And Clustering Algorithm