Study Of Mutigranular Attribute Reduction Method Based On Clustering

Posted on:2019-04-19

Degree:Master

Type:Thesis

Country:China

Candidate:X J Wang

Full Text:PDF

GTID:2428330566465494

Subject:Master of Engineering - Software Engineering

Abstract/Summary:

PDF Full Text Request

At present,with the rapid development of science and technology,a large amount of unlabeled data has been produced in practical applications.The clustering method is a representative method for processing such unlabeled data.However,due to the existence of redundant information,traditional clustering methods suffer from large time consumption and low accuracy.On the other hand,attribute reduction based on rough set theory can reduce redundant attributes and extract useful information while maintaining the same identification ability as the original information system.This paper proposes a cluster-based mutigranular attribute reduction method for unlabeled data.Through the adjustment of the K value in the clustering algorithm,the mutigranular calculation is performed to form the partitions of the universe from coarse to fine.The clustering results are used to complete the supervised attribute reduction in the rough set theory to remove the redundant attributes,and finally the KNN algorithm is used to determine classification results.Specifically,the main work of this thesis is divided into two parts:On the one hand,for equivalence relation-based information systems composed of symbolic data,this paper uses K-modes clustering algorithm,and then uses the clustering results as class labels.By adjusting the K value to form multiple divisions of the universe of domains,the positive domain dicernibility matrix is used to reduce the redundancy attribute for each division.Then the dimensionality of datasets is reduced and the algorithm cost can be saved.On the other hand,for information systems with ordinal attributes,we use dominance relation-based rough set method to reduce redundant attributes based on the mutigranular computation of K-means clustering.Finally,the proposed method is compared with the traditional rough set model and the traditional clustering method.Because the proposed method performs supervised attribute reduction based on clustering information,it improves the unsupervised attribute reduction method.Additionally,since redundant information has been reduced,the performance of clustering algorithm is also improved.

Keywords/Search Tags:

Rough set, Dominance relation, Clustering algorithm, Multigranular computing

PDF Full Text Request

Related items

1	Study And Application Of Knowledge Reduction Algorithms Based On Indiscernibility Relation And Dominance Relation
2	Research On Rough Set Theory And Application In Higher Edueation Evaluation
3	Research On Classification Method Of Inconsistent Information Systems Based On Dominance-based Rough Set
4	The Research On Probabilistic Dominance Relation And The Related Problems Based On Ordered Information Systems
5	The Rough Set Model Based On Tolerance Dominance Relation And Its Application Research
6	Research And Application Of Granularity Clustering Algorithm For Mixed Attribute Data Under Dominance Relation
7	Reseach Of The Rough Set Model Based On Dominance Relation And Its Konwledge Reduction
8	Research On Ordered Decision Based Confidential Dominance Relation Rough Set Model And Its Application
9	Research Of The Rough Set Model Based On Prior Probability Dominance Relation And Its Data Mining Methods In Incomplete System
10	Study Decision-making Approaches Based On Dominance Relation Rough Sets In Incomplete Information System