Research Of Some Intelligent Mining Algorithms Based On Knowledge Roughness And Extended Reducts Of Rough Sets

Posted on:2006-08-27

Degree:Master

Type:Thesis

Country:China

Candidate:H X Sha

Full Text:PDF

GTID:2168360152466615

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Rough sets theory is an effective approach to analyzing incomplete and uncertain information systems. Rough sets analysis only uses the internal information of data and does not rely on any prior knowledge . It has been successfully applied to data mining and KDD, which promotes itself as a predominant approach in DM and KDD.The classical rough sets model was mainly set up for dealing with complete information systems without any missing data nor with imprecise data and such systems have been the topics for many rough set based knowledge reduction. However, we often need to deal with incomplete or inconsistent data in practical applications because of some undesirable reasons , which puts forward too many problems to the classical rough sets to solve . So, many extended rough sets models merge as the time requires ,such as the variable precision rough set , fuzzy rough sets etc . The extended models of rough sets introduce approximate and probabilistic ideas into classical rough sets and define new reducts , core and discernibility matrix of themselves and have made some satisfactory achievements in certain applications.In this paper, we first propose a new approach to the construction of a decision tree based on knowledge roughness and the decision tree obtained by this algorithm is simpler than that of ID3.In the third chapter, we analyze the properties of some reducts in inconsistent information system , namely the distribution reduct, the maximal distribution reduct and the reduct under entropy perspective. As a result of an intensive analysis of them , we conclude an equivalence between the distribution reduct and the one under entropy theory.Furthermore , we bring forth a new way to measure the casualness of the relationship between the reduct attributes and the decision ones withthe help of chi-square in statistic .which provides referenced information when choosing a best reduct from among many candidate ones. In many special applications , dynamic additions and modifications of data require learning algorithms to learn incrementally on the base of previous knowledge instead of running the mining algorithm from scratch every time. Therefore , we present an incremental learning algorithm based on the maximal discernable matrix and we apply the algorithm to the clinical data analysis .The learning results prove the effectiveness and correctness of the incremental learning algorithm in the end.

Keywords/Search Tags:

Rough sets, reduct, chi-square, incremental learning, discernibility matrix

PDF Full Text Request

Related items

1	Research On Reduct Algorithm In Rough Set Theory
2	Fuzzy Rough Sets Reduct Based On Discernibility Matrix
3	Research On Optimization Computation Approach For Reduct Based On Rough Set
4	Research On Incremental Knowledge Reduction Algorithm Based On Binary Discernibility Matrix
5	Study Of Rough Set Theory Based Incremental Algorithms And Its Application
6	Research On Data Reduction Based On Rough Sets And Extension Of Rough Set Models
7	Rough Sets Model Based In Incomplete Information Systems
8	The Decision Table Compression Based On The Rough Set Theory
9	Study Of Several Problems In Rough Set Theory And Its Applications
10	Research On Incremental Knowledge Reduction Algorithm For Imcomplete Information System