The Research On The Algorithms Of Optimizing Decision Trees

Posted on:2005-12-09

Degree:Master

Type:Thesis

Country:China

Candidate:J He

GTID:2168360152956005

Subject:Computer software and theory

Abstract/Summary:

Knowledge discovery in databases (KDD) is a multidisciplinary field that draws on database technology, artificial intelligence, statistics, and other areas. The decision tree is a KDD method widely used for mining classification models. It has been studied extensively, and considerable progress has been made. However, because tree induction algorithms use a greedy strategy, decision trees tend to overfit, to grow large, and to produce long classification rules. Many methods have been proposed to address these flaws. This thesis studies these methods thoroughly and then puts forward a new method for optimizing decision trees.

The thesis has three main points:

1. A general survey of KDD is given, covering its definition, basic process, main methods, and state of development. The decision tree and several other methods for mining classification rules are introduced with emphasis.
2. A detailed survey of decision tree optimization approaches is given, such as modifying the test space, modifying the test search, restricting the database, and using alternative data structures. The classical algorithms of each kind of approach are also summarized and critiqued.
3. A new approach is proposed that reduces the set of test attributes of a decision tree, based on knowledge reduction from rough set theory. This approach removes test attributes that are unrelated to the classification. As a result, relatively smaller training sets can be obtained, and these induce relatively smaller decision trees without reducing accuracy. In the last part, the method is evaluated on several data sets and compared with the ID3 algorithm.

Keywords/Search Tags:

KDD, Data Mining, Decision Tree Optimization, Rough Sets

Related items

1	The Research On The Algorithms Of Optimizing Decision Trees
2	Research And Implementation On Larger Data Sets Mining Algorithm Based On Rough Set
3	The Data Mining Algorithm Based On Rough Sets
4	Research On Decision Tree Algorithm Based On Rough Sets And Ensemble Learning
5	Based On The Decision Tree And Rough Set Classification
6	The Research Of Optimizing Algorithms Decision Tree Based On Rough Set Theory
7	Research & Optimization Of Rough Set-Decision Tree Neural Network Forecast Model
8	Rough Set Theory In The Decision Tree
9	Research On Data Mining Methods Based On Fuzzy Sets And Decision-Theoretic Rough Sets And The Application In Image Segmentation
10	The Research On Decision Tree Algorithm Based On Rough Set And Application In CRM