The Improvement Of Complete Decision Tree Based On The Information Gain Theory

Posted on:2012-05-03

Degree:Master

Type:Thesis

Country:China

Candidate:Q Liu

Full Text:PDF

GTID:2218330362456802

Subject:Spatial Information Science and Technology

Abstract/Summary:

As an important classification method in Data Mining, Decision Tree is a method that has simple and efficient classification results. The Decision Tree constructs a model by training sample data and then classifies data by the model previously built. Research on Decision Tree has more than 40 years history and many algorithms came out these years. Some classic algorithms, such as ID3, C4.5, C5.0, are based on Information Gain theory. These methods have their own advantages like clear, simple and fast. But on the other hand, when using the Information Gain theory to divide attributes, it tends to choose the attribute which has more values. In this paper, combined with Information Gain theory's advantages, the improved decision tree algorithm is chosen to be the main research target.By introducing a new type of decision tree node, a couple of attributes would be chosen instead of a single attribute. We call the decision tree built with this type of node the Complete Decision Tree (CDT). CDT based on Information Gain retains the gain calculation selection criteria. Meanwhile, CDT improve the robust and accuracy of the algorithm. It excavates the potential of decision tree based on Information Gain.A Car Evaluation Data Set of UCI is used to test the CDT based on Information Gain. The result is compared to ID3 and C4.5. Depend on the Range parameter, CDT got a better accuracy compared to ID3 and C4.5. On the other hand, CDT made a reasonable time consuming. It proves an improvement on classification accuracy rate and sacrifices few time complexity.

Keywords/Search Tags:

Information Gain, Complete Decision Tree (CDT), Attributes division, Range parameter

Related items

1	Text Classification Algorithm Based On Attributes Correlation
2	The Research On Application Of Improved ID3 Of Decision Tree Classification Algorithm In Management Of Students' Grades
3	The Research On The Algorithms Of Optimizing Decision Tree Classification
4	Research On The Improvement Of Filtration Ability Of Firewall Using Decision Tree
5	Research On Decision Tree Classification Based On Discrete Attribute
6	The Decision Tree Classifier Hope In The Sales Management System
7	Research And Application On Decision Tree In Data Mining
8	The Application Of Decision Tree In Food Rotation Business
9	Inductive Decision Tree Classification Model In The Military Transport Vehicle Management System
10	Decision Tree Methods In Data Mining And Customer Classification