The Improvement Of Two Typical Classification Algorithms

Posted on:2012-12-23

Degree:Master

Type:Thesis

Country:China

Candidate:F Z Wang

GTID:2178330335983490

Subject:Computer application technology

Abstract/Summary:

Data mining extracts useful information from large volumes of data. Classification is one of its most important functions and is widely used in fields such as medical treatment, insurance, and finance. Each classification method has its own advantages and disadvantages, and different methods applied to the same data may yield different classification accuracy.

The Bayesian algorithm is often used because it is simple and accurate. However, when the assumption of attribute independence does not hold, the naive Bayesian algorithm may misclassify test samples. In addition, when a test sample has the same probability for every category, the algorithm cannot determine the sample's category. This thesis proposes three improved algorithms to address these limitations and evaluates them on the mushroom data set. Experimental results show that the improved algorithms are much more accurate than the naive Bayesian algorithm.

Rough set theory is another important classification technique, and attribute reduction is one of its central problems. Attribute reduction deletes irrelevant or unimportant attributes while preserving the classification and decision-making ability of the knowledge base. For a given information system, different reduction algorithms can produce different reducts, and these reducts do not all give the same classification accuracy: the accuracy obtained with one reduct may be much lower than that obtained with another. To address this, the thesis proposes an attribute reduction algorithm based on attribute frequency and the lower approximation, and compares it with two other attribute reduction algorithms. Experimental results show that the proposed algorithm achieves much higher accuracy.
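The two naive Bayesian limitations named in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's improved algorithm: the function names (`train`, `scores`, `classify`) and the four-row toy data are hypothetical, chosen only so that the class depends on the combination of the two attributes. Under that dependence the independence assumption fails, and every sample receives the same score for both classes.

```python
from collections import Counter, defaultdict


def train(rows, labels):
    """Count class frequencies and per-class attribute-value frequencies."""
    class_counts = Counter(labels)
    # (class, attribute index) -> Counter of attribute values seen in that class
    value_counts = defaultdict(Counter)
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(label, i)][value] += 1
    return class_counts, value_counts


def scores(row, class_counts, value_counts):
    """Unnormalised naive Bayes score P(c) * prod_i P(x_i | c), no smoothing."""
    total = sum(class_counts.values())
    result = {}
    for c, n_c in class_counts.items():
        p = n_c / total
        for i, value in enumerate(row):
            p *= value_counts[(c, i)][value] / n_c
        result[c] = p
    return result


def classify(row, class_counts, value_counts):
    """Return the best-scoring class, or None when the top score is tied."""
    s = scores(row, class_counts, value_counts)
    best = max(s.values())
    winners = [c for c, p in s.items() if p == best]
    return winners[0] if len(winners) == 1 else None


# Hypothetical toy data (not the mushroom data set): the class is decided by
# the combination of cap colour and surface, never by one attribute alone.
rows = [
    ("brown", "smooth"),
    ("brown", "scaly"),
    ("white", "smooth"),
    ("white", "scaly"),
]
labels = ["edible", "poisonous", "poisonous", "edible"]

model = train(rows, labels)
print(scores(("brown", "smooth"), *model))    # both classes score 0.125
print(classify(("brown", "smooth"), *model))  # None: the tie cannot be broken
```

Each attribute value is equally frequent in both classes here, so the per-attribute probabilities carry no information even though the pair of attributes determines the class exactly. Returning `None` on a tie is one possible design choice for making the undecidable case explicit; how the thesis resolves it is not stated in the abstract.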

Keywords/Search Tags:

Data Mining, Classification, Naive Bayesian Algorithm, Rough Set, Attribute Reduction

Related items

1	The Improvement Of Two Typical Classification Algorithms
2	Research On The Approach Of Classification In Data Mining Based On Naive Bayesian
3	Study On Naive Bayesean Algorithm Based On Attributes Weighting And Reduction
4	Data Mining Research Of Vehicle Sales Based On Hash Quick Attribute Reduction Algorithm
5	Rough Set Data Mining Approach And Its Application Relative To Decision Problem
6	Bayesian Classification Algorithm Based On Attribute Discretization And Its Application
7	Study On Intrusion Detection Method Based On Rough Set Attribute Reduction And Bayesian Classification
8	Research And Application Of Naive Bayesian Classification Based On Attribute Selection
9	Research On Heuristic Attribute Reduction Algorithm Based On Rough Set
10	Research On Classifying Algorithm Based On Rough Set