
A Method Of Classification And Association Rules Obtaining

Posted on: 2011-06-12
Degree: Master
Type: Thesis
Country: China
Candidate: Turiho Jean Claude
GTID: 2178360308968654
Subject: Computer Science and Technology
Abstract/Summary:
Data mining, popularly known as data or knowledge discovery, is the process of analyzing data from different perspectives and summarizing it into useful information: information that can be used to increase revenue, cut costs, or both. As an active field, data mining has attracted many researchers. Extensive research has been conducted on the major data mining techniques, but few studies have addressed the integration of these techniques.
This thesis focuses on integrating two major data mining techniques, classification and association rule mining, to produce an Optimal Class Association Rule Algorithm (OCARA), which was proposed by our study group. Classification and association rule mining are two important aspects of data mining. Class association rule mining is a promising approach because it uses association rule mining to discover classification rules. OCARA inherits the strengths of both classification and association rule mining algorithms, which makes it a powerful algorithm compared with either kind alone.
To verify the strength of OCARA, we conducted experiments on eight data sets from UCI (University of California at Irvine) and compared OCARA with three other popular algorithms (C4.5, CBA, and RMR). The results showed that rule accuracy and the number of rules are strongly influenced by the support threshold. When the support threshold is between 0.02 and 0.03, accuracy is much better, as discussed in this thesis. In our work the support threshold was set to 0.02 and the confidence threshold to 0.30. Under these settings OCARA proved to be more efficient than the other algorithms and more robust in terms of accuracy.
The reason for OCARA's high accuracy is that it uses an optimal association rule mining algorithm and sorts the rule set by rule priority, which results in a more accurate classifier.
We can therefore say with confidence that OCARA is an accurate classifier that performs better and is more efficient than C4.5, CBA, and RMR. This thesis contributes to the young field of data mining by proposing and testing a new algorithm, OCARA.
However, OCARA produces more rules than RMR when the support is lower. To overcome this limitation, we encourage other researchers to work on this promising algorithm and improve its efficiency.
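To make the roles of the support and confidence thresholds concrete, the following is a minimal Python sketch of generic class association rule mining. It is not the OCARA algorithm itself, whose details are not given in this abstract. The function names `mine_class_rules` and `classify`, the `max_len` limit, and the rule priority used for sorting (higher confidence, then higher support, then shorter antecedent) are assumptions made for illustration; only the threshold values 0.02 and 0.30 come from the text.

```python
from collections import Counter
from itertools import combinations

MIN_SUPPORT = 0.02     # support threshold reported in the abstract
MIN_CONFIDENCE = 0.30  # confidence threshold reported in the abstract


def mine_class_rules(rows, labels, max_len=2,
                     min_support=MIN_SUPPORT, min_confidence=MIN_CONFIDENCE):
    """Return rules (antecedent, label, support, confidence) passing both thresholds.

    rows   : list of item sets, e.g. {"outlook=sunny", "windy=false"}
    labels : list of class labels, one per row
    """
    n = len(rows)
    antecedent_counts = Counter()  # how often each item combination occurs
    rule_counts = Counter()        # how often it occurs together with a label
    for items, label in zip(rows, labels):
        for k in range(1, max_len + 1):
            for combo in combinations(sorted(items), k):
                antecedent_counts[combo] += 1
                rule_counts[(combo, label)] += 1

    rules = []
    for (combo, label), count in rule_counts.items():
        support = count / n
        confidence = count / antecedent_counts[combo]
        if support >= min_support and confidence >= min_confidence:
            rules.append((combo, label, support, confidence))

    # Assumed priority (hypothetical, not taken from the thesis):
    # higher confidence first, then higher support, then shorter antecedent.
    rules.sort(key=lambda r: (-r[3], -r[2], len(r[0])))
    return rules


def classify(rules, items, default=None):
    """Predict with the first (highest-priority) rule whose antecedent matches."""
    for combo, label, _, _ in rules:
        if set(combo) <= items:
            return label
    return default


# Toy data, invented purely for illustration.
rows = [{"a", "b"}, {"a"}, {"b"}, {"a", "b"}]
labels = ["x", "x", "y", "x"]
rules = mine_class_rules(rows, labels)
```

Lowering `min_support` in this sketch admits more candidate rules, which mirrors the observation above that the number of rules grows when the support is lower.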
Keywords: class association rule, association rule, classification, data mining