Font Size: a A A

Research And Application Of Association Rule Mining Algorithm

Posted on:2008-10-28Degree:MasterType:Thesis
Country:ChinaCandidate:Q MaFull Text:PDF
GTID:2178360242458942Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Data mining is to reveal the implicated but useful information from massive, incomplete, noisy, fuzzy dataset. Its essential target is to extract valuable pattern from the large-scale database. Association rule mining is an important branch of data mining that has obtained many valuable results but there still are a deal of more challenging problems to discuss. Data Mining especially association rule mining is discussed systematically, deeply, completely and in detail. The main contents are listed as follows:Firstly, survey the research of data mining. Having mastered basic definitions of data mining, the common techniques and objective of this method are classified and summarized respectively in detail. Also the current development tendency in inland and abroad is analyzed widely and completely. Furthermore, the future development tendency and research hotspot are discussed.Secondly, survey the research of association rule mining. Having mastered the basic concepts of association rules, the Apriori algorithm which belongs to a kind of association rules algorithm is analyzed and discussed in detail. Then all kinds of optimized techniques designed to promote this algorithm's efficiency are studied and discussed in detail here.Thirdly, a novel LApriori algorithm implemented with reference to Apriori algorithm is produced here which is used to mine association rules from large-scale dataset. The theories related to this algorithm are given a detailed statement and then the whole implementation of this algorithm is expounded in detail. Comparing with Apriori algorithm, LApriori algorithm has the following properties: the first, database is accessed only once by this algorithm; the second, frequent k-itemset mining could obtained in virtue of frequent (k-1)-itemset mining and it is not necessary to scan database again; the third, adopting binary system storage mode will save a mass of space and then operating binary data also could save lots of time. The main conclusion drawn from theoretical analysis and experimental result is that LApriori algorithm is more effective than Apriori and the enhancement in effectivity will be more remarkable with the further expansion of database.Finally, LApriori algorithm is used to vehicle peccancy data mining.
Keywords/Search Tags:data mining, association rule, frequent itemset, the candidates of the frequent pattern
PDF Full Text Request
Related items