Study Of Association Rules Algorithms In Data Mining

Posted on:2012-07-16

Degree:Master

Type:Thesis

Country:China

Candidate:Y Q Duan

Full Text:PDF

GTID:2248330395955230

Subject:Computer system architecture

Abstract/Summary:

PDF Full Text Request

With the development of computer and information technology, data mining hasbecome the focus of attention, widely used in all walks of life. Association rule miningis one of the most important parts in data mining. It obtain the relationship betweenfrequent itemsets by find all frequent itemsets in transaction database and dig outvaluable knowledge and information for users. Association Rules Mining plays animportant role in insurance coverage design, stock market analysis, network dataprocessing and other fields. With the development of database applications, the speedof data collection and storage increases ceaselessly, the traditional association rulemining algorithm has been unable to adapt to such changes, so the study of a highperformance data mining algorithm is very necessary.This thesis analyzed Apriori and FP-Growth in association rule algorithm andresearched them in detail, on this basis of the algorithm come up with two improvedalgorithms. The main work is as follows:Study of the basic concept of data mining technique, the process of data miningand association rules mining concept, basic principles and main research direction indetail.Analysis and research on data mining association rules of the classic Apriorialgorithm, The algorithm has short-coming in the implementation process of scanningdatabase too many numbers and produce a large number of candidate itemsets defects,we put forward a kind of algorithm based on the improved algorithm, the improvedalgorithm in mining process can removed Non frequent itemsets timely and onlyneeds to scan the database once, the follow-up work is coped in memory, so in timeand space efficiency has greatly improved.In-depth study of the frequent pattern growth algorithm FP-Growth, the algorithmneed not to produce candidates and widely used in the current mining frequent itemsetsalgorithm, but FP-Growth algorithm can not effectively mining large databases, andthe time and space complexity is higher. Aiming at the deficiency of the algorithm, thisthesis improved the original algorithm. The improved algorithm improve miningefficiency by adopted cancel redundant item, the decomposition database method formining association rules, at the same time greatly meet the needs of large databasemining.

Keywords/Search Tags:

Data Mining, Association Rules, Apriori Algorithm, FP-GrowthAlgorithm

PDF Full Text Request

Related items

1	Research And Application Of Apriori Mining Algorithm Based On Association Rules In Colleges Management System
2	The Research And Implementation Of Association Rules Algorithms-Apriori Based On Cloud Computing
3	Research And Implementation Of Web Log Mining Based On Asociation Rules Apriori Algorithm
4	Research Of Data Mining By Assocation Rules And Its Application To The Analysis Of Academic Achievements
5	Research On Incremental Updating Association Rules Mining Based On Apriori Algorithm
6	Research And Application Of Apriori Algorithm In Association Rules Mining
7	Study On Associations Rules's Apriori Algorithm In Data Mining
8	Research And Application Of Association Rules Mining Algorithm Based On MapReduce
9	Model Design, Based On The Apriori Algorithm And Olap Association Rules Mining
10	Research On The Apriori Algorithms For Meteorological Data Association Rules Analysis Based On Cloud Computing