Research On The Basic Technology Of Association Rules

Posted on:2010-05-27

Degree:Master

Type:Thesis

Country:China

Candidate:Y K Guo

Full Text:PDF

GTID:2178360278981265

Subject:Computer application technology

Abstract/Summary:


Data mining is the nontrivial process of extracting implicit, previously unknown, and potentially useful information from data in databases. Association rule mining, an important field of data mining, discovers interesting relationships among attributes in that data. Based on a study of the domestic and international literature, this thesis investigates several basic problems of association rule mining algorithms. The main contributions are as follows.

First, a maximal frequent itemset mining algorithm, SFP-Miner, based on the Sorted FP-Tree (SFP-Tree), is proposed. SFP-Miner scans the database twice and stores the frequent itemsets in compressed form in the SFP-Tree. Using a depth-first strategy, the algorithm prunes the search space with pre-pruning and merging strategies and efficiently discovers all maximal frequent itemsets without any further database scans. The experimental results indicate that SFP-Miner is an efficient algorithm.

Second, a new updating algorithm, UAMFI, is presented for mining maximal frequent itemsets from a transaction database when the user changes the minimum support. The algorithm adopts a new data structure, the FMSFP-Tree (Full Merged SFP-Tree), which stores all the frequent itemsets for any given minimum support, and it mines and updates the maximal frequent itemsets directly in the FMSFP-Tree. It can therefore mine maximal frequent itemsets efficiently after the minimum support changes. The experimental results show that the algorithm is highly efficient for this updating mining problem.

Finally, a new algorithm, DBSMiner (Density Based Sub-space Miner), is presented for mining association rules over quantitative attributes. Drawing on ARCS (Association Rule Clustering System), the algorithm uses a grid structure to quantize the object space into a finite number of cells, sorts all the dense grid cells in descending order, and uses a grid-based clustering algorithm to cluster the data over all attributes. As a last step, it clusters the association rules. Theoretical analysis and experimental results show that DBSMiner has good performance and accuracy and can effectively mine association rules over quantitative attributes.
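
To make the term "maximal frequent itemset" concrete, the following is a minimal brute-force sketch in Python. It is not SFP-Miner or UAMFI, whose tree structures and pruning strategies are not detailed in this abstract. The function names and the toy transactions are assumptions chosen purely for illustration. The sketch only shows what the mined result looks like and how it changes when the minimum support changes.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: count} for every itemset that occurs in at least
    min_support transactions. Brute-force enumeration, for illustration only."""
    items = sorted({item for t in transactions for item in t})
    frequent = {}
    for size in range(1, len(items) + 1):
        found_any = False
        for candidate in combinations(items, size):
            cand = frozenset(candidate)
            count = sum(1 for t in transactions if cand <= t)
            if count >= min_support:
                frequent[cand] = count
                found_any = True
        if not found_any:
            # If no itemset of this size is frequent, no larger one can be,
            # because every subset of a frequent itemset is itself frequent.
            break
    return frequent


def maximal_frequent_itemsets(transactions, min_support):
    """Return the frequent itemsets that have no frequent proper superset."""
    frequent = frequent_itemsets(transactions, min_support)
    return {s for s in frequent if not any(s < other for other in frequent)}


# Hypothetical toy database of four transactions (absolute support counts).
transactions = [
    frozenset("abc"),
    frozenset("abd"),
    frozenset("abc"),
    frozenset("bd"),
]

for min_support in (2, 3):
    result = maximal_frequent_itemsets(transactions, min_support)
    print(min_support, sorted("".join(sorted(s)) for s in result))
```

With a minimum support of 2 the maximal frequent itemsets of this toy database are {a, b, c} and {b, d}; raising the minimum support to 3 leaves only {a, b}. This sketch recomputes everything from scratch for each threshold, which is the cost an updating algorithm is designed to avoid.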

Keywords/Search Tags:

Data mining, Association rules, Maximal frequent itemset, Updating mining, Sorted frequent pattern tree, High-dimensional clustering


Related items

1	The Research And Implementation Of Association Rule Data Mining Algorithm
2	The Research On The Related Problems Of Association Rule Mining
3	The Research On The Algorithms Of Mining Distributed Maximal Frequent Patterns
4	The Research And Application Of Association Rules Mining Algorithms Based On Directed Itemset Graph
5	Research On Mining Algorithms Of Maximal Frequent Item Sets
6	Research On Algorithms For Mining Maximal Frequent Itemsets
7	The Research And Realization Of Association Rules Data Mining Algorithms
8	Research And Application Of An Improved Algorithm For Association Rules
9	Study On Frequent Pattern Mining Algorithms And Pruning Strategies
10	Research On Key Algorithms For Mining Frequent Patterns In Data Streams And Their Application In Simulation System