Research On The Algorithm Of Mining Frequent Itemsets From Uncertain Data Based On The Tree

Posted on:2012-01-03

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Guo

Full Text:PDF

GTID:2178330338992697

Subject:Computer application technology

Abstract/Summary:

With the development of society, more and more data are collected by people. Sometimes they are partly lost or with the noise. But the old algorithms couldn't mine the frequent itemset from the uncertain data effectively, we need the new one to handle these problems.In this paper, we discuss the UF-growth algorithm firstly, and test its performance (compare it with the U-Apriori algorithm). But there are some problems: consumption memory excessively and take too much time. So we propose two methods to solve the difficulties,and three new algorithms:(1)To reduce the memory consumption and increase the chance of path sharing,we discretize and round the expected support,and propose the"LUF-growth".The experiments shows it is effective,and better than UF-growth.(2)When building the UF-tree,we improve the header table to save the time,and propose the"UFT-growth". The experiments shows it is effective and improve the efficiency.(3)When improving the UF-growth by these two methods, we propose the"LUFT-growth". The experiments shows the LUFT-growth save more time than others.

Keywords/Search Tags:

uncertain data, frequent itemset, UF-growth, UF-tree

Related items

1	Research On The Algorithm Of Mining Frequent Itemsets From Uncertain Data Based On The Tree
2	Research On Weighted Frequent Itemset Mining In Uncertain Databases
3	New algorithms for frequent sequential pattern and itemset data mining in certain and uncertain databases
4	Study On Probabilistic Frequent Pattern Mining Over Uncertain Data Stream
5	Research And Application Of Frequent Itemset Mining Algorithm
6	Mining Algorithm Of Frequent Items Based On Item Adjacentcy List And Trasaction Tree
7	The Research And Application Of Association Rules Mining Algorithms Based On Directed Itemset Graph
8	Research On Mining Frequent Itemsets Algorithm Based On Bittable
9	Study On The Key Methods Over Uncertain Database
10	The Research On IDS Based On Mining Max Frequent Itemset Using Big Step Pruning Strategy