
Mining Frequent Itemsets from Uncertain Data: Extensions to Constrained Mining and Stream Mining

Posted on: 2011-07-17
Degree: M.Sc.
Type: Thesis
University: University of Manitoba (Canada)
Candidate: Hao, Boyu
Full Text: PDF
GTID: 2448390002954722
Subject: Computer Science
Abstract/Summary:
Most studies on frequent itemset mining focus on precise data. In many situations, however, the data are uncertain, which calls for mining frequent itemsets from uncertain data. In other situations, users are interested only in those frequent itemsets that satisfy user-specified aggregate constraints, which calls for constrained mining of uncertain data. In still other situations, uncertain data are produced in large, continuous volumes, which calls for stream mining of uncertain data. In this M.Sc. thesis, we propose algorithms for all three situations. We first design a tree-based mining algorithm that finds all frequent itemsets in databases of uncertain data. We then extend it in two directions: to mine databases of uncertain data for only those frequent itemsets that satisfy user-specified aggregate constraints, and to mine streams of uncertain data for all frequent itemsets. Experimental results show the effectiveness of all these algorithms.
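To make the setting concrete, the following is a minimal sketch and not the tree-based algorithm of the thesis. It assumes the expected-support model that is common in the uncertain-data literature: each item in a transaction carries an existential probability, items are treated as independent, and an itemset is frequent when its expected support reaches a threshold. The abstract does not state which model the thesis uses, so this is an assumption. The names expected_support, frequent_itemsets, minsup and prices, as well as the sample data, are hypothetical, and the candidates are enumerated level by level by brute force rather than with a tree.

```python
from itertools import combinations
from math import prod


def expected_support(itemset, transactions):
    """Expected support under the (assumed) independence model:
    sum over transactions of the product of the items' probabilities.
    An item missing from a transaction has probability 0."""
    return sum(prod(t.get(item, 0.0) for item in itemset) for t in transactions)


def frequent_itemsets(transactions, minsup, constraint=None):
    """Brute-force, level-wise search (NOT the thesis's tree-based method).

    transactions: list of dicts mapping item -> existential probability
    minsup:       minimum expected support
    constraint:   optional predicate on an itemset (e.g. an aggregate constraint)
    """
    items = sorted({item for t in transactions for item in t})
    result = {}
    for k in range(1, len(items) + 1):
        any_frequent = False
        for candidate in combinations(items, k):
            esup = expected_support(candidate, transactions)
            if esup >= minsup:
                any_frequent = True
                # The constraint only filters the output; it is not used to prune.
                if constraint is None or constraint(candidate):
                    result[candidate] = esup
        # Expected support never grows when items are added, so if no
        # k-itemset is frequent, no larger itemset can be frequent either.
        if not any_frequent:
            break
    return result


# Hypothetical data: three uncertain transactions.
transactions = [
    {"a": 0.9, "b": 0.8, "c": 0.2},
    {"a": 0.6, "c": 0.5},
    {"a": 0.5, "b": 0.5},
]

# All frequent itemsets with expected support of at least 0.9.
all_frequent = frequent_itemsets(transactions, minsup=0.9)

# Hypothetical aggregate constraint: total price of the itemset is at most 10.
prices = {"a": 4, "b": 7, "c": 1}
cheap_frequent = frequent_itemsets(
    transactions,
    minsup=0.9,
    constraint=lambda itemset: sum(prices[i] for i in itemset) <= 10,
)

print(all_frequent)
print(cheap_frequent)
```

In this sketch the constraint is applied as a filter after the support check. Whether and how a constraint can be pushed into the search itself depends on its properties, which the abstract does not describe.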
Keywords/Search Tags: Uncertain data, Frequent itemsets, Constrained mining, Stream mining, User-specified aggregate constraints