Mining Frequent Itemsets Over Recent Data Stream

Posted on:2011-11-19

Degree:Master

Type:Thesis

Country:China

Candidate:J Han

Full Text:PDF

GTID:2178330332460943

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Quickly and accurately finding frequent items of large amounts of the data stream is an important basis for prediction and decision-making, this paper presents an approach about mining frequent itemsets on data stream within the current window. The study combines the sliding window techniques, frequent itemsets, genetic algorithm and parallel processing technology.Sliding window has been used in the network communication, time-series data mining, data stream mining and so on. This algorithm uses the sliding window to obtain the current data stream. We use the genetic algorithm to achieve the result mainly through crossover, mutation and selection. After several generations of selection, we achieve a final frequent itemsets. In this paper, we use standard pattern PGA (parallel genetic algorithm). When we establish the parallel part in the program, we can let this part run into GPU.First, the nested sliding window divides the data into data sets, and then the method use the parallelism and the global optimum and the capability of processing mass data of genetic algorithms to search for the frequent itemsets in sliding window. With the data stream flowing, this method is to capture the latest frequent itemsets accurately and timely on data stream. It is also periodically delete the expired data stream. As the use of nested windows and the parallel processing capability of genetic algorithm, this method reduced the space complexity and time complexity. Test proved that the method is effective and practical.

Keywords/Search Tags:

data stream, frequent item sets, genetic algorithm, nested sliding window

PDF Full Text Request

Related items

1	Research On Multi-stream Frequent Item Set Mining Algorithm
2	Research On Frequent Patterns Mining Algorithm Based Sliding Window In Data Streams
3	Research On The Algorithm For Mining Frequent Items From Data Streams
4	Frequent Itemsets Mining Algorithm And Its Application In Data Flow
5	Research On Frequent Item Mining And Correlation Analysis In Data Streams
6	Research On Frequent Pattern Mining Algorithm Of Data Stream Based On Sliding Window
7	Study On Probabilistic Frequent Pattern Mining Over Uncertain Data Stream
8	Research On Optimization Of Data Stream Frequent Itemsets Mining Algorithm Based On Sliding Window
9	Research On Frequent Itemsets Mining Algorithm In Data Stream
10	Study On Key Technologies Of Frequent Items Mining And Clustering On Data Streams