Data Density Description Based Data Stream Frequent Pattern Mining

Posted on:2014-01-16

Degree:Master

Type:Thesis

Country:China

Candidate:Q Gao

Full Text:PDF

GTID:2268330392969068

Subject:Computer Science and Technology

Abstract/Summary:

Many current data stream mining methods are derived from methods designed for static datasets. They inherit the basic idea of static mining: store the data in easily controlled memory and mine it there. Consequently, many stream mining methods keep part of the stream on the local machine and mine only that locally stored part, the so-called window mechanism. This idea is not entirely suitable for data streams. Most methods based on sliding windows or landmark windows have an inherent disadvantage: they depend only on the current window, and so inevitably ignore the fluctuation characteristics of the stream. A second disadvantage is that the window size is restricted by storage limits; even the decaying ("recession") window mechanism, which was introduced to address this problem, cannot solve it thoroughly.

To address these shortcomings, this thesis proposes an original method better suited to data stream mining: PDB-FIM, a mining method based on the statistical density distribution characteristics of the data. The main contents of the thesis are as follows:

First, how PDB-FIM stores and processes the information in each item of high-speed arriving stream data.

Second, how the main storage of PDB-FIM is kept in balance by pruning item sets according to probability density information and support information.

Third, the concepts of the complete information tree and the incomplete information tree are proposed, and a strategy of maintaining one incomplete information tree and one complete information tree is adopted to solve the storage problem.

Last, the method of processing stream data and generating probability information from it.

The method has the following advantages: it requires less memory, takes historical data into account, can detect itemsets that are frequent now but were not frequent in the historical data, and is sensitive to changes in the stream.
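
The window mechanism criticized in the abstract can be illustrated with a minimal sketch. This is not PDB-FIM, whose details the abstract does not give; it is a generic sliding-window itemset counter, and the class name `SlidingWindowMiner` and its parameters `window_size` and `max_len` are assumptions made for illustration only. It shows why a window-based miner forgets anything that has slid out of the current window.

```python
from collections import Counter, deque
from itertools import combinations


class SlidingWindowMiner:
    """Hypothetical baseline: counts itemsets over only the most
    recent `window_size` transactions (not the thesis's PDB-FIM)."""

    def __init__(self, window_size, max_len=2):
        self.window_size = window_size  # number of transactions retained
        self.max_len = max_len          # largest itemset size to count
        self.window = deque()
        self.counts = Counter()

    def _itemsets(self, transaction):
        # Enumerate all itemsets of size 1..max_len in canonical order.
        items = sorted(set(transaction))
        for k in range(1, self.max_len + 1):
            yield from combinations(items, k)

    def add(self, transaction):
        # Count the new transaction, then evict the oldest one if the
        # window is over capacity and undo its contribution.
        self.window.append(transaction)
        self.counts.update(self._itemsets(transaction))
        if len(self.window) > self.window_size:
            expired = self.window.popleft()
            self.counts.subtract(self._itemsets(expired))

    def frequent(self, min_support):
        # Support is relative to the current window only.
        n = len(self.window)
        if n == 0:
            return {}
        return {s: c / n for s, c in self.counts.items()
                if c > 0 and c / n >= min_support}


miner = SlidingWindowMiner(window_size=3)
for t in [("a", "b")] * 3 + [("c",)] * 3:
    miner.add(t)
result = miner.frequent(0.5)
```

After the six transactions above, `result` contains only the itemset `("c",)`: the earlier pattern `("a", "b")` has left the window and is no longer visible, which is the dependence on the current window that the abstract describes.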

Keywords/Search Tags:

data density description, data stream mining, frequent item-set

Related items

1	Research On And Implementation Of Frequent Item Set Mining System In Data Stream
2	Study On Probabilistic Frequent Pattern Mining Over Uncertain Data Stream
3	Research On The Algorithm For Mining Frequent Items From Data Streams
4	Research On Frequent Item Mining And Correlation Analysis In Data Streams
5	Mining Frequent Itemsets Over Recent Data Stream
6	Research On Multi-stream Frequent Item Set Mining Algorithm
7	Study On Key Technologies Of Frequent Items Mining And Clustering On Data Streams
8	Research And Application Of Mining Frequent Items Algorithm In Data Streams
9	Research Of Webpage Hot Topic Retrieving Technology Based On Data Stream Mining
10	Frequent Itemsets Mining Algorithm And Its Application In Data Flow