Based Density Data Stream Cluster Mining Algorithm

Posted on:2008-08-29

Degree:Master

Type:Thesis

Country:China

Candidate:Y M Wang

Full Text:PDF

GTID:2178360212480715

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Recently, there are more and more applications that are facing the envirnoment of stream data. Stream data is a kind of continuous; ordered, changing fast and huge amount data. It is quite a new object that is different from traditional static data stored on the disk. Currently, data mining in data stream becomes a hot research field. First, we introduce the knowledge of data mining and discuss the data stream mining, then we build a data stream mining algorithmâ€”DSCluster which may cluster and detect outliers in data stream containing both continuous and categorical attributes. Furthermore, the paper reports experiments on real-life datasets and synthetic datasets, the results show that our algorithm can get higher accuracy of clustering within limited memory, and has the good scalability with the quantity and the dimensionality of stream data. Finally, we summarize the content of paper and point out the research emphases for future work.

Keywords/Search Tags:

Data Stream Ming, Outlier, Mixed Attibute, Data Mining

PDF Full Text Request

Related items

1	Research And Application On Data-stream Outlier Data Mining
2	Local-oriented Data Stream Abnormal Outlier Mining Algorithms And Applications Dynamically
3	Outlier Mining Study Over Data Streams
4	Research On Outlier Detection For Stream Data Based On Sliding-window Model
5	Clustering Analysis And Outlier Detection Algorithms On Uncertain Data
6	Research And Implementation On Key Techlogy Of Data Stream Mining
7	Research On An Application Of Data Stream Query And Data Stream Mining In Oil Field
8	Research On Outlier Mining Method Based On Deviation Characteristic
9	Study On Data Stream Techniques And Its Application In Electric Power Information Processing
10	Research On Frequent Itemset Mining Algorithm Of Uncertain Data Stream