Font Size: a A A

Research On Two-tier Structure Clustering Mining Based On Data Stream

Posted on:2009-04-17Degree:MasterType:Thesis
Country:ChinaCandidate:H T ChuFull Text:PDF
GTID:2178360242486662Subject:Application Research of Computers
Abstract/Summary:PDF Full Text Request
With the high development of computer technology,there are more and more applications that facing the environment of stream data.Stream data is a kind of continuous,ordered,changing fast and huge amout data.It is quite a new object that is different from conventional static data stored on the disk.The main achievement in this paper is to design and realize the two-tier framework TWDSCluster which includes two parts the online cluster and the offline cluster.We introduce two concepts microcluster and pyramidal time framework.The statistical information in data points is retained as the form of microcluster,and stored in terms of the pyramidal time framework. It can also detect outliers in data stream efficiently.Experiments show that our algorithm can get higher accuracy of clustering within limited memory.Finally,we summarize the content of the paper and point out the research emphases for future work.
Keywords/Search Tags:data stream, clustermining, outlier detection, two-tier structure
PDF Full Text Request
Related items