Algorithms For Querying And Processing Over Data Streams

Posted on:2006-06-06

Degree:Doctor

Type:Dissertation

Country:China

Candidate:C Q Jin

Full Text:PDF

GTID:1118360155460692

Subject:Software and theory

Abstract/Summary:

The data stream model has appeared in a growing number of information-processing applications over the last decade, including the Internet, sensor networks, network traffic monitoring, network security, data mining, financial monitoring, manufacturing, chronometer and many more. Compared with traditional data models, the data stream model has several distinguishing characteristics: (1) the volume of a stream is unbounded; (2) tuples arrive at a very high rate; (3) the arrival order of tuples cannot be controlled by the application; (4) each tuple can be seen only once, unless it is explicitly stored for a special purpose.

Because of these features, devising algorithms for querying and processing data streams faces the following challenges. First, on seeing each new element in the stream, a stream algorithm must process it quickly enough to update its answers in real time. Second, the main memory or disk storage available for computation is typically very small compared with the volume of the stream seen so far. Third, for most problems a stream algorithm can provide only approximate answers, though generally with guaranteed precision. Finally, a good stream algorithm should remain efficient even when the incoming streams change considerably.

Traditional data processing techniques can hardly be applied to data streams directly. Despite its success in traditional applications, a Database Management System (DBMS) is ill-suited to such data because a DBMS can run a query only after all the data have been loaded. Another traditional approach, which relies on random access to memory holding all the data, is also inapplicable because the volume of the stream seen so far exceeds the memory size. These limitations have led researchers to develop novel querying and processing techniques for data streams.

In this dissertation, we study several principal problems over data streams and make several contributions.

1. Mining frequent items is a basic problem over data streams. We first propose a novel method, called hCount, which can estimate the fre-...
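
To make the constraints described in the abstract concrete (one pass, small memory, approximate answers with a guaranteed error bound), the following is a minimal sketch of the classical Misra-Gries frequent-items summary. It is not hCount, whose details are not given in this excerpt; the function name `misra_gries` and the parameter `k` are illustrative assumptions.

```python
def misra_gries(stream, k):
    """One-pass frequent-items summary using at most k - 1 counters.

    NOTE: this is the classical Misra-Gries algorithm, shown only as an
    illustration of a one-pass, bounded-memory method. It is NOT the
    hCount method proposed in the dissertation.

    For a stream of n items, every returned count c for an item with
    true frequency f satisfies f - n/k <= c <= f, and every item with
    f > n/k is guaranteed to appear in the result.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    counters = {}
    for item in stream:
        if item in counters:
            # Item is already tracked: count this occurrence.
            counters[item] += 1
        elif len(counters) < k - 1:
            # A free counter is available: start tracking the item.
            counters[item] = 1
        else:
            # No free counter: decrement every tracked item and drop
            # those whose count reaches zero. The new item is discarded.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters


if __name__ == "__main__":
    # Each element is seen exactly once and never stored.
    example = ["x", "y", "x", "z", "x", "y", "x"]
    print(misra_gries(example, k=3))
```

Each element is examined once and then discarded, memory use is bounded by `k - 1` counters regardless of stream length, and the returned counts may underestimate true frequencies by at most n/k, which matches the kind of precision guarantee the abstract refers to.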

Keywords/Search Tags:

data stream model, frequent item, quantile, cardinality, continuous query, shared window joins

Related items

1	Research On Algorithms For Mining Top-K Frequent Patterns Over Data Streams
2	Complex Rank Query Over Data Streams: Research And Implementation
3	Research On The Technology Of Continuous Query Processing Over Data Stream
4	Mining Frequent Itemsets Over Recent Data Stream
5	Research On Multi-stream Frequent Item Set Mining Algorithm
6	Research On Frequent Item Mining And Correlation Analysis In Data Streams
7	Study On Key Technologies Of Frequent Items Mining And Clustering On Data Streams
8	Study On Probabilistic Frequent Pattern Mining Over Uncertain Data Stream
9	Frequent Itemsets Mining Algorithm And Its Application In Data Flow
10	Research On The Algorithm For Mining Frequent Items From Data Streams