Font Size: a A A

Optimization of sequence queries in database systems

Posted on:2002-03-16Degree:Ph.DType:Dissertation
University:University of California, Los AngelesCandidate:Sadri, Mohammad RezaFull Text:PDF
GTID:1468390011990670Subject:Computer Science
Abstract/Summary:
The need to search for complex and recurring patterns in database sequences is shared by many applications. In this work, we discuss how to express and support efficiently sophisticated sequential pattern queries in relational database systems. Thus, we first introduce SQL-TS, an extension of SQL, to express these patterns, and then we study how to optimize search queries for this language. We take the optimal text search algorithm of Knuth, Morris and Pratt, and generalize it to handle complex queries on sequences. Our algorithm exploits the interdependencies between the elements of a sequential pattern to minimize repeated passes over the same data. We then present extensions of the algorithm for detecting repeated patterns and disjunctive patterns. We also provide methods for finding the inter-dependencies between the pattern elements for important domains including intervals and vector time-series. In addition, a logic based semantics for SQL-TS is given. Experimental results on typical sequence queries, such as double bottom queries, confirm that substantial speedups are achieved by our new optimization techniques.
Keywords/Search Tags:Queries, Database, Patterns
Related items