Efficient storage and query processing of set-valued attributes

Posted on:2002-09-02

Degree:Ph.D

Type:Dissertation

University:The University of Wisconsin - Madison

Candidate:Ramasamy, Karthikeyan

Full Text:PDF

GTID:1460390011992035

Subject:Computer Science

Abstract/Summary:

In order to better support complex applications, object relational systems provide features that are absent in relational systems. The main features a new type by providing collections of existing types. It is well known that sets are useful in modeling a great deal of real world data. However, such powerful modeling comes at a price; without an efficient implementation, using sets can yield a performance much worse than that obtained using only traditional relational constructs. This dissertation explores novel ways of implementing set-valued attributes in an object relational system. Specifically, it considers various options for storing set-valued attributes, and ways of computing the challenging set containment join operation.; We first address the problem of storing set-valued attributes. Using the orthogonal attributes of nesting and location we identify four options for representing sets: nested internal and external, and unnested external and internal. These representations can be combined with the creation of various indices to create various classes of indexed representations. We evaluate each of these representations with respect to conjunctive and disjunctive queries. Our results show that overall the nested implementations perform better than the unnested implementations because: (a) they exploit grouping semantics while fetching the members of a set instance and (b) they allow the evaluation of set predicates directly on the set instance.; Next we consider the problem of efficiently evaluating set containment joins. For unnested external representation, the set containment join can be expressed directly in SQL. By contrast, the most obvious algorithm for computing set containment joins on nested representations is the signature nested loops algorithm, which computes set signatures and compares each signature in a relation with all the signatures in the other relation. To improve on the performance of this algorithm we propose a new partitioned set join algorithm (PSJ), which uses a multi-level scheme of partitioning by replicating the inner relation. Our performance study shows that for extremely small relation and small set cardinalities, the SQL query approach and signature nested loops perform comparably to PSJ. However, as the size of the data sets increase (in both relation and set cardinality), PSJ clearly dominates.

Keywords/Search Tags:

Set-valued attributes, Relation, PSJ, Set containment, Sets

Related items

1	The Solution Set Structure Of Fuzzy Relation Equations With Max-*Composition On [0,1]
2	Research On The Topological Structure And Model Extension Of Single-valued Neutrosophic Rough Sets
3	Research On Some Special Fuzzy Sets Theories And Their Applications
4	The Research Of Fuzzy Equivalence Relations In Different Fuzzy Systems
5	Theory And Application Of Flou Sets And Flou-valued Sets
6	Rough Set Theory On The Interval-Valued Fuzzy Information Systems
7	Intuitionistic Fuzzy Sets And Their Applications Based On Interval-valued Level Cut Sets
8	The Research Of Certain Questions About Interval-valued Intuitionistic Fuzzy Sets
9	Research On Metric Of Interval-valued Fuzzy Sets
10	Intuitionistic Fuzzy Granular Structures And Uncertainty Research In Intuitionistic Fuzzy Information Systems