Query optimization in distributed database systems and multidatabase systems

Posted on:1995-05-31

Degree:Ph.D

Type:Dissertation

University:Northwestern University

Candidate:Chong, Eugene Inseok

Full Text:PDF

GTID:1468390014989035

Subject:Computer Science

Abstract/Summary:

Distributed query processing algorithms usually perform data reduction by using a semijoin program, but the problem with these approaches is that they still require an explicit join of the reduced relations in the final phase. We introduce an efficient algorithm for join processing in distributed database systems that makes use of bipartite graphs in order to reduce data communication costs and local processing costs. The bipartite graphs represent the tuples that can be joined in two relations taking into account also the reduction state of the relations. This algorithm fully reduces the relations at each site. We then present an adaptive algorithm for response time optimization that takes into account the system configuration, i.e., the additional resources available and the data characteristics in order to select the best strategy for response time minimization. We also report on the results of a set of experiments that show that our algorithms outperform a number of the recently proposed methods for total processing time and response time minimization.; In multidatabase systems, we address the problem of data integration and query processing in the presence of conflicting values for the attributes that correspond to different instances of the same real-world object appearing in multiple roles. We present a new method for schema and data integration in multidatabase systems that presents the answer to the query as a set of sets representing the distinct intersections between the relations representing the various roles of interest. We present extensions to MSQL that allow us to express role-sets and quantifiers applied to them. We further develop a strategy for query processing based on our role-based model. Our method allows for local selection for all queries with/without aggregation and thus reduces the amount of data transmitted. In addition, it makes efficient use of merge-sort techniques to generate the role-sets and compute joins at the global query site.

Keywords/Search Tags:

Query, Data, Systems

Related items

1	Research On Techniques And Systems For Index And Query Optimization Of Big Data
2	Online query processing in Geographic Information Systems
3	Aggregation Query Research Over Continuous Data Streams
4	Research On Querying Missing Data
5	Integrated Query Processing Over Autonomous Heterogeneous Data Sources
6	Research On Key Techniques Of Query Processing Over Wireless Sensor Networks
7	ER-Topk Query Processing On Ucertain Streams
8	Keyword Query For RDF Data Based On Query Translation
9	A framework for ranking data sources and query processing sites in database middleware systems
10	Implementation and applications of query interfaces to constraint databases in a distributed computing environment