The Optimization Of The Query Execution Engine In Column Oriented DWMS

Posted on:2015-02-11

Degree:Master

Type:Thesis

Country:China

Candidate:D T Hao

Full Text:PDF

GTID:2268330425981883

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

As information technology continues to evolve, people have to deal with the explosive growth of data. In order to better analyze large amounts of data, data warehouse system (DWMS) came into being. Data warehouse is used for data analysis. Therefore, comparing to OLTP database system, data warehouse is a read optimization system. Column oriented data warehouse system is more adaptive to read optimization environment comparing to traditional data warehouse system. The lab work is trying to develop a column oriented data warehouse system. The author is developing the execution engine of the system.This paper is based on the developing of query exection engine in DWMS. DWMS is a column oritented data warehouse system. This paper is trying to research the implementation and optimization problems in query execution engine. This paper studies three main issues.First is the architecture and implementation of execution engine. This paper proposes a new hash join method to solve hash collision problems by building index in bucket. This index is built on buckets which have too many collisons in them. When probing in this bucket, this index can help improve the probing efficiency.Second is the optimization with column store related technology like ulitizing B+tree index to achive high efficiency selection operation. This paper also tries to integrate Bitmap index into query execution engine. Another important one is to try implementes direct execution on compressed data without decompression. Considering the code explosion problem of achiving direct exection on compressed data, this paper only explores the selection and tuple reconstruction operation on compressed data.Third is the optimization of query execution on multi-core processors. This paper focuses the optimization on aggregation operation that tryes to improve the partition method in aggregation. This paper propose a new dynamic partition method by sampling which make it more adaptive to different characteristic data which achives high efficiency utilization of CPU Cache.

Keywords/Search Tags:

column store, query execution, hash join, CPU Cache

PDF Full Text Request

Related items

1	Research On Optimization Of Database Query Execution For Shared Cache Chip Multi-Processor
2	Research And Implementation Of Key Techniques For Query Rewriting In Column-Store Data Warehouse
3	Optimization And Implementation For DWMS Column-Store Query Execution Engine
4	Efficient Star Join For Column-Oriented Data Store In The MAP Reduce Environment
5	Research And Implementation Of Query Optimizing Of Column Store In Data Warehouse Management System
6	Research On Query Optimization In Column-Oriented Data Warehouse
7	Research And Implementation Of Parallel Query Processing In Column-store
8	The Optimization Of Hash Join Algorithm Based On KNL
9	Research On Some Key Technologies Of Parallel Processing For Big Data Based On Map Reduce
10	Research And Implementation Of Query Execution In Column-Stored Data Warehouse Management System