Research On The Storage Technique Of Data Cube Based-on Dimension Hierarchy

Posted on:2009-06-29

Degree:Master

Type:Thesis

Country:China

Candidate:Z P Peng

Full Text:PDF

GTID:2178360245483250

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Data cube is the kernel conception of data warehouse and on-line analytical processing (OLAP). It usually needs to pre-compute and save the data cube in disk in order to promptly answer complex multidimensional queries in the OLAP applications. But the large size brings about a lot of trouble when they are computed and saved. To decrease disk storage cost and improve queries performance are very important but contradictive goals of data cube research. For the sake of resolving these problems, it needs to explore the effective structures of data cube.A new approach named ICODH is improved here, which computes tuples in recursion from bottom to top. When one dimension is computed, it computes from the coarse granularity to the fine granularity in circle. By sharing the sorting costs, it decreases the reading and writing operations of the disk in order to reduce the dimension hierarchy cube's computation time. On the other side, basing on the research of dimension hierarchy encoding technique, this paper also proposes an effective method to encode the dimension table. This approach preserves the semantic relations by virtue of the compressing mechanism. Through two sides, the data cube speeds up the computation and improves the performance of query.Condensed Data Cube has been proposed as an effective approach for reducing data cube's size, but there are still lots of prefix-redundancy in the data cube, such as intra-cuboid prefix redundancies and among-cuboid prefix redundancies. For this, a data cube structure named IDHC is extended here. It combines two techniques—BST condensing and intra-cuboid prefix-sharing. According to the character of dimension hierarchy, it clusters the cube tuples which has the same grouping dimension set (or the single dimension set), reduces the size of cube because the tuples in same cluster can share the prefix. Meantime, when these tuples are preserved in disk, the algorithm which tuples generated inthe same batch is proposed in order to eliminate tuple comparisons. Thisapproach eliminates comparisons among tuples in cuboid which containsonly one grouping dimension, and computes IDHC in batch mode wasproposed.

Keywords/Search Tags:

OLAP, Date Cube, dimension hierarchy, Dimension hierarchy encoding, Base single tuple

PDF Full Text Request

Related items

1	OLAP Algorithm Research Based On Dimension Hierarchy For Data Cube
2	Research On The Method Of Extracting Dimension Hierarchy Of Tweeters' Interests For OLAP
3	Optimization Method To Calculate Of High-dimensional Data Cube Based On The Shell-fragment
4	Research On Multi-Dimensional Data Storage And Query Based On P2P
5	Research In Data Warehouse Technology
6	The Deviseing And Achieveing Of OLAP System Base On Datawarehouse
7	Structurally heterogeneous OLAP dimensions
8	Research And Application On Grouping Aggregation Algorithm For OLAP Based On Dimension Hierarchical Encoding
9	Research On Query Optimization And Multi-dimensional Data Model For OLAP
10	Research And Implementation On Multi-dimensional Network Analysis System Based On Cloud Platform