Title :
A Parallel Hierarchical Aggregation Algorithm in High Dimensional Data Warehouse
Author :
Hu, Kongfa ; Liu, Jiajia ; Chen, Ling ; Da, Qingli
Author_Institution :
Yangzhou Univ., Yangzhou
Abstract :
OLAP (on-line analytical processing) queries tend to be complex and ad hoc, often requiring computationally expensive operations such as multi-table joins and aggregation. In the high dimensional data warehouse(DW), we full materialized the data cube impossibly. In this paper, we propose a novel aggregation algorithm, PDHEPA (parallel pre-grouping aggregation based on the dimension hierarchical encoding), to vertically partition a high dimensional dataset into a set of disjoint low dimensional datasets called fragment mini-cubes. PDHEPA uses the small dimension hierarchical encoding and their prefix, so that it can drastically reduce the multi-table join operations. As a result, the method we proposed in this paper can greatly reduce the disk I/Os and highly improve the efficiency of OLAP queries. The analytical and experimental results show that the PDHEPA is more efficient than other existed ones.
Keywords :
data warehouses; hierarchical systems; parallel algorithms; query processing; fragment minicubes; hierarchical encoding; high dimensional data warehouse; high dimensional dataset; multitable joins; online analytical processing queries; parallel hierarchical aggregation algorithm; parallel pregrouping aggregation based on the dimension hierarchical encoding; Aggregates; Cities and towns; Computer science; Costs; Data analysis; Data warehouses; Databases; Encoding; Material storage; Partitioning algorithms;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.106