DocumentCode :
1571269
Title :
A Parallel Algorithm for Closed Cube Computation
Author :
Jinguo You ; Jianqing Xi ; Pingjian Zhang
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
fYear :
2008
Firstpage :
95
Lastpage :
99
Abstract :
Closed cubing is an efficient data cube compression technique recently proposed in the literature. It losslessly condenses a group of cells into one cell if these cells have the same aggregate value and preserve roll-up/drill-down semantics. Despite its importance, parallel closed cubing solutions for huge data sets have not, to the best of the authors' knowledge, been well studied. This paper presents a parallel closed cube construction and query algorithm that runs on low-cost PC clusters using the MapReduce framework. In addition, we prove that as the number of data blocks increases, the closed cubes' storage size decreases gradually. Users can therefore specify the number of data blocks to trade off cube storage against query time. An experimental study demonstrates that the algorithm is efficient and scalable.
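To make the idea of condensing cells concrete, the following is a minimal single-machine sketch in Python, not the paper's MapReduce algorithm. It assumes COUNT as the aggregate and uses the standard closure notion: a cell is closed when no more specific cell covers exactly the same tuples. The names `ALL`, `closure`, `closed_cube`, and `query` are hypothetical and chosen only for illustration.

```python
from itertools import product

ALL = "*"  # wildcard marking an aggregated dimension (assumed notation)


def closure(cell, rows):
    """Return (closed_cell, count) for the tuples in `rows` covered by `cell`."""
    covered = [r for r in rows
               if all(c == ALL or c == v for c, v in zip(cell, r))]
    if not covered:
        return None
    # Fix every dimension on which all covered tuples agree.
    closed = tuple(col[0] if len(set(col)) == 1 else ALL
                   for col in zip(*covered))
    return closed, len(covered)


def closed_cube(rows):
    """Brute-force closed cube: map each closed cell to its COUNT."""
    cube = {}
    for r in rows:
        # Enumerate every generalization of the tuple r.
        for mask in product([False, True], repeat=len(r)):
            cell = tuple(v if keep else ALL for v, keep in zip(r, mask))
            closed, count = closure(cell, rows)
            cube[closed] = count
    return cube


def query(cube, cell):
    """COUNT of any cell, recovered from the closed cube alone."""
    # The closure of `cell` is the matching closed cell with the largest count.
    matches = [cnt for c, cnt in cube.items()
               if all(q == ALL or q == v for q, v in zip(cell, c))]
    return max(matches, default=0)


rows = [("a1", "b1", "c1"), ("a1", "b1", "c2"), ("a2", "b1", "c1")]
cube = closed_cube(rows)
print(len(cube), query(cube, (ALL, ALL, "c1")))
```

The enumeration is exponential in the number of dimensions, so this is only suitable for toy inputs; it shows why the closed cube is lossless (every cell's count is still answerable) while storing fewer cells than the full cube.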
Keywords :
data compression; parallel algorithms; query processing; MapReduce framework; data cube compression; drill-down semantics; parallel algorithm; parallel closed cubing solutions; query algorithm; Aggregates; Clustering algorithms; Computer science; Concurrent computing; Costs; Data engineering; Information science; Parallel algorithms; Partitioning algorithms; Upper bound; Hadoop; MapReduce; OLAP; closed cube; parallel computation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Science, 2008. ICIS 08. Seventh IEEE/ACIS International Conference on
Conference_Location :
Portland, OR
Print_ISBN :
978-0-7695-3131-1
Type :
conf
DOI :
10.1109/ICIS.2008.63
Filename :
4529804