DocumentCode
1571269
Title
A Parallel Algorithm for Closed Cube Computation
Author
Jinguo You ; Jianqing Xi ; Pingjian Zhang
Author_Institution
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
fYear
2008
Firstpage
95
Lastpage
99
Abstract
Closed cubing is a very efficient algorithm for data cube compression proposed recently in the literature. It losslessly condenses a group of cells into one cell if these cells have the same aggregate value and preserve roll-up/drill-down semantics. Despite its importance, parallel closed cubing solutions for huge data sets are not well studied so far to the best of the authors´ knowledge. This paper presents a parallel closed cube construction and query algorithm over low cost PC clusters using the MapReduce framework. In addition, we proved that with the number of data blocks increases, the closed cubes´ storage size decreases gradually. Thus users can specify the number of data blocks to balance the performance between cubes storage and query time. Experimental study demonstrates that our algorithm is efficient and scalable.
Keywords
data compression; parallel algorithms; query processing; MapReduce framework; data cube compression; drill-down semantics; parallel algorithm; parallel closed cubing solutions; query algorithm; Aggregates; Clustering algorithms; Computer science; Concurrent computing; Costs; Data engineering; Information science; Parallel algorithms; Partitioning algorithms; Upper bound; Hadoop; MapReduce; OLAP; closed cube; parallel computation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Science, 2008. ICIS 08. Seventh IEEE/ACIS International Conference on
Conference_Location
Portland, OR
Print_ISBN
978-0-7695-3131-1
Type
conf
DOI
10.1109/ICIS.2008.63
Filename
4529804
Link To Document