• DocumentCode
    1571269
  • Title

    A Parallel Algorithm for Closed Cube Computation

  • Author

    Jinguo You ; Jianqing Xi ; Pingjian Zhang

  • Author_Institution
    Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
  • fYear
    2008
  • Firstpage
    95
  • Lastpage
    99
  • Abstract
    Closed cubing is a very efficient algorithm for data cube compression proposed recently in the literature. It losslessly condenses a group of cells into one cell if these cells have the same aggregate value and preserve roll-up/drill-down semantics. Despite its importance, parallel closed cubing solutions for huge data sets are not well studied so far to the best of the authors´ knowledge. This paper presents a parallel closed cube construction and query algorithm over low cost PC clusters using the MapReduce framework. In addition, we proved that with the number of data blocks increases, the closed cubes´ storage size decreases gradually. Thus users can specify the number of data blocks to balance the performance between cubes storage and query time. Experimental study demonstrates that our algorithm is efficient and scalable.
  • Keywords
    data compression; parallel algorithms; query processing; MapReduce framework; data cube compression; drill-down semantics; parallel algorithm; parallel closed cubing solutions; query algorithm; Aggregates; Clustering algorithms; Computer science; Concurrent computing; Costs; Data engineering; Information science; Parallel algorithms; Partitioning algorithms; Upper bound; Hadoop; MapReduce; OLAP; closed cube; parallel computation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science, 2008. ICIS 08. Seventh IEEE/ACIS International Conference on
  • Conference_Location
    Portland, OR
  • Print_ISBN
    978-0-7695-3131-1
  • Type

    conf

  • DOI
    10.1109/ICIS.2008.63
  • Filename
    4529804