• DocumentCode
    468288
  • Title

    A Parallel Hierarchical Aggregation Algorithm in High Dimensional Data Warehouse

  • Author

    Hu, Kongfa ; Liu, Jiajia ; Chen, Ling ; Da, Qingli

  • Author_Institution
    Yangzhou Univ., Yangzhou
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    36
  • Lastpage
    40
  • Abstract
    OLAP (on-line analytical processing) queries tend to be complex and ad hoc, often requiring computationally expensive operations such as multi-table joins and aggregation. In the high dimensional data warehouse(DW), we full materialized the data cube impossibly. In this paper, we propose a novel aggregation algorithm, PDHEPA (parallel pre-grouping aggregation based on the dimension hierarchical encoding), to vertically partition a high dimensional dataset into a set of disjoint low dimensional datasets called fragment mini-cubes. PDHEPA uses the small dimension hierarchical encoding and their prefix, so that it can drastically reduce the multi-table join operations. As a result, the method we proposed in this paper can greatly reduce the disk I/Os and highly improve the efficiency of OLAP queries. The analytical and experimental results show that the PDHEPA is more efficient than other existed ones.
  • Keywords
    data warehouses; hierarchical systems; parallel algorithms; query processing; fragment minicubes; hierarchical encoding; high dimensional data warehouse; high dimensional dataset; multitable joins; online analytical processing queries; parallel hierarchical aggregation algorithm; parallel pregrouping aggregation based on the dimension hierarchical encoding; Aggregates; Cities and towns; Computer science; Costs; Data analysis; Data warehouses; Databases; Encoding; Material storage; Partitioning algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.106
  • Filename
    4406197