• DocumentCode
    466985
  • Title

    A Rapid Grouping Aggregation Algorithm Based on the Multi-Dimension Hierarchical Encoding

  • Author

    Hu, Kong-fa; Gong, Zhen-Zhi; Da, Qing-li

  • Author_Institution
    Southeast Univ., Nanjing
  • Volume
    2
  • fYear
    2007
  • fDate
    July 30 2007-Aug. 1 2007
  • Firstpage
    368
  • Lastpage
    373
  • Abstract
    On-Line Analytical Processing (OLAP) refers to technologies that allow users to efficiently retrieve data from a data warehouse for decision support. Data warehouses tend to be extremely large, and queries tend to be complex and ad hoc, often requiring computationally expensive operations such as multi-table joins and aggregation. To address this problem, this paper proposes a novel pre-aggregation algorithm, MDHEGA (Grouping Aggregation based on the Multi-Dimension Hierarchical Encoding). By using small multi-dimension hierarchical encodings and their prefix paths, MDHEGA can rapidly retrieve the matching dimension hierarchical encodings and evaluate the set of query ranges for each dimension. As a result, the algorithm greatly reduces disk I/O and improves the efficiency of OLAP queries. Analytical and experimental results show that MDHEGA is more efficient than existing algorithms.
  • Keywords
    data mining; data warehouses; MDHEGA algorithm; data warehouse; decision support purpose; multidimension hierarchical encoding; on-line analytical processing; rapid grouping aggregation algorithm; Aggregates; Artificial intelligence; Clustering algorithms; Data warehouses; Distributed computing; Encoding; Information retrieval; Multidimensional systems; Software algorithms; Software engineering
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-0-7695-2909-7
  • Type

    conf

  • DOI
    10.1109/SNPD.2007.452
  • Filename
    4287710
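
The abstract describes grouping and range evaluation over hierarchical dimension encodings and their prefix paths, but this record does not give the encoding itself. The following is only a minimal sketch of the general idea of prefix-based hierarchical encoding, not the MDHEGA algorithm from the paper. The bit widths, the `TIME_BITS` and `REGION_BITS` layouts, the sample facts, and all function names are hypothetical and chosen purely for illustration. Each dimension member is packed into one integer, coarsest level first, so that an ancestor is a code prefix and all of its descendants form one contiguous code range.

```python
from collections import defaultdict

# Hypothetical bit widths per hierarchy level, coarsest first (not from the paper).
TIME_BITS = (4, 2, 4)    # year index, quarter, month
REGION_BITS = (3, 5)     # country, city


def encode(path, bits):
    """Pack one index per hierarchy level into a single integer, coarsest level first."""
    code = 0
    for value, width in zip(path, bits):
        if not 0 <= value < (1 << width):
            raise ValueError(f"{value} does not fit in {width} bits")
        code = (code << width) | value
    return code


def prefix(code, bits, level):
    """Return the code prefix that covers the first `level` hierarchy levels."""
    drop = sum(bits[level:])  # bits belonging to the finer levels
    return code >> drop


def code_range(prefix_value, bits, level):
    """Return the inclusive range of full codes that share the given prefix."""
    drop = sum(bits[level:])
    low = prefix_value << drop
    return low, low + (1 << drop) - 1


def group_sum(facts, time_level, region_level):
    """Sum the measure, grouping by a chosen prefix length in each dimension."""
    totals = defaultdict(int)
    for time_code, region_code, measure in facts:
        key = (prefix(time_code, TIME_BITS, time_level),
               prefix(region_code, REGION_BITS, region_level))
        totals[key] += measure
    return dict(totals)


# Illustrative fact rows: (time code, region code, measure).
facts = [
    (encode((7, 0, 1), TIME_BITS), encode((1, 3), REGION_BITS), 10),
    (encode((7, 0, 2), TIME_BITS), encode((1, 4), REGION_BITS), 5),
    (encode((7, 1, 4), TIME_BITS), encode((2, 0), REGION_BITS), 7),
]

# Roll up to (year, country) by keeping only the first level of each code.
by_year_country = group_sum(facts, time_level=1, region_level=1)
```

In this sketch, a roll-up needs no join against the dimension tables: grouping at a coarser level is a bit shift on the stored code, and a selection on an ancestor (for example, one year) becomes a single range test using `code_range`.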