• DocumentCode
    918255
  • Title

    Approximate Query Processing in Cube Streams

  • Author

    Hsieh, Ming-Jyh ; Chen, Ming-Syan ; Yu, Philip S.

  • Author_Institution
    National Taiwan Univ., Taipei
  • Volume
    19
  • Issue
    11
  • fYear
    2007
  • Firstpage
    1557
  • Lastpage
    1570
  • Abstract
    Data cubes have become important components in most data warehouse systems and decision support systems. In such systems, users usually pose very complex queries to the online analytical processing (OLAP) system, and systems usually have to deal with a huge amount of data because of the high dimensionality of the data sets; thus, approximate query processing has emerged as a viable solution. Specifically, the applications of cube streams handle multidimensional data sets in a continuous manner, in contrast to the traditional cube approximation. Such an application collects data events for cube streams online, generates snapshots with limited resources, and keeps the approximated information in a synopsis memory for further analysis. Compared to OLAP applications, applications of cube streams are subject to many more resource constraints on both processing time and memory, and cannot be dealt with by existing methods due to the limited resources. In this paper, we propose the DAWA algorithm, which is a hybrid algorithm of discrete cosine transform (DCT) for data and the discrete wavelet transform (DWT), to approximate cube streams. Our algorithm combines the advantages of the high compression rate of DWT and the low memory cost of DCT. Consequently, DAWA requires a much smaller working buffer and outperforms both DWT-based and DCT-based methods in execution efficiency. Also, it is shown that DAWA provides a good solution for approximate query processing of cube streams with a small working buffer and a short execution time. The optimality of the DAWA algorithm is theoretically proved and empirically demonstrated by our experiments.
  • Keywords
    data mining; discrete cosine transforms; discrete wavelet transforms; query processing; DAWA algorithm; approximate query processing; complex queries; cube streams; data cubes; data events; discrete cosine transform; discrete wavelet transform; hybrid algorithm; online analytical processing; Cellular phones; Costs; Data warehouses; Databases; Decision support systems; Discrete cosine transforms; Discrete wavelet transforms; Information analysis; Multidimensional systems; Query processing; Cube Streams; Data Cubes; Data Streams; OLAP;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.190622
  • Filename
    4339219