DocumentCode
918255
Title
Approximate Query Processing in Cube Streams
Author
Hsieh, Ming-Jyh ; Chen, Ming-Syan ; Yu, Philip S.
Author_Institution
National Taiwan Univ., Taipei
Volume
19
Issue
11
fYear
2007
Firstpage
1557
Lastpage
1570
Abstract
Data cubes have become important components in most data warehouse systems and decision support systems. In such systems, users usually pose very complex queries to the online analytical processing (OLAP) system, and systems usually have to deal with a huge amount of data because of the large dimensionality of the sets; thus, approximating query processing has emerged as a viable solution. Specifically, the applications of cube streams handle multidimensional data sets in a continuous manner in contrast to the traditional cube approximation. Such an application collects data events for cube streams online, generates snapshots with limited resources, and keeps the approximated information in a synopsis memory for further analysis. Compared to the OLAP applications, applications of cube streams are subject to many more resource constraints on both the processing time and the memory and cannot be dealt with by existing methods due to the limited resources. In this paper, we propose the DAWA algorithm, which is a hybrid algorithm of discrete cosine transform (DCT) for data and the discrete wavelet transform (DWT), to approximate cube streams. Our algorithm combines the advantages of the high compression rate of DWT and the low memory cost of DCT. Consequently, DAWA requires much smaller working buffer and outperforms both DWT-based and DCT-based methods in execution efficiency. Also, it is shown that DAWA provides a good solution for an approximate query processing of cube streams with a small working buffer and a short execution time. The optimality of the DAWA algorithm is theoretically proved and empirically demonstrated by our experiments.
Keywords
data mining; discrete cosine transforms; discrete wavelet transforms; query processing; DAWA algorithm; approximate query processing; complex queries; cube streams; data cubes; data events; discrete cosine transform; discrete wavelet transform; hybrid algorithm; online analytical processing; Cellular phones; Costs; Data warehouses; Databases; Decision support systems; Discrete cosine transforms; Discrete wavelet transforms; Information analysis; Multidimensional systems; Query processing; Cube Streams; Data Cubes; Data Streams; OLAP;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2007.190622
Filename
4339219
Link To Document