DocumentCode :
918255
Title :
Approximate Query Processing in Cube Streams
Author :
Hsieh, Ming-Jyh ; Chen, Ming-Syan ; Yu, Philip S.
Author_Institution :
National Taiwan Univ., Taipei
Volume :
19
Issue :
11
fYear :
2007
Firstpage :
1557
Lastpage :
1570
Abstract :
Data cubes have become important components in most data warehouse systems and decision support systems. In such systems, users usually pose very complex queries to the online analytical processing (OLAP) system, and systems usually have to deal with a huge amount of data because of the large dimensionality of the sets; thus, approximating query processing has emerged as a viable solution. Specifically, the applications of cube streams handle multidimensional data sets in a continuous manner in contrast to the traditional cube approximation. Such an application collects data events for cube streams online, generates snapshots with limited resources, and keeps the approximated information in a synopsis memory for further analysis. Compared to the OLAP applications, applications of cube streams are subject to many more resource constraints on both the processing time and the memory and cannot be dealt with by existing methods due to the limited resources. In this paper, we propose the DAWA algorithm, which is a hybrid algorithm of discrete cosine transform (DCT) for data and the discrete wavelet transform (DWT), to approximate cube streams. Our algorithm combines the advantages of the high compression rate of DWT and the low memory cost of DCT. Consequently, DAWA requires much smaller working buffer and outperforms both DWT-based and DCT-based methods in execution efficiency. Also, it is shown that DAWA provides a good solution for an approximate query processing of cube streams with a small working buffer and a short execution time. The optimality of the DAWA algorithm is theoretically proved and empirically demonstrated by our experiments.
Keywords :
data mining; discrete cosine transforms; discrete wavelet transforms; query processing; DAWA algorithm; approximate query processing; complex queries; cube streams; data cubes; data events; discrete cosine transform; discrete wavelet transform; hybrid algorithm; online analytical processing; Cellular phones; Costs; Data warehouses; Databases; Decision support systems; Discrete cosine transforms; Discrete wavelet transforms; Information analysis; Multidimensional systems; Query processing; Cube Streams; Data Cubes; Data Streams; OLAP;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2007.190622
Filename :
4339219
Link To Document :
بازگشت