DocumentCode
935181
Title
A wavelet framework for adapting data cube views for OLAP
Author
Smith, John R. ; Li, Chung-Sheng ; Jhingran, Anant
Author_Institution
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Volume
16
Issue
5
fYear
2004
fDate
5/1/2004 12:00:00 AM
Firstpage
552
Lastpage
565
Abstract
This article presents a method for adaptively representing multidimensional data cubes using wavelet view elements in order to more efficiently support data analysis and querying involving aggregations. The proposed method decomposes the data cubes into an indexed hierarchy of wavelet view elements. The view elements differ from traditional data cube cells in that they correspond to partial and residual aggregations of the data cube. The view elements provide highly granular building blocks for synthesizing the aggregated and range-aggregated views of the data cubes. We propose a strategy for selectively materializing alternative sets of view elements based on the patterns of access of views. We present a fast and optimal algorithm for selecting a non-expansive set of wavelet view elements that minimizes the average processing cost for supporting a population of queries of data cube views. We also present a greedy algorithm for allowing the selective materialization of a redundant set of view element sets which, for measured increases in storage capacity, further reduces processing costs. Experiments and analytic results show that the wavelet view element framework performs better in terms of lower processing and storage cost than previous methods that materialize and store redundant views for online analytical processing (OLAP).
Keywords
data mining; very large databases; wavelet transforms; OLAP; aggregated views; data analysis; data cube views; granular building blocks; greedy algorithm; indexed hierarchy; multidimensional data cubes; multidimensional data management; online analytical processing; optimal algorithm; range-aggregated views; selective materialization; storage capacity; wavelet framework; wavelet view elements; Aggregates; Cost function; Data analysis; Greedy algorithms; Material storage; Multidimensional systems; Performance analysis; Performance gain; Relational databases; Wavelet analysis;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2004.1277817
Filename
1277817
Link To Document