Title :
A cluster architecture for parallel data warehousing
Author :
Dehne, Frank ; Eavis, Todd ; Rau-Chaplin, Andrew
Author_Institution :
Carleton Univ., Ottawa, Ont., Canada
Abstract :
Describes the parallel, cluster-based implementation of an algorithm for the computation of a database operator known as the datacube. Though a number of efficient sequential algorithms have recently been proposed for this problem, very little research effort has been expended upon cost-effective parallelization techniques. Our approach builds directly upon the existing sequential proposals and is designed to be both load-balanced and communication-efficient. We also provide experimental results that demonstrate the viability of our technique under a variety of test conditions. Ultimately, we show that parallel performance relative to the underlying sequential algorithm (speedup) is near-optimal
Keywords :
data warehouses; parallel algorithms; parallel architectures; parallel databases; resource allocation; software performance evaluation; workstation clusters; cluster architecture; cost-effective parallelization techniques; database operator; datacube computation; efficient sequential algorithms; load-balanced communication-efficient approach; near-optimal speedup; parallel cluster-based algorithm implementation; parallel data warehousing; parallel performance; test conditions; Clustering algorithms; Concurrent computing; Disaster management; Internet; Parallel algorithms; Partitioning algorithms; Technology management; Testing; Visual databases; Warehousing;
Conference_Titel :
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location :
Brisbane, Qld.
Print_ISBN :
0-7695-1010-8
DOI :
10.1109/CCGRID.2001.923189