DocumentCode :
2394718
Title :
A model for distributing and querying a data warehouse on a computing grid
Author :
Wehrle, Pascal ; Miquel, Maryvonne ; Tchounikine, Anne
Author_Institution :
LIRIS, Lyon, France
Volume :
1
fYear :
2005
fDate :
20-22 July 2005
Firstpage :
203
Abstract :
Data warehouses store large volumes of data according to a multidimensional model with dimensions representing different axes of analysis. OLAP systems (online analytical processing) provide the ability to interactively explore the data warehouse. Rising volumes and complexity of data favor the use of more powerful distributed computing architectures. Computing grids in particular are built for decentralized management of heterogeneous distributed resources. Their lack of centralized control however conflicts with classic centralized data warehouse models. To take advantage of a computing grid infrastructure to operate a data warehouse, several problems need to be solved. First, the warehouse data must be uniquely identified and judiciously partitioned to allow efficient distribution, querying and exchange among the nodes of the grid. We propose a data model based on "chunks" as atomic entities of warehouse data that can be uniquely identified. We then build contiguous blocks of these chunks to obtain suitable fragments of the data warehouse. The fragments stored on each grid node must be indexed in a uniform way to effectively interact with existing grid services. Our indexing structure consists of a lattice structure mapping queries to warehouse fragments and a specialized spatial index structure formed by X-trees providing the information necessary for optimized query evaluation plans.
Keywords :
data mining; data models; data warehouses; grid computing; query processing; tree data structures; OLAP system; X-trees; centralized control; data complexity; data model; data warehouse; distributed computing architecture; grid computing; heterogeneous distributed resource; lattice structure; multidimensional model; online analytical processing; optimized query evaluation plan; spatial index structure; Centralized control; Computer architecture; Data models; Data warehouses; Distributed computing; Grid computing; Multidimensional systems; Power system management; Power system modeling; Resource management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on
ISSN :
1521-9097
Print_ISBN :
0-7695-2281-5
Type :
conf
DOI :
10.1109/ICPADS.2005.35
Filename :
1531128
Link To Document :
بازگشت