Title :
On the integration of autonomous data marts
Author :
Cabibbo, Luca ; Torlone, Riccardo
Author_Institution :
Dipt. di Informatica e Automazione, Roma Tre Universita, Rome, Italy
Abstract :
We address the problem of integrating a federation of dimensional data marts. This problem arises when, e.g., a large organization (or a federation thereof) needs to combine independently developed data warehouses. We show that this problem can be tackled in a systematic way because of two main reasons. First, data marts are structured in a rather uniform way, along dimensions and facts. Second, data quality in data marts is usually higher than in generic databases, since they are obtained by reconciling several data sources. Our scenario of reference is a federation (i.e., a logical integration) of various data marts, which we need to query in a unified way, that is, by means of drill-across operations. We propose a novel notion of dimension compatibility and characterize its general property. We then show the significance of dimension compatibility in performing drill-across queries over autonomous data marts. We also discuss general strategies for the integration of data marts.
Keywords :
data structures; data warehouses; distributed databases; autonomous data mart integration; data quality; data sources; data structure; data warehouses; dimension compatibility; dimensional data marts; drill-across operations; drill-across queries; generic databases; Buildings; Cleaning; Conference management; Data analysis; Data warehouses; Databases; Marketing and sales; Multidimensional systems; Performance analysis; Warehousing;
Conference_Titel :
Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
Print_ISBN :
0-7695-2146-0
DOI :
10.1109/SSDM.2004.1311214