DocumentCode :
1961584
Title :
Practical lineage tracing in data warehouses
Author :
Cui, Yingwei ; Widom, Jennifer
Author_Institution :
Dept. of Comput. Sci., Stanford Univ., CA, USA
fYear :
2000
fDate :
2000
Firstpage :
367
Lastpage :
378
Abstract :
We consider the view data lineage problem in a warehousing environment: for a given data item in a materialized warehouse view, we want to identify the set of source data items that produced the view item. We formalize the problem and we present a lineage tracing algorithm for relational views with aggregation. Based on our tracing algorithm, we propose a number of schemes for storing auxiliary views that enable consistent and efficient lineage tracing in a multi-source data warehouse. We report on a performance study of the various schemes, identifying which schemes perform best in which settings. Based on our results, we have implemented a lineage tracing package in the WHIPS data warehousing system prototype at Stanford. With this package, users can select view tuples of interest, then efficiently “drill through” to examine the exact source tuples that produced the view tuples of interest
Keywords :
data mining; data warehouses; query processing; relational databases; software performance evaluation; WHIPS system; aggregation; data mining; data warehouses; lineage tracing algorithm; materialized warehouse view; performance study; relational views; source data items; view data lineage problem; view tuples; Costs; Data analysis; Data mining; Data warehouses; Databases; Information analysis; Packaging; Prototypes; Reactive power; Warehousing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2000. Proceedings. 16th International Conference on
Conference_Location :
San Diego, CA
ISSN :
1063-6382
Print_ISBN :
0-7695-0506-6
Type :
conf
DOI :
10.1109/ICDE.2000.839437
Filename :
839437
Link To Document :
بازگشت