DocumentCode :
2206173
Title :
Tracking and Sketching Distributed Data Provenance
Author :
Malik, Tanu ; Nistor, Ligia ; Gehani, Ashish
Author_Institution :
Univ. of Chicago, Chicago, IL, USA
fYear :
2010
fDate :
7-10 Dec. 2010
Firstpage :
190
Lastpage :
197
Abstract :
Current provenance collection systems typically gather metadata on remote hosts and submit it to a central server. In contrast, several data-intensive scientific applications require a decentralized architecture in which each host maintains an authoritative local repository of the provenance metadata gathered on that host. The latter approach allows the system to handle the large amounts of metadata generated when auditing occurs at fine granularity, and allows users to retain control over their provenance records. The decentralized architecture, however, increases the complexity of auditing, tracking, and querying distributed provenance. We describe a system for capturing data provenance in distributed applications, and the use of provenance sketches to optimize subsequent data provenance queries. Experiments with data gathered from distributed workflow applications demonstrate the feasibility of a decentralized provenance management system and improvements in the efficiency of provenance queries.
Keywords :
distributed databases; meta data; query formulation; auditing; authoritative local repository; central server; data-intensive scientific applications; decentralized architecture; distributed data provenance; metadata; provenance management system; provenance records; querying; remote hosts; tracking; Computational modeling; Computers; Data models; Distributed databases; Electronic mail; Monitoring; Auditing; Distributed; Lineage; Pedigree; Provenance; Sketch;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
e-Science (e-Science), 2010 IEEE Sixth International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4244-8957-2
Electronic_ISBN :
978-0-7695-4290-4
Type :
conf
DOI :
10.1109/eScience.2010.51
Filename :
5693917