Title :
Recording Process Documentation for Provenance
Author :
Groth, Paul ; Moreau, Luc
Author_Institution :
Inf. Sci. Inst., Univ. of Southern California, Marina del Rey, CA, USA
Abstract :
Scientific and business communities are adopting large-scale distributed systems as a means to solve a wide range of resource-intensive tasks. These communities also have requirements in terms of provenance. We define the provenance of a result produced by a distributed system as the process that led to that result. This paper describes a protocol for recording documentation of a distributed system´s execution. The distributed protocol guarantees that documentation with characteristics suitable for accurately determining the provenance of results is recorded. These characteristics are confirmed through a number of proofs based on an abstract state machine formalization.
Keywords :
distributed processing; program compilers; protocols; system documentation; abstract state machine formalization; distributed protocol; large-scale distributed systems; provenance; recording process documentation; resource-intensive tasks; Distributed debugging; Distributed systems; Information Storage and Retrieval; Information Technology and grids; Provenance; Systems and Software; data protocols; data protocols.; distributed systems; grids; lineage; provenance;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Conference_Location :
10/10/2008 12:00:00 AM
DOI :
10.1109/TPDS.2008.215