DocumentCode
170287
Title
Study in Usefulness of Middleware-Only Provenance
Author
Quan Zhou ; Ghoshal, Devarshi ; Plale, Beth
Author_Institution
Sch. of Inf. & Comput., Indiana Univ. Bloomington, Bloomington, IN, USA
Volume
1
fYear
2014
fDate
20-24 Oct. 2014
Firstpage
215
Lastpage
222
Abstract
Data provenance is the lineage of a digital artifact or object. Its capture in workflow-controlled distributed applications is well studied but less is known about quality of provenance captured solely through existing control infrastructures (i.e., middleware frameworks used for high throughput computing). We study completeness of provenance in case where information is only available from the middleware layer. We use WorkQueue to validate our model. Our evaluation shows that provenance captured from a middleware framework is sufficient to represent the existence of output data and trace certain failures independent of the application semantics. We show the method´s limitations as well.
Keywords
middleware; program diagnostics; WorkQueue; application semantics; data provenance; digital artifact; middleware framework; middleware-only provenance; output data; trace certain failures; workflow-controlled distributed applications; Correlation; Data mining; Distributed databases; Engines; Instruments; Middleware; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
e-Science (e-Science), 2014 IEEE 10th International Conference on
Conference_Location
Sao Paulo
Print_ISBN
978-1-4799-4288-6
Type
conf
DOI
10.1109/eScience.2014.49
Filename
6972267
Link To Document