• DocumentCode
    170287
  • Title

    Study in Usefulness of Middleware-Only Provenance

  • Author

    Quan Zhou ; Ghoshal, Devarshi ; Plale, Beth

  • Author_Institution
    Sch. of Inf. & Comput., Indiana Univ. Bloomington, Bloomington, IN, USA
  • Volume
    1
  • fYear
    2014
  • fDate
    20-24 Oct. 2014
  • Firstpage
    215
  • Lastpage
    222
  • Abstract
    Data provenance is the lineage of a digital artifact or object. Its capture in workflow-controlled distributed applications is well studied but less is known about quality of provenance captured solely through existing control infrastructures (i.e., middleware frameworks used for high throughput computing). We study completeness of provenance in case where information is only available from the middleware layer. We use WorkQueue to validate our model. Our evaluation shows that provenance captured from a middleware framework is sufficient to represent the existence of output data and trace certain failures independent of the application semantics. We show the method´s limitations as well.
  • Keywords
    middleware; program diagnostics; WorkQueue; application semantics; data provenance; digital artifact; middleware framework; middleware-only provenance; output data; trace certain failures; workflow-controlled distributed applications; Correlation; Data mining; Distributed databases; Engines; Instruments; Middleware; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2014 IEEE 10th International Conference on
  • Conference_Location
    Sao Paulo
  • Print_ISBN
    978-1-4799-4288-6
  • Type

    conf

  • DOI
    10.1109/eScience.2014.49
  • Filename
    6972267