• DocumentCode
    588202
  • Title

    Provenance analysis: Towards quality provenance

  • Author

    You-Wei Cheah ; Plale, Beth

  • Author_Institution
    Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
  • fYear
    2012
  • fDate
    8-12 Oct. 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Data provenance, a key piece of metadata that describes the lifecycle of a data product, is crucial in aiding scientists to better understand and facilitate reproducibility and reuse of scientific results. Provenance collection systems often capture provenance on the fly and the protocol between application and provenance tool may not be reliable. As a result, data provenance can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues in data provenance. We also establish crucial quality dimensions that are especially critical for the evaluation of provenance quality. We analyze synthetic and real-world provenance based on these quality dimensions and summarize our contributions to provenance quality.
  • Keywords
    data handling; meta data; natural sciences computing; crucial quality dimensions; data product lifecycle; data provenance analysis; metadata; provenance collection systems; provenance quality evaluation; real-world provenance; synthetic provenance; Databases; Image edge detection; Microwave radiometry; Oceans; Snow; Standards; Data Provenance; Provenance Analysis; Provenance Quality; Scientific Workflows;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    E-Science (e-Science), 2012 IEEE 8th International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    978-1-4673-4467-8
  • Type

    conf

  • DOI
    10.1109/eScience.2012.6404480
  • Filename
    6404480