• DocumentCode
    70902
  • Title

    Implementation of Geospatial Data Provenance in a Web Service Workflow Environment With ISO 19115 and ISO 19115-2 Lineage Model

  • Author

    Liping Di ; Yuanzheng Shao ; Lingjun Kang

  • Author_Institution
    Dept. of Geogr. & Geo Inf. Sci., George Mason Univ., Fairfax, VA, USA
  • Volume
    51
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    5082
  • Lastpage
    5089
  • Abstract
    Data provenance, also called data lineage, records the derivation history of a data product. In the earth science domain, geospatial data provenance is important because it plays a significant role in data quality and usability evaluation, data trail audition, workflow replication, and product reproducibility. The generation of the geospatial provenance metadata is usually coupled with the execution of geo-processing workflow. Their symbiotic relationship makes them complementary to each other and promises great benefit once they are integrated. However, the heterogeneity of data and computing resources in the distributed environment constructed under the service-oriented architecture (SOA) brings a great challenge to resource integration. Specifically, the issues, such as the lack of interoperability and compatibility among provenance metadata models and between provenance and workflow, create obstacles for the integration of provenance, and geo-processing workflow. In order to tackle these issues, on one hand, this paper breaks the provenance heterogeneity through recording provenance information in a standard lineage model defined in ISO 19115:2003 and ISO 19115-2:2009 standards. On the other hand, this paper bridges the gap between provenance and geo-processing workflow through extending both workflow language and service interface, making it possible for the automatic capture of provenance information in the geospatial web service environment. The proposed method is implemented in the GeoBrain, a SOA-based geospatial web service system. The testing result from implementation shows that the geospatial provenance information is successfully captured throughout the life cycle of geo-processing workflows and properly recorded in the ISO standard lineage model.
  • Keywords
    ISO standards; Web services; geographic information systems; geophysical techniques; geophysics computing; Earth science domain; ISO 19115-2 lineage model; ISO 19115-2:2009 standard; ISO 19115:2003 standard; ISO standard lineage model; SOA-based geospatial web service system; computing resource; data resource; data trail audition; distributed environment; geo-processing workflow; geospatial provenance metadata; provenance metadata models; resource integration; service interface; service-oriented architecture; workflow language; workflow replication; Data models; Geoscience; Geospatial analysis; ISO standards; Service-oriented architecture; Geo-processing workflow; geospatial provenance; lineage; science reproducibility;
  • fLanguage
    English
  • Journal_Title
    Geoscience and Remote Sensing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0196-2892
  • Type

    jour

  • DOI
    10.1109/TGRS.2013.2248740
  • Filename
    6517971