DocumentCode
70902
Title
Implementation of Geospatial Data Provenance in a Web Service Workflow Environment With ISO 19115 and ISO 19115-2 Lineage Model
Author
Liping Di ; Yuanzheng Shao ; Lingjun Kang
Author_Institution
Dept. of Geogr. & Geo Inf. Sci., George Mason Univ., Fairfax, VA, USA
Volume
51
Issue
11
fYear
2013
fDate
Nov. 2013
Firstpage
5082
Lastpage
5089
Abstract
Data provenance, also called data lineage, records the derivation history of a data product. In the earth science domain, geospatial data provenance is important because it plays a significant role in data quality and usability evaluation, data trail audition, workflow replication, and product reproducibility. The generation of the geospatial provenance metadata is usually coupled with the execution of geo-processing workflow. Their symbiotic relationship makes them complementary to each other and promises great benefit once they are integrated. However, the heterogeneity of data and computing resources in the distributed environment constructed under the service-oriented architecture (SOA) brings a great challenge to resource integration. Specifically, the issues, such as the lack of interoperability and compatibility among provenance metadata models and between provenance and workflow, create obstacles for the integration of provenance, and geo-processing workflow. In order to tackle these issues, on one hand, this paper breaks the provenance heterogeneity through recording provenance information in a standard lineage model defined in ISO 19115:2003 and ISO 19115-2:2009 standards. On the other hand, this paper bridges the gap between provenance and geo-processing workflow through extending both workflow language and service interface, making it possible for the automatic capture of provenance information in the geospatial web service environment. The proposed method is implemented in the GeoBrain, a SOA-based geospatial web service system. The testing result from implementation shows that the geospatial provenance information is successfully captured throughout the life cycle of geo-processing workflows and properly recorded in the ISO standard lineage model.
Keywords
ISO standards; Web services; geographic information systems; geophysical techniques; geophysics computing; Earth science domain; ISO 19115-2 lineage model; ISO 19115-2:2009 standard; ISO 19115:2003 standard; ISO standard lineage model; SOA-based geospatial web service system; computing resource; data resource; data trail audition; distributed environment; geo-processing workflow; geospatial provenance metadata; provenance metadata models; resource integration; service interface; service-oriented architecture; workflow language; workflow replication; Data models; Geoscience; Geospatial analysis; ISO standards; Service-oriented architecture; Geo-processing workflow; geospatial provenance; lineage; science reproducibility;
fLanguage
English
Journal_Title
Geoscience and Remote Sensing, IEEE Transactions on
Publisher
ieee
ISSN
0196-2892
Type
jour
DOI
10.1109/TGRS.2013.2248740
Filename
6517971
Link To Document