Title :
Semantic Provenance for eScience: Managing the Deluge of Scientific Data
Author :
Sahoo, Satya S. ; Sheth, Amit ; Henson, Cory
Author_Institution :
Kno.e.sis Center, Wright State Univ., Dayton, OH
Abstract :
Provenance information in eScience is metadata that's critical to effectively manage the exponentially increasing volumes of scientific data from industrial-scale experiment protocols. Semantic provenance, based on domain-specific provenance ontologies, lets software applications unambiguously interpret data in the correct context. The semantic provenance framework for eScience data comprises expressive provenance information and domain-specific provenance ontologies and applies this information to data management. The authors' "two degrees of separation" approach advocates the creation of high-quality provenance information using specialized services. In contrast to workflow engines generating provenance information as a core functionality, the specialized provenance services are integrated into a scientific workflow on demand. This article describes an implementation of the semantic provenance framework for glycoproteomics.
Keywords :
information management; natural sciences computing; data management; eScience; industrial-scale experiment protocols; meta data; provenance information; provenance ontologies; scientific data; semantic provenance; software applications; Application software; Cities and towns; Computer industry; Engines; Information management; Ontologies; Proteomics; Protocols; Software tools; Throughput; Spade; cyberinfrastructure; metadata; provenance
Journal_Title :
Internet Computing, IEEE
DOI :
10.1109/MIC.2008.86