Title :
Measuring Data Believability: A Provenance Approach
Author :
Prat, Nicolas ; Madnick, Stuart
Author_Institution :
ESSEC Bus. Sch., Cergy
Abstract :
Data quality is crucial for operational efficiency and sound decision making. This paper focuses on believability, a major aspect of quality, measured along three dimensions: trustworthiness, reasonableness, and temporality. We ground our approach on provenance, i.e. the origin and subsequent processing history of data. We present our provenance model and our approach for computing believability based on provenance metadata. The approach is structured into three increasingly complex building blocks: (1) definition of metrics for assessing the believability of data sources, (2) definition of metrics for assessing the believability of data resulting from one process run and (3) assessment of believability based on all the sources and processing history of data. We illustrate our approach with a scenario based on Internet data. To our knowledge, this is the first work to develop a precise approach to measuring data believability and making explicit use of provenance-based measurements.
Keywords :
security of data; Internet data; data believability measurement; data quality; data reasonableness; data security; data temporality; data trustworthiness; provenance metadata; Bioinformatics; Computational intelligence; Computational modeling; Decision making; Guidelines; History; Internet; Quality assessment; Terrorism; Warehousing;
Conference_Titel :
Hawaii International Conference on System Sciences, Proceedings of the 41st Annual
Conference_Location :
Waikoloa, HI
DOI :
10.1109/HICSS.2008.243