DocumentCode
2934261
Title
Adaptive Inference of Fine-grained Data Provenance to Achieve High Accuracy at Lower Storage Costs
Author
Huq, Mohammad Rezwanul ; Wombacher, Andreas ; Apers, Peter M G
Author_Institution
Comput. Sci. Dept., Univ. of Twente, Enschede, Netherlands
fYear
2011
fDate
5-8 Dec. 2011
Firstpage
202
Lastpage
209
Abstract
In stream data processing, data arrives continuously and is processed by decision making, process control and e-science applications. To control and monitor these applications, reproducibility of result is a vital requirement. However, it requires massive amount of storage space to store fine-grained provenance data especially for those transformations with overlapping sliding windows. In this paper, we propose techniques which can significantly reduce storage costs and can achieve high accuracy. Our evaluation shows that adaptive inference technique can achieve almost 100% accurate provenance information for a given dataset at lower storage costs than the other techniques. Moreover, we present a guideline about the usage of different provenance collection techniques described in this paper based on the transformation operation and stream characteristics.
Keywords
data handling; inference mechanisms; adaptive inference technique; decision making application; e-science application; fine-grained data provenance; process control application; provenance collection technique; storage cost reduction; stream characteristics; stream data processing; transformation operation; Accuracy; Conductivity; Data models; Data processing; Delay; Inference algorithms; Switches; Fine-grained data provenance; Inference; Storage; Stream Data;
fLanguage
English
Publisher
ieee
Conference_Titel
E-Science (e-Science), 2011 IEEE 7th International Conference on
Conference_Location
Stockholm
Print_ISBN
978-1-4577-2163-2
Type
conf
DOI
10.1109/eScience.2011.36
Filename
6123279
Link To Document