DocumentCode :
170305
Title :
Observational Data Patterns for Time Series Data Quality Assessment
Author :
Pastorello, G. ; Agarwal, D. ; Samak, T. ; Poindexter, C. ; Faybishenko, B. ; Gunter, D. ; Hollowgrass, R. ; Papale, D. ; Trotta, C. ; Ribeca, A. ; Canfora, E.
Author_Institution :
Lawerence Berkeley Nat. Lab., Berkeley, CA, USA
Volume :
1
fYear :
2014
fDate :
20-24 Oct. 2014
Firstpage :
271
Lastpage :
278
Abstract :
Observational data are fundamental for scientific research in almost any domain. Recent advances in sensor and data management technologies are enabling unprecedented amounts of observational data to be collected and analyzed. However, an essential part of using observational data is not currently as scalable as data collection and analysis methods: data quality assurance and control. While specialized tools for very narrow domains do exist, general methods are harder to create. This paper explores the identification of data issues that lead to the creation of data tests and tools to perform data quality control activities. Developing this identification step in a systematic manner allows for better and more general quality control tools. As our case study, we use carbon, water, and energy fluxes as well as micro-meteorological data collected at field sites that are part of FLUXNET, a network of over 400 ecosystem-level monitoring stations. In an effort toward the release of a new global data set of fluxes, we are doing data quality control for these data. The experience from this work led to the creation of a catalog of issues identified in the data. This paper presents this catalog and its generalization into a set of patterns of data quality issues that can be detected in observational data.
Keywords :
data analysis; geophysics computing; meteorology; quality control; time series; FLUXNET; carbon fluxes; data analysis methods; data collection; data management technologies; data quality assurance; data quality control tools; data quality issues; data tests; ecosystem-level monitoring stations; energy fluxes; micrometeorological data; observational data patterns; sensor management technologies; time series data quality assessment; water fluxes; Calibration; Heating; Instruments; Quality assessment; Soil; Wind speed; FLUXNET; data quality; observational data patterns; time series data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
e-Science (e-Science), 2014 IEEE 10th International Conference on
Conference_Location :
Sao Paulo
Print_ISBN :
978-1-4799-4288-6
Type :
conf
DOI :
10.1109/eScience.2014.45
Filename :
6972274
Link To Document :
بازگشت