• DocumentCode
    170305
  • Title

    Observational Data Patterns for Time Series Data Quality Assessment

  • Author

    Pastorello, G. ; Agarwal, D. ; Samak, T. ; Poindexter, C. ; Faybishenko, B. ; Gunter, D. ; Hollowgrass, R. ; Papale, D. ; Trotta, C. ; Ribeca, A. ; Canfora, E.

  • Author_Institution
    Lawerence Berkeley Nat. Lab., Berkeley, CA, USA
  • Volume
    1
  • fYear
    2014
  • fDate
    20-24 Oct. 2014
  • Firstpage
    271
  • Lastpage
    278
  • Abstract
    Observational data are fundamental for scientific research in almost any domain. Recent advances in sensor and data management technologies are enabling unprecedented amounts of observational data to be collected and analyzed. However, an essential part of using observational data is not currently as scalable as data collection and analysis methods: data quality assurance and control. While specialized tools for very narrow domains do exist, general methods are harder to create. This paper explores the identification of data issues that lead to the creation of data tests and tools to perform data quality control activities. Developing this identification step in a systematic manner allows for better and more general quality control tools. As our case study, we use carbon, water, and energy fluxes as well as micro-meteorological data collected at field sites that are part of FLUXNET, a network of over 400 ecosystem-level monitoring stations. In an effort toward the release of a new global data set of fluxes, we are doing data quality control for these data. The experience from this work led to the creation of a catalog of issues identified in the data. This paper presents this catalog and its generalization into a set of patterns of data quality issues that can be detected in observational data.
  • Keywords
    data analysis; geophysics computing; meteorology; quality control; time series; FLUXNET; carbon fluxes; data analysis methods; data collection; data management technologies; data quality assurance; data quality control tools; data quality issues; data tests; ecosystem-level monitoring stations; energy fluxes; micrometeorological data; observational data patterns; sensor management technologies; time series data quality assessment; water fluxes; Calibration; Heating; Instruments; Quality assessment; Soil; Wind speed; FLUXNET; data quality; observational data patterns; time series data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Science (e-Science), 2014 IEEE 10th International Conference on
  • Conference_Location
    Sao Paulo
  • Print_ISBN
    978-1-4799-4288-6
  • Type

    conf

  • DOI
    10.1109/eScience.2014.45
  • Filename
    6972274