• DocumentCode
    2000418
  • Title

    A Data Intensive Statistical Aggregation Engine: A Case Study for Gridded Climate Records

  • Author

    Chapman, D. ; Simon, Tyler A. ; Nguyen, P. ; Halem, Milton

  • Author_Institution
    Comput. Sci. & Electr. Eng. Dept., Univ. of Maryland Baltimore County (UMBC), Baltimore, MD, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    2157
  • Lastpage
    2164
  • Abstract
    Satellite derived climate instrument records are often highly structured and conform to the "Data-Cube" topology. However, data scales on the order of tens to hundreds of Terabytes make it more difficult to perform the rigorous statistical aggregation and analytics necessary to investigate how our climate is changing over time and space. It is especially cumbersome to supply the full derivation (provenance) of this analysis, as is increasingly required by scientific conferences and journals. In this paper, we address our approach toward the creation of a 55 Terabyte decadal record of Outgoing Long wave Spectrum (OLS) from the NASA Atmospheric Infrared Sounder (AIRS), and describe our open source data-intensive statistical aggregation engine "Gridderama" intended primarily for climate trend analysis, and may be applicable to other aggregation problems involving large structured datasets.
  • Keywords
    artificial satellites; data handling; geophysics computing; parallel processing; public domain software; statistical analysis; topology; AIRS; Gridderama; NASA Atmospheric Infrared Sounder; OLS; Outgoing Longwave Spectrum; Satellite derived climate instrument records; climate trend analysis; data intensive statistical aggregation engine; data-cube topology; gridded climate records; large structured datasets; open source data-intensive statistical aggregation engine; rigorous statistical aggregation; rigorous statistical analytics; Arrays; Engines; Instruments; Market research; Meteorology; NASA; Runtime environment; Aggregation; Big-data; Gridderama; Scientific; Workflow;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.87
  • Filename
    6651122