• DocumentCode
    1999951
  • Title

    I/O Containers: Managing the Data Analytics and Visualization Pipelines of High End Codes

  • Author

    Dayal, Jai ; Jianting Cao ; Eisenhauer, Greg ; Schwan, Karsten ; Wolf, Michael ; Fang Zheng ; Abbasi, Hasan ; Klasky, Scott ; Podhorszki, Norbert ; Lofstead, Jay

  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    2015
  • Lastpage
    2024
  • Abstract
    Lack of I/O scalability is known to cause measurable slowdowns for large-scale scientific applications running on high end machines. This is prompting researchers to devise ´I/O staging´ methods in which outputs are processed via online analysis and visualization methods to support desired science outcomes. Organized as online workflows and carried out in I/O pipelines, these analysis components run concurrently with science simulations, often using a smaller set of nodes on the high end machine termed ´staging areas´. This paper presents a new approach to dealing with several challenges arising for such online analytics, including: how to efficiently run multiple analytics components on staging area resources providing them with the levels of end-to-end performance they need and how to manage staging resources when analytics actions change due to user or data-dependent behavior. Our approach designs and implements middleware constructs that delineate and manage I/O pipeline resources called ´I/O Containers´. Experimental evaluations of containers with realistic scientific applications demonstrate the feasibility and utility of the approach.
  • Keywords
    data analysis; data visualisation; middleware; pipeline processing; resource allocation; software performance evaluation; I-O pipeline resource management; I-O scalability; I-O staging methods; I/O containers; data analytics; data visualization pipelines; data-dependent behavior; end-to-end performance; high end codes; high end machine; high end machines; large-scale scientific applications; middleware constructs; online analysis methods; online visualization methods; online workflows; staging area resources; Analytical models; Computational modeling; Containers; Data models; Data visualization; Monitoring; Pipelines; Data Analytics; Data Staging; Runtime Management; Scalable I/O; Visualization; in-Situ; resource sharing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.198
  • Filename
    6651106