Title :
Supporting computational data model representation with high-performance I/O in parallel netCDF
Author :
Gao, Kui ; Jin, Chen ; Choudhary, Alok ; Liao, Wei-keng
Abstract :
Parallel computational scientific applications have been described by their computation and communication patterns. From a storage and I/O perspective, these applications can also be grouped into separate data models based on the way data is organized and accessed during simulation, analysis, and visualization. Parallel netCDF is a popular library used in many scientific applications to store scientific datasets and provides high-performance parallel I/O. Although the metadata-rich netCDF file format can effectively store and describe regular multi-dimensional array datasets, it does not address the full range of current and future computational science data models. In this paper, we present a new storage scheme in Parallel netCDF to represent a broad variety of data models used in modern computational scientific applications. This scheme also allows concurrent metadata construction for different data objects from multiple groups of application processes, an important feature in obtaining a high degree of I/O parallelism for data models exhibiting irregular data distribution. Furthermore, we employ non-blocking I/O functions to aggregate irregularly distributed data requests into large, contiguous data requests, to achieve high-performance I/O. Using an example of adaptive mesh refinement data model, we demonstrate the proposed scheme can produce scalable performance results for both data and metadata creation and access.
Keywords :
data structures; input-output programs; meta data; parallel processing; computational data model representation; high-performance I/O; metadata construction; metadata-rich netCDF file format; multidimensional array datasets; netCDF file format; parallel computational scientific applications; parallel netCDF; pattern communication; Arrays; Collaboration; Data models; Distributed databases; Libraries; Optimization; Parallel processing; Data Model; Parallel I/O; Parallel netCDF;
Conference_Titel :
High Performance Computing (HiPC), 2011 18th International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4577-1951-6
Electronic_ISBN :
978-1-4577-1949-3
DOI :
10.1109/HiPC.2011.6152746