Title :
High performance file I/O for the Blue Gene/L supercomputer
Author :
H. Yu;R.K. Sahoo;C. Howson;G. Almasi;J.G. Castanos;M. Gupta;J.E. Moreira;J.J. Parker;T.E. Engelsiepen;R.B. Ross;R. Thakur;R. Latham;W.D. Gropp
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fDate :
6/28/1905 12:00:00 AM
Abstract :
Parallel I/O plays a crucial role for most data-intensive applications running on massively parallel systems like Blue Gene/L that provides the promise of delivering enormous computational capability. We designed and implemented a highly scalable parallel file I/O architecture for Blue Gene/L, which leverages the benefit of the hierarchical and functional partitioning design of the system software with separate computational and I/O cores. The architecture exploits the scalability aspect of GPFS (General Parallel File System) at the backend, while using MPI I/O as an interface between the application I/O and the file system. We demonstrate the impact of our high performance I/O solution for Blue Gene/L with a comprehensive evaluation that consists of a number of widely used parallel I/O benchmarks and I/O intensive applications. Our design and implementation is not only able to deliver at least one order of magnitude speed up in terms of I/O bandwidth for a real-scale application HOMME (achieving aggregate bandwidth of 1.8 GB/Sec and 2.3 GB/Sec for write and read accesses, respectively), but also supports high-level parallel I/O data interfaces such as parallel HDF5 and parallel NetCDF scaling up to a large number of processors.
Keywords :
"Supercomputers","Concurrent computing","Computer architecture","File systems","Scalability","Bandwidth","Large-scale systems","Parallel programming","System software","Application software"
Conference_Titel :
High-Performance Computer Architecture, 2006. The Twelfth International Symposium on
Print_ISBN :
0-7803-9368-6
DOI :
10.1109/HPCA.2006.1598125