Title :
Scaling parallel I/O performance through I/O delegate and caching system
Author :
Nisar, Arifa ; Liao, Wei-keng ; Choudhary, Alok
Author_Institution :
Electr. Eng. & Comput. Sci. Dept., Northwestern Univ., Evanston, IL, USA
Abstract :
Increasingly complex scientific applications require massive parallelism to achieve the goals of fidelity and high computational performance. Such applications periodically offload checkpointing data to the file system for post-processing and program resumption. As a side effect of the high degree of parallelism, I/O contention at the servers doesn't allow overall performance to scale with an increasing number of processors. To bridge the gap between parallel computational and I/O performance, we propose a portable MPI-IO layer where certain tasks, such as file caching, consistency control, and collective I/O optimization, are delegated to a small set of compute nodes, collectively termed I/O Delegate nodes. A collective cache design is incorporated to resolve cache coherence and hence alleviate lock contention at the I/O servers. Using a popular parallel I/O benchmark and application I/O kernels, our experimental evaluation indicates considerable performance improvement with a small percentage of compute resources reserved for I/O.
Keywords :
cache storage; checkpointing; file organisation; input-output programs; message passing; parallel processing; I/O delegate; MPI-IO layer; caching system; file system; offload checkpointing data; scaling parallel I/O performance; Application software; Bandwidth; Checkpointing; Computer applications; Computer architecture; Concurrent computing; File systems; High performance computing; Parallel processing; Portable computers
Conference_Title :
High Performance Computing, Networking, Storage and Analysis, 2008. SC 2008. International Conference for
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-2834-2
Electronic_ISBN :
978-1-4244-2835-9
DOI :
10.1109/SC.2008.5214358