مرکز منطقه ای اطلاع رساني علوم و فناوري - Scalable parallel I/O alternatives for massively parallel partitioned solver systems

DocumentCode :

2450756

Title :

Scalable parallel I/O alternatives for massively parallel partitioned solver systems

Author :

Jing Fu ; Ning Liu ; Sahni, Onkar ; Jansen, Kenneth E. ; Shephard, Mark S. ; Carothers, Christopher D.

Author_Institution :

Dept. of Comput. Sci., Rensselaer Polytech. Inst., Troy, NY, USA

fYear :

2010

fDate :

19-23 April 2010

Firstpage :

Lastpage :

Abstract :

With the development of high-performance computing, I/O issues have become the bottleneck for many massively parallel applications. This paper investigates scalable parallel I/O alternatives for massively parallel partitioned solver systems. Typically such systems have synchronized Â¿loopsÂ¿ and will write data in a well defined block I/O format consisting of a header and data portion. Our target use for such an parallel I/O subsystem is checkpoint-restart where writing is by far the most common operation and reading typically only happens during either initialization or during a restart operation because of a system failure. We compare four parallel I/O strategies: 1 POSIX File Per Processor (1PFPP), a synchronized parallel IO library (syncIO), Â¿Poor-Man´sÂ¿ Parallel I/O (PMPIO) and a new Â¿reduced blockingÂ¿ strategy (rbIO). Performance tests using real CFD solver data from PHASTA (an unstructured grid finite element Navier-Stokes solver) show that the syncIO strategy can achieve a read bandwidth of 6.6GB/Sec on Blue Gene/L using 16K processors which is significantly faster than 1PFPP or PMPIO approaches. The serial Â¿token-passingÂ¿ approach of PMPIO yields a 900 MB/sec write bandwidth on 16K processors using 1024 files and 1PFPP achieves 600 MB/sec on 8K processors while the Â¿reduced-blockedÂ¿ rbIO strategy achieves an actual writing performance of 2.3GB/sec and perceived/latency hiding writing performance of more than 21,000 GB/sec (i.e., 21TB/sec) on a 32,768 processor Blue Gene/L.

Keywords :

Unix; input-output programs; mathematics computing; parallel processing; 1 POSIX file per processor; CFD solver data; Navier Stokes solver; PHASTA; high performance computing; massively parallel partitioned solver systems; poor mans parallel I/O; reduced blocking strategy; scalable parallel I/O alternatives; synchronized parallel I/O library; token passing; Bandwidth; Computational fluid dynamics; Computer science; Concurrent computing; Economic indicators; Equations; Finite element methods; Libraries; Testing; Writing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on

Conference_Location :

Atlanta, GA

Print_ISBN :

978-1-4244-6533-0

Type :

conf

DOI :

10.1109/IPDPSW.2010.5470887

Filename :

5470887

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2450756