Title :
Facilitating co-design for extreme-scale systems through lightweight simulation
Author :
Engelmann, Christian ; Lauer, Frank
Author_Institution :
Comput. Sci. & Math. Div., Oak Ridge Nat. Lab., Oak Ridge, TN, USA
Abstract :
This work focuses on tools for investigating algorithm performance at extreme scale, with millions of concurrent threads, and for evaluating the impact of future architecture choices, in order to facilitate the co-design of high-performance computing (HPC) architectures and applications. The approach relies on lightweight simulation of extreme-scale HPC systems with the required level of accuracy. The prototype presented in this paper provides this capability using a parallel discrete event simulation (PDES), so that a Message Passing Interface (MPI) application can be executed at extreme scale and its performance properties can be evaluated. The results of the initial prototype are encouraging: a simple "hello world" MPI program could be scaled up to 1,048,576 virtual MPI processes on a four-node cluster, and the performance properties of two MPI programs could be evaluated at up to 16,384 virtual MPI processes on the same system.
Keywords :
discrete event simulation; message passing; MPI program; algorithm performance; extreme-scale system; high performance computing architecture; lightweight simulation; message passing interface application; parallel discrete event simulation; virtual MPI process; Computational modeling; Computer architecture; Context; Libraries; Message systems; Prototypes; Time measurement; Message Passing Interface; hardware/software co-design; high-performance computing; performance evaluation
Conference_Title :
Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), 2010 IEEE International Conference on
Conference_Location :
Heraklion, Crete
Print_ISBN :
978-1-4244-8395-2
Electronic_ISBN :
978-1-4244-8397-6
DOI :
10.1109/CLUSTERWKSP.2010.5613113