DocumentCode :
1973735
Title :
Modeling Billion-Node Torus Networks Using Massively Parallel Discrete-Event Simulation
Author :
Liu, Ning ; Carothers, Christopher D.
Author_Institution :
Comput. Sci. Dept., Rensselaer Polytech. Inst., Troy, NY, USA
fYear :
2011
fDate :
14-17 June 2011
Firstpage :
1
Lastpage :
8
Abstract :
Exascale supercomputers will have millions or even hundreds of millions of processing cores and the potential for nearly billion-way parallelism. Exascale compute and data storage architectures will be critically dependent on the interconnection network. The most popular interconnection network for current and future supercomputer systems is the torus (e.g., k-ary n-cube). This paper focuses on the modeling and simulation of ultra-large-scale torus networks using Rensselaer's Optimistic Simulator System (ROSS). We compare real communication delays between our model and the actual torus network from the Blue Gene/L using 2,048 processors. Our performance experiments demonstrate the ability to simulate million- to billion-node torus networks. The torus network model for a 16-million-node configuration shows a high degree of strong scaling when going from 1,024 cores to 32,768 cores on Blue Gene/L, with a peak event rate of nearly 5 billion events per second. Finally, we demonstrate the performance of our torus network model configured with 1 billion nodes using up to 16,384 Blue Gene/L processors.
Keywords :
delays; discrete event simulation; hypercube networks; multiprocessing systems; network topology; parallel machines; Blue Gene/L; ROSS; Rensselaer Optimistic Simulator System; billion-node torus network modeling; communication delay; data storage architecture; exascale supercomputers; interconnection network; k-ary network; massively parallel discrete-event simulation; n-cube network; processing cores; supercomputer system; torus network topology; ultralarge-scale torus network; Computational modeling; Computer architecture; Delay; Heuristic algorithms; Program processors; Routing; Schedules;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Principles of Advanced and Distributed Simulation (PADS), 2011 IEEE Workshop on
Conference_Location :
Nice
ISSN :
1087-4097
Print_ISBN :
978-1-4577-1363-7
Electronic_ISBN :
1087-4097
Type :
conf
DOI :
10.1109/PADS.2011.5936761
Filename :
5936761