DocumentCode :
3074931
Title :
Scalable, accurate multicore simulation in the 1000-core era
Author :
Lis, Mieszko ; Ren, Pengju ; Cho, Myong Hyon ; Shim, Keun Sup ; Fletcher, Christopher W. ; Khan, Omer ; Devadas, Srinivas
Author_Institution :
Massachusetts Inst. of Technol., Cambridge, MA, USA
fYear :
2011
fDate :
10-12 April 2011
Firstpage :
175
Lastpage :
185
Abstract :
We present HORNET, a parallel, highly configurable, cycle-level multicore simulator based on an ingress-queued worm-hole router NoC architecture. The parallel simulation engine offers cycle-accurate as well as periodic synchronization; while preserving functional accuracy, this permits tradeoffs between perfect timing accuracy and high speed with very good accuracy. When run on 6 separate physical cores on a single die, speedups can exceed a factor of over 5, and when run on a two-die 12-core system with 2-way hyperthreading, speedups exceed 11 ×. Most hardware parameters are configurable, including memory hierarchy, interconnect geometry, bandwidth, crossbar dimensions, and parameters driving power and thermal effects. A highly parametrized table-based NoC design allows a variety of routing and virtual channel allocation algorithms out of the box, ranging from simple DOR routing to complex Valiant, ROMM, or PROM schemes, BSOR, and adaptive routing. HORNET can run in network-only mode using synthetic traffic or traces, directly emulate a MIPS-based multicore, or function as the memory subsystem for native applications executed under the Pin instrumentation tool. HORNET is freely available under the open-source MIT license at http://csg.csail.mit.edu/hornet/.
Keywords :
multiprocessing systems; multiprocessor interconnection networks; network routing; network-on-chip; synchronisation; 1000-core era; 2-way hyperthreading; BSOR routing; DOR routing; HORNET; MlPS-based multicore; PROM schemes; ROMM schemes; adaptive routing; cycle-level multicore simulator; ingress-queued worm-hole router NoC architecture; multicore simulation; parallel simulation engine; pin instrumentation tool; synthetic traffic; virtual channel allocation algorithms; Accuracy; Analytical models; Multicore processing; Program processors; Routing; Synchronization; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-61284-367-4
Electronic_ISBN :
978-1-61284-368-1
Type :
conf
DOI :
10.1109/ISPASS.2011.5762734
Filename :
5762734
Link To Document :
بازگشت