DocumentCode :
3366967
Title :
Multi-level Parallelism for Time- and Cost-Efficient Parallel Discrete Event Simulation on GPUs
Author :
Kunz, Georg ; Schemmel, Daniel ; Gross, James ; Wehrle, Klaus
fYear :
2012
fDate :
15-19 July 2012
Firstpage :
23
Lastpage :
32
Abstract :
Developing complex technical systems requires a systematic exploration of the given design space in order to identify optimal system configurations. However, studying the effects and interactions of even a small number of system parameters often requires an extensive number of simulation runs. This in turn results in excessive runtime demands which severely hamper thorough design space explorations. In this paper, we present a parallel discrete event simulation scheme that enables cost- and time-efficient execution of large scale parameter studies on GPUs. In order to efficiently accommodate the stream-processing paradigm of GPUs, our parallelization scheme exploits two orthogonal levels of parallelism: External parallelism among the inherently independent simulations of a parameter study and internal parallelism among independent events within each individual simulation of a parameter study. Specifically, we design an event aggregation strategy based on external parallelism that generates workloads suitable for GPUs. In addition, we define a pipelined event execution mechanism based on internal parallelism to hide the transfer latencies between host- and GPU-memory. We analyze the performance characteristics of our parallelization scheme by means of a prototype implementation and show a 25-fold performance improvement over purely CPU-based execution.
Keywords :
discrete event simulation; graphics processing units; parallel processing; GPU; design space explorations; independent simulations; multilevel parallelism; optimal system configurations; stream processing paradigm; time and cost efficient parallel discrete event simulation; Computational modeling; Discrete event simulation; Graphics processing unit; Instruction sets; Parallel processing; Prototypes; GP-GPU; PDES; event aggregation; external parallelism; internal parallelism; latency hiding; parameter studies;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Principles of Advanced and Distributed Simulation (PADS), 2012 ACM/IEEE/SCS 26th Workshop on
Conference_Location :
Zhangjiajie
ISSN :
1087-4097
Print_ISBN :
978-1-4673-1797-9
Type :
conf
DOI :
10.1109/PADS.2012.27
Filename :
6305881
Link To Document :
بازگشت