Title :
Efficient Embarrassingly Parallel on Graphics Processor Unit
Author :
Gong, Chunye ; Liu, Jie ; Qin, Jin ; Hu, Qingfeng ; Gong, Zhenghu
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
The Embarrassingly Parallel (EP) is one kernel benchmark of NAS Parallel Benchmarks (NPB) which are a set of programs designed to help evaluate the performance of parallel supercomputers. In the EP benchmark, two-dimensional statistics are accumulated from a large number of Gaussian pseudo-random numbers, which produced by Linear Congruential Generator (LCG). In this paper, we present the design and implementation of EP on the powerful Graphics Processor Unit Tesla T10 with CUDA. While keeping the main framework of NPB EP, comparative results show that the performance of our GPU-based implementation is up to 871.57 Mop/s. This is roughly 1.38 times faster than the throughput previously achieved on the same GPU and outperforms equivalent 4 cores CPU by about 11.33 times.
Keywords :
Gaussian processes; benchmark testing; computer graphic equipment; coprocessors; parallel architectures; performance evaluation; random number generation; CUDA; GPU-based implementation; Gaussian pseudorandom number; NAS parallel benchmark; Tesla T10; embarrassingly parallel; graphics processor unit; kernel benchmark; linear congruential generator; parallel supercomputer; performance evaluation; two-dimensional statistics; Computational fluid dynamics; Concurrent computing; Educational technology; Graphics; Hardware; Kernel; Monte Carlo methods; Random number generation; Statistics; Supercomputers; CUDA; GPU; NPB Embarrassingly Parallel (EP); benchmark;
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
DOI :
10.1109/ICETC.2010.5529656