• DocumentCode
    524453
  • Title

    Efficient Embarrassingly Parallel on Graphics Processor Unit

  • Author

    Gong, Chunye ; Liu, Jie ; Qin, Jin ; Hu, Qingfeng ; Gong, Zhenghu

  • Author_Institution
    Dept. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
  • Volume
    4
  • fYear
    2010
  • fDate
    22-24 June 2010
  • Abstract
    The Embarrassingly Parallel (EP) is one kernel benchmark of NAS Parallel Benchmarks (NPB) which are a set of programs designed to help evaluate the performance of parallel supercomputers. In the EP benchmark, two-dimensional statistics are accumulated from a large number of Gaussian pseudo-random numbers, which produced by Linear Congruential Generator (LCG). In this paper, we present the design and implementation of EP on the powerful Graphics Processor Unit Tesla T10 with CUDA. While keeping the main framework of NPB EP, comparative results show that the performance of our GPU-based implementation is up to 871.57 Mop/s. This is roughly 1.38 times faster than the throughput previously achieved on the same GPU and outperforms equivalent 4 cores CPU by about 11.33 times.
  • Keywords
    Gaussian processes; benchmark testing; computer graphic equipment; coprocessors; parallel architectures; performance evaluation; random number generation; CUDA; GPU-based implementation; Gaussian pseudorandom number; NAS parallel benchmark; Tesla T10; embarrassingly parallel; graphics processor unit; kernel benchmark; linear congruential generator; parallel supercomputer; performance evaluation; two-dimensional statistics; Computational fluid dynamics; Concurrent computing; Educational technology; Graphics; Hardware; Kernel; Monte Carlo methods; Random number generation; Statistics; Supercomputers; CUDA; GPU; NPB Embarrassingly Parallel (EP); benchmark;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Education Technology and Computer (ICETC), 2010 2nd International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-6367-1
  • Type

    conf

  • DOI
    10.1109/ICETC.2010.5529656
  • Filename
    5529656