• DocumentCode
    3080686
  • Title

    Evaluating the potential of graphics processors for high performance embedded computing

  • Author

    Mu, Shuai ; Wang, Chenxi ; Liu, Ming ; Li, Dongdong ; Zhu, Maohua ; Chen, Xiaoliang ; Xie, Xiang ; Deng, Yangdong

  • Author_Institution
    Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    14-18 March 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Today´s high performance embedded computing applications are posing significant challenges for processing throughout. Traditionally, such applications have been realized on application specific integrated circuits (ASICs) and/or digital signal processors (DSP). However, ASICs´ advantage in performance and power often could not justify the fast increasing fabrication cost, while current DSP offers a limited processing throughput that is usually lower than 100GFLOPS. On the other hand, current multi-core processors, especially graphics processing units (GPUs), deliver very high computing throughput, and at the same time maintain high flexibility and programmability. It is thus appealing to study the potential of GPUs for high performance embedded computing. In this work, we perform a comprehensive performance evaluation on GPUs with the high performance embedded computing (HPEC) benchmark suite, which consist a broad range of signal processing benchmarks with an emphasis on radar processing applications. We develop efficient GPU implementations that could outperform previous results for all the benchmarks. In addition, a systematic instruction level analysis for the GPU implementations is conducted with a GPU micro-architecture simulator. The results provide key insights on optimizing GPU hardware and software. Meanwhile, we also compared the performance and power efficiency between GPU and DSP with the HPEC benchmarks. The comparison reveals that the major hurdle for GPU´s applications in embedded computing is its relatively low power efficiency.
  • Keywords
    computer graphic equipment; coprocessors; embedded systems; multiprocessing systems; radar signal processing; application specific integrated circuits; digital signal processors; graphics processors; high performance embedded computing; multicore processors; radar processing application; Benchmark testing; Digital signal processing; Graphics processing unit; Instruction sets; Kernel; Parallel processing; Throughput; DSP; Fermi; GFLOPS; GPU; HPEC benchmark; Multi-core; parallel computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design, Automation & Test in Europe Conference & Exhibition (DATE), 2011
  • Conference_Location
    Grenoble
  • ISSN
    1530-1591
  • Print_ISBN
    978-1-61284-208-0
  • Type

    conf

  • DOI
    10.1109/DATE.2011.5763120
  • Filename
    5763120