• DocumentCode
    2906880
  • Title

    Automatic FFT Performance Tuning on OpenCL GPUs

  • Author

    Li, Yan ; Zhang, Yunquan ; Jia, Haipeng ; Long, Guoping ; Wang, Ke

  • Author_Institution
    Lab. of Parallel Software & Comput. Sci., Inst. of Software, Beijing, China
  • fYear
    2011
  • fDate
    7-9 Dec. 2011
  • Firstpage
    228
  • Lastpage
    235
  • Abstract
    Many fields of science and engineering, such as astronomy, medical imaging, seismology and spectroscopy, have been revolutionized by Fourier methods. The fast Fourier transform (FFT) is an efficient algorithm to compute the discrete Fourier transform (DFT) and its inverse. The emerging class of high performance computing architectures, such as GPU, seeks to achieve much higher performance and efficiency by exposing a hierarchy of distinct memories to programmers. However, the complexity of GPU programming poses a significant challenge for programmers. In this paper, based on the Kronecker product form multi-dimensional FFTs, we propose an automatic performance tuning framework for various OpenCL GPUs. Several key techniques of GPU programming on AMD and NVIDIA GPUs are also identified. Our OpenCL FFT library achieves up to 1.5 to 4 times, 1.5 to 40 times and 1.4 times the performance of clAmdFft 1.0 for 1D, 2D and 3D FFT respectively on an AMD GPU, and the overall performance is within 90% of CUFFT 4.0 on two NVIDIA GPUs.
  • Keywords
    discrete Fourier transforms; graphics processing units; performance evaluation; AMD GPU; GPU programming; Kronecker product; NVIDIA GPU; OpenCL FFT library; OpenCL GPU; astronomy; automatic FFT performance tuning; discrete Fourier transform; fast Fourier transform; graphics processing unit; medical imaging; multidimensional FFT; seismology; spectroscopy; Graphics processing unit; Instruction sets; Kernel; Memory management; Programming; Vectors; Auto-tuning; DFT; FFT; GPU; OpenCL;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on
  • Conference_Location
    Tainan
  • ISSN
    1521-9097
  • Print_ISBN
    978-1-4577-1875-5
  • Type

    conf

  • DOI
    10.1109/ICPADS.2011.32
  • Filename
    6121282