• DocumentCode
    691903
  • Title

    Design and Implementation of Parallel FFT on CUDA

  • Author

    Xueqin Zhang ; Kai Shen ; Chengguang Xu ; Kaifang Wang

  • Author_Institution
    Sch. of Electron. Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • fYear
    2013
  • fDate
    21-22 Dec. 2013
  • Firstpage
    583
  • Lastpage
    589
  • Abstract
    Fast Fourier Transform (FFT) algorithm has an important role in the image processing and scientific computing, and it´s a highly parallel divide-and-conquer algorithm. In this paper, we exploited the Compute Unified Device Architecture CUDA technology and contemporary graphics processing units (GPUs) to achieve higher performance. We focused on two aspects to optimize the ordinary FFT algorithm, multi-threaded parallelism and memory hierarchy. We also proposed parallelism optimization strategies when the data volume occurs and predicted the possible situation when the amount of data increased further.it can be seen from the results that Parallel FFT algorithm is more efficient than the ordinary FFT algorithm.
  • Keywords
    fast Fourier transforms; graphics processing units; optimisation; parallel algorithms; parallel architectures; CUDA technology; Compute Unified Device Architecture; GPUs; fast Fourier transform; graphics processing units; parallel FFT algorithm; parallelism optimization; Algorithm design and analysis; Discrete Fourier transforms; Educational institutions; Graphics processing units; Instruction sets; Optimization; Parallel processing; CUDA technology; memory hierarchy; thread/thread block;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable, Autonomic and Secure Computing (DASC), 2013 IEEE 11th International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4799-3380-8
  • Type

    conf

  • DOI
    10.1109/DASC.2013.130
  • Filename
    6844430