DocumentCode
691903
Title
Design and Implementation of Parallel FFT on CUDA
Author
Xueqin Zhang ; Kai Shen ; Chengguang Xu ; Kaifang Wang
Author_Institution
Sch. of Electron. Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
fYear
2013
fDate
21-22 Dec. 2013
Firstpage
583
Lastpage
589
Abstract
Fast Fourier Transform (FFT) algorithm has an important role in the image processing and scientific computing, and it´s a highly parallel divide-and-conquer algorithm. In this paper, we exploited the Compute Unified Device Architecture CUDA technology and contemporary graphics processing units (GPUs) to achieve higher performance. We focused on two aspects to optimize the ordinary FFT algorithm, multi-threaded parallelism and memory hierarchy. We also proposed parallelism optimization strategies when the data volume occurs and predicted the possible situation when the amount of data increased further.it can be seen from the results that Parallel FFT algorithm is more efficient than the ordinary FFT algorithm.
Keywords
fast Fourier transforms; graphics processing units; optimisation; parallel algorithms; parallel architectures; CUDA technology; Compute Unified Device Architecture; GPUs; fast Fourier transform; graphics processing units; parallel FFT algorithm; parallelism optimization; Algorithm design and analysis; Discrete Fourier transforms; Educational institutions; Graphics processing units; Instruction sets; Optimization; Parallel processing; CUDA technology; memory hierarchy; thread/thread block;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable, Autonomic and Secure Computing (DASC), 2013 IEEE 11th International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4799-3380-8
Type
conf
DOI
10.1109/DASC.2013.130
Filename
6844430
Link To Document