Title :
Parallel transposing and communication strategies for FFT on cluster of SMP architectures with multicore processors
Author :
Jie, Yuan ; Jian-ping, Wu ; Zheng-hua, Wang
Author_Institution :
Nat. Lab. of Parallel & Distrib. Process., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
This paper presents a high-performance parallel formulation of the 1-D FFT based on the transpose algorithm. The parallel FFT scheme trades some efficiency for a more consistent level of parallel performance. It involves matrix transposition, and a new in-place transposing algorithm is introduced into the parallel matrix transposition to improve efficiency. Depending on the input size n, the number of processes p, and the memory or network bandwidth, this method can achieve better parallel performance than other methods on clusters of SMP architectures. Tests show that the scheme achieves good speedup.
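As a rough illustration of the transpose algorithm the abstract refers to, the following serial sketch computes a 1-D FFT of length n = n1 * n2 by viewing the input as an n1 x n2 matrix and alternating transposes with row-wise transforms. It is not the authors' implementation: the names `dft`, `transpose`, and `transpose_fft` are hypothetical, the naive O(m^2) `dft` stands in for any local FFT kernel, and the out-of-place `transpose` stands in for the paper's in-place parallel transposition, which the abstract does not specify. In a distributed setting each transpose would correspond to an all-to-all exchange between processes, so that every row transform operates on local data.

```python
import cmath


def dft(row):
    """Naive DFT of one row; a stand-in for any local FFT kernel (assumption)."""
    m = len(row)
    return [
        sum(row[j] * cmath.exp(-2j * cmath.pi * j * k / m) for j in range(m))
        for k in range(m)
    ]


def transpose(mat):
    """Out-of-place transpose; the paper uses an in-place parallel variant."""
    return [list(col) for col in zip(*mat)]


def transpose_fft(x, n1, n2):
    """1-D DFT of x (length n1 * n2) via the transpose (six-step) formulation."""
    n = n1 * n2
    assert len(x) == n
    # View the input as an n1 x n2 row-major matrix: a[j1][j2] = x[j1 * n2 + j2].
    a = [x[r * n2:(r + 1) * n2] for r in range(n1)]
    # Transpose so that each length-n1 transform runs over a contiguous row.
    a = transpose(a)                      # n2 x n1, a[j2][j1]
    a = [dft(row) for row in a]           # a[j2][k1]
    # Multiply by the twiddle factors w_n^(j2 * k1).
    a = [
        [a[j2][k1] * cmath.exp(-2j * cmath.pi * j2 * k1 / n) for k1 in range(n1)]
        for j2 in range(n2)
    ]
    # Transpose back so that each length-n2 transform runs over a row.
    a = transpose(a)                      # n1 x n2, a[k1][j2]
    a = [dft(row) for row in a]           # a[k1][k2]
    # A final transpose puts the output in natural order: X[k1 + n1 * k2].
    a = transpose(a)                      # n2 x n1, a[k2][k1]
    return [value for row in a for value in row]
```

Calling `transpose_fft(x, n1, n2)` on a list of complex numbers should agree with `dft(x)` up to floating-point rounding for any factorization n1 * n2 of the input length.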
Keywords :
fast Fourier transforms; multiprocessing systems; parallel processing; FFT; SMP architectures; communication strategies; multicore processors; parallel transposing; transpose algorithm; Algorithm design and analysis; Arrays; Clustering algorithms; Computers; Partitioning algorithms; Program processors; SMP; cluster; in-place transpose; parallel
Conference_Title :
Image and Signal Processing (CISP), 2010 3rd International Congress on
Conference_Location :
Yantai
Print_ISBN :
978-1-4244-6513-2
DOI :
10.1109/CISP.2010.5647466