Title :
Parallelization and Performance Analysis of the Cooley–Tukey FFT Algorithm for Shared-Memory Architectures
Author :
Norton, Alan ; Silberger, Allan J.
Author_Institution :
IBM Thomas J. Watson Research Center
fDate :
5/1/1987 12:00:00 AM
Abstract :
We present here a study of parallelization of the Cooley-Tukey radix two FFT algorithm for MIMD (nonvector) architectures. Parallel algorithms are presented for one and multidimensional Fourier transforms. From instruction traces obtained by executing Fortran kernels derived from our algorithms, we determined the precise instructions to be executed by each processor in the parallel system. We used these instruction races to predict the performance of the IBM Research Parallel Processing Prototype, RP3, as a computer of FFT´s. Our performance results are depicted in graphs included in this paper.
Keywords :
Cooley-Tukey; MIMD; fast Fourier transform; parallelism; performance analysis; shared memory; Computer architecture; Concurrent computing; Cost function; Fast Fourier transforms; Fourier transforms; Kernel; Multidimensional systems; Parallel algorithms; Parallel processing; Performance analysis; Cooley-Tukey; MIMD; fast Fourier transform; parallelism; performance analysis; shared memory;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.1987.1676943