DocumentCode
1267335
Title
Improved Twiddle Access for Fast Fourier Transforms
Author
Bowers, Kevin J. ; Lippert, Ross A. ; Dror, Ron O. ; Shaw, David E.
Author_Institution
D.E. Shaw Res., New York, NY, USA
Volume
58
Issue
3
fYear
2010
fDate
3/1/2010 12:00:00 AM
Firstpage
1122
Lastpage
1130
Abstract
Optimizing the number of arithmetic operations required in fast Fourier transform (FFT) algorithms has been the focus of extensive research, but memory management is of comparable importance on modern processors. In this article, we investigate two known FFT algorithms, G and GT , that are similar to Cooley-Tukey decimation-in-time and decimation-in-frequency FFT algorithms but that give an asymptotic reduction in the number of twiddle factor loads required for depth-first recursions. The algorithms also allow for aggressive vectorization (even for non-power-of-2 orders) and easier optimization of trivial twiddle factor multiplies. We benchmark G and GT implementations with comparable Cooley-Tukey implementations on commodity hardware. In a comparison designed to isolate the effect of twiddle factor access optimization, these benchmarks show typical speedups ranging from 10% to 65%, depending on transform order, precision, and vectorization. A more heavily optimized implementation of GT yields substantial performance improvements over the widely used code FFTW for many transform orders. The twiddle factor access optimization technique can be generalized to other common FFT algorithms, including real-data FFTs, split-radix FFTs, and multidimensional FFTs.
Keywords
fast Fourier transforms; Cooley-Tukey algorithm; G algorithm; GT algorithm; decimation-in-frequency algorithm; decimation-in-time algorithm; fast Fourier transforms; memory management; twiddle access; Algorithms; discrete Fourier transforms; memory management;
fLanguage
English
Journal_Title
Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
1053-587X
Type
jour
DOI
10.1109/TSP.2009.2035984
Filename
5313934
Link To Document