DocumentCode
463611
Title
SIMD Vectorization of Non-Two-Power Sized FFTs
Author
Franchetti, F. ; Puschel, Markus
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
2
fYear
2007
fDate
15-20 April 2007
Abstract
SIMD (single instruction multiple data) vector instructions, such as Intel´s SSE family, are available on most architectures, but are difficult to exploit for speed-up. In many cases, such as the fast Fourier transform (FFT), signal processing algorithms have to undergo major transformations to map efficiently. Using the Kronecker product formalism, we rigorously derive a novel variant of the general-radix Cooley-Tukey FFT that is structured to map efficiently for any vector length v and radix. Then, we include the new FFT into the program generator spiral to generate actual C implementations. Benchmarks on Intel´s SSE show that the new algorithms perform better on practically all sizes than the best available libraries Intel´s MKL and FFTW.
Keywords
fast Fourier transforms; signal processing; Kronecker product formalism; fast Fourier transform; general-radix Cooley-Tukey FFT; nontwo-power sized FFTS; signal processing algorithms; single instruction multiple data vector instructions; Automatic programming; Computer aided instruction; Computer architecture; Discrete Fourier transforms; Embedded computing; Fast Fourier transforms; Flexible printed circuits; Libraries; Signal processing algorithms; Spirals; DFT; Discrete Fourier transform; Kronecker product; SSE; performance; program generation; vector instructions;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366161
Filename
4217334
Link To Document