DocumentCode :
463611
Title :
SIMD Vectorization of Non-Two-Power Sized FFTs
Author :
Franchetti, F. ; Puschel, Markus
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
2
fYear :
2007
fDate :
15-20 April 2007
Abstract :
SIMD (single instruction multiple data) vector instructions, such as Intel´s SSE family, are available on most architectures, but are difficult to exploit for speed-up. In many cases, such as the fast Fourier transform (FFT), signal processing algorithms have to undergo major transformations to map efficiently. Using the Kronecker product formalism, we rigorously derive a novel variant of the general-radix Cooley-Tukey FFT that is structured to map efficiently for any vector length v and radix. Then, we include the new FFT into the program generator spiral to generate actual C implementations. Benchmarks on Intel´s SSE show that the new algorithms perform better on practically all sizes than the best available libraries Intel´s MKL and FFTW.
Keywords :
fast Fourier transforms; signal processing; Kronecker product formalism; fast Fourier transform; general-radix Cooley-Tukey FFT; nontwo-power sized FFTS; signal processing algorithms; single instruction multiple data vector instructions; Automatic programming; Computer aided instruction; Computer architecture; Discrete Fourier transforms; Embedded computing; Fast Fourier transforms; Flexible printed circuits; Libraries; Signal processing algorithms; Spirals; DFT; Discrete Fourier transform; Kronecker product; SSE; performance; program generation; vector instructions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366161
Filename :
4217334
Link To Document :
بازگشت