• DocumentCode
    463611
  • Title

    SIMD Vectorization of Non-Two-Power Sized FFTs

  • Author

    Franchetti, F. ; Puschel, Markus

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    2
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    SIMD (single instruction multiple data) vector instructions, such as Intel´s SSE family, are available on most architectures, but are difficult to exploit for speed-up. In many cases, such as the fast Fourier transform (FFT), signal processing algorithms have to undergo major transformations to map efficiently. Using the Kronecker product formalism, we rigorously derive a novel variant of the general-radix Cooley-Tukey FFT that is structured to map efficiently for any vector length v and radix. Then, we include the new FFT into the program generator spiral to generate actual C implementations. Benchmarks on Intel´s SSE show that the new algorithms perform better on practically all sizes than the best available libraries Intel´s MKL and FFTW.
  • Keywords
    fast Fourier transforms; signal processing; Kronecker product formalism; fast Fourier transform; general-radix Cooley-Tukey FFT; nontwo-power sized FFTS; signal processing algorithms; single instruction multiple data vector instructions; Automatic programming; Computer aided instruction; Computer architecture; Discrete Fourier transforms; Embedded computing; Fast Fourier transforms; Flexible printed circuits; Libraries; Signal processing algorithms; Spirals; DFT; Discrete Fourier transform; Kronecker product; SSE; performance; program generation; vector instructions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366161
  • Filename
    4217334