• DocumentCode
    3346232
  • Title

    Automatic generation of implementations for DSP transforms on fused multiply-add architectures

  • Author

    Voronenko, Yevgen ; Püschel, Markus

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    5
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Many modern computer architectures feature fused multiply-add (FMA) instructions, which offer potentially faster performance for numerical applications. For DSP transforms, compilers can only generate FMA code to a very limited extent because optimal use of FMAs requires modifying the chosen algorithm. In this paper, we present a framework for automatically generating FMA code for every linear DSP transform, which we implemented as an extension to the SPIRAL code generation system. We show that for many transforms and transform sizes, our generated FMA code matches the best-known hand-derived FMA algorithms in terms of arithmetic cost. Further, we present actual runtime results that show the speed-up obtained by using FMA instructions.
  • Keywords
    automatic programming; digital arithmetic; discrete Fourier transforms; discrete cosine transforms; program compilers; DCT; DFT; DSP transform implementations; FMA code arithmetic cost; FMA instructions; SPIRAL code generation system; automatic code generation; compilers; fused multiply-add architectures; linear DSP transform; Arithmetic; Computer architecture; Costs; Digital signal processing; Discrete Fourier transforms; Discrete cosine transforms; Discrete transforms; Runtime; Signal processing algorithms; Spirals;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1327057
  • Filename
    1327057