• DocumentCode
    624352
  • Title

    Transforming a linear algebra core to an FFT accelerator

  • Author

    Pedram, Ardavan ; McCalpin, John ; Gerstlauer, Andreas

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2013
  • fDate
    5-7 June 2013
  • Firstpage
    175
  • Lastpage
    184
  • Abstract
    This paper considers the modifications required to transform a highly-efficient, specialized linear algebra core into an efficient engine for computing Fast Fourier Transforms (FFTs). We review the minimal changes required to support Radix-4 FFT computations and propose extensions to the micro-architecture of the baseline linear algebra core. Along the way, we study the critical differences between the two classes of algorithms. Special attention is paid to the configuration of the on-chip memory system to support high utilization. We examine design trade-offs between efficiency, specialization and flexibility, and their effects both on the core and memory hierarchy for a unified design as compared to dedicated accelerators for each application. The final design is a flexible architecture that can perform both classes of applications. Results show that the proposed hybrid FFT/Linear Algebra core can achieve 26.6 GFLOPS/S with a power efficiency of 40 GFLOPS/W, which is up to 100× and 40× more energy efficient than cutting-edge CPUs and GPUs, respectively.
  • Keywords
    digital arithmetic; fast Fourier transforms; linear algebra; memory architecture; CPU; FFT accelerator; GFLOPS/S; GFLOPS/W; GPU; baseline linear algebra core; fast Fourier transform; flexible architecture; highly-efficient specialized linear algebra core; hybrid FFT/linear algebra core; memory hierarchy; microarchitecture; on-chip memory system; power efficiency; radix-4 FFT computation; Algorithm design and analysis; Computer architecture; Hardware; Linear algebra; Pipelines; Signal processing algorithms; Transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application-Specific Systems, Architectures and Processors (ASAP), 2013 IEEE 24th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    2160-0511
  • Print_ISBN
    978-1-4799-0494-5
  • Type

    conf

  • DOI
    10.1109/ASAP.2013.6567572
  • Filename
    6567572