• DocumentCode
    1994372
  • Title

    Architecture Exploration of High-Performance Floating-Point Fused Multiply-Add Units and their Automatic Use in High-Level Synthesis

  • Author

    Liebig, Bjorn ; Huthmann, Jens ; Koch, Andreas

  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    134
  • Lastpage
    143
  • Abstract
    Multiply-add operations form a crucial part of many digital signal processing and control engineering applications. Since their performance is crucial for the application-level speed-up, it is worthwhile to explore a wide spectrum of implementations alternatives, trading increased area/energy usage to speed-up units on the critical path of the computation. This paper examines existing solutions and proposes two new architectures for floating-point fused multiply-adds, and also considers the impact of different in-fabric features of recent FPGA architectures. The units rely on different degrees of carry-save arithmetic improve performance by up to 2.5x over the closest state-of-the-art competitor. They are evaluated at the application level by modifying an existing high-level synthesis system to automatically insert the new units for computations on the critical path of three different convex solvers.
  • Keywords
    field programmable gate arrays; floating point arithmetic; high level synthesis; FPGA architectures; application-level speed-up; architecture exploration; convex solvers; high-level synthesis; high-performance floating-point fused multiply-add units; in-fabric features; multiply-add operations; state-of-the-art competitor; Acceleration; Accuracy; Adders; Computer architecture; Field programmable gate arrays; Multiplexing; Standards; FMA; FPGA; carry save; floating-point; fused; multiply-add;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.106
  • Filename
    6650880