• DocumentCode
    188180
  • Title

    Speeding Up FPGA Placement: Parallel Algorithms and Methods

  • Author

    An, Matthew ; Steffan, J. Gregory ; Betz, Vaughn

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
  • fYear
    2014
  • fDate
    11-13 May 2014
  • Firstpage
    178
  • Lastpage
    185
  • Abstract
    Placement of a large FPGA design now commonly requires several hours, significantly hindering designer productivity. Furthermore, FPGA capacity is growing faster than CPU speed, which will further increase placement time unless new approaches are found. Multi-core processors are now ubiquitous, however, and some recent processors also have hardware support for transactional memory (TM), making parallelism an increasingly attractive approach for speeding up placement. We investigate methods to parallelize the simulated annealing placement algorithm in VPR, which is widely used in FPGA research. We explore both algorithmic changes and the use of different parallel programming paradigms and hardware, including TM, thread-level speculation (TLS) and lock-free techniques. We find that hardware TM enables large speedups (8.1x on average), but compromises “move fairness” and leads to an unacceptable quality loss. TLS scales poorly, with a maximum 2.2x speedup, but preserves quality. A new dependency checking parallel strategy achieves the best balance: the deterministic version achieves 5.9x speedup and no quality loss, while the non-deterministic, lock-free version can scale to a 34x speedup.
  • Keywords
    field programmable gate arrays; multiprocessing systems; parallel algorithms; parallel programming; simulated annealing; CPU; FPGA placement; TLS; VPR; dependency checking parallel strategy; lock-free technique; multicore processor; parallel algorithm; parallel programming; simulated annealing placement algorithm; thread-level speculation; transactional memory; Field programmable gate arrays; Hardware; Instruction sets; Parallel processing; Programming; Proposals; Tracking; FPGA placement; parallel placement; simulated annealing; transactional memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field-Programmable Custom Computing Machines (FCCM), 2014 IEEE 22nd Annual International Symposium on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4799-5110-9
  • Type

    conf

  • DOI
    10.1109/FCCM.2014.60
  • Filename
    6861622