DocumentCode :
188180
Title :
Speeding Up FPGA Placement: Parallel Algorithms and Methods
Author :
An, Matthew ; Steffan, J. Gregory ; Betz, Vaughn
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
fYear :
2014
fDate :
11-13 May 2014
Firstpage :
178
Lastpage :
185
Abstract :
Placement of a large FPGA design now commonly requires several hours, significantly hindering designer productivity. Furthermore, FPGA capacity is growing faster than CPU speed, which will further increase placement time unless new approaches are found. Multi-core processors are now ubiquitous, however, and some recent processors also have hardware support for transactional memory (TM), making parallelism an increasingly attractive approach for speeding up placement. We investigate methods to parallelize the simulated annealing placement algorithm in VPR, which is widely used in FPGA research. We explore both algorithmic changes and the use of different parallel programming paradigms and hardware, including TM, thread-level speculation (TLS) and lock-free techniques. We find that hardware TM enables large speedups (8.1x on average), but compromises “move fairness” and leads to an unacceptable quality loss. TLS scales poorly, with a maximum 2.2x speedup, but preserves quality. A new dependency checking parallel strategy achieves the best balance: the deterministic version achieves 5.9x speedup and no quality loss, while the non-deterministic, lock-free version can scale to a 34x speedup.
Keywords :
field programmable gate arrays; multiprocessing systems; parallel algorithms; parallel programming; simulated annealing; CPU; FPGA placement; TLS; VPR; dependency checking parallel strategy; lock-free technique; multicore processor; parallel algorithm; parallel programming; simulated annealing placement algorithm; thread-level speculation; transactional memory; Field programmable gate arrays; Hardware; Instruction sets; Parallel processing; Programming; Proposals; Tracking; FPGA placement; parallel placement; simulated annealing; transactional memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Custom Computing Machines (FCCM), 2014 IEEE 22nd Annual International Symposium on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4799-5110-9
Type :
conf
DOI :
10.1109/FCCM.2014.60
Filename :
6861622
Link To Document :
بازگشت