DocumentCode :
3327392
Title :
Exposing ILP in custom hardware with a dataflow compiler IR
Author :
Zaidi, Athar Mohsin
Author_Institution :
Comput. Lab., Univ. of Cambridge, Cambridge, UK
fYear :
2013
fDate :
7-11 Sept. 2013
Firstpage :
411
Lastpage :
411
Abstract :
With the impending Dark Silicon problem spelling doom for multicore performance scaling, there is an ever increasing need for processor architectures with much better energy efficiency. To address this, designers are increasingly utilising custom (and reconfigurable) computing to provide orders-of-magnitude improvements in energy efficiency over equivalent software implementations of the same code. Unfortunately, one of the key issues with custom computing is that it is often unable to match the performance of conventional processors when implementing irregular code with complex control-flow. Out-of-order superscalar processors implement aggressive branch prediction to speculate across multiple branches with very high accuracy, dynamically exposing ILP. But custom hardware lacks an efficient and safe control-flow speculation mechanism (particularly for loops), as it is difficult to implement misprediction roll-back and recovery in hardware without introducing a centralized synchronization bottleneck. By providing a mechanism for compiling high level languages to hardware, as well as improving performance of control-flow intensive code in hardware, much more pervasive utilization of custom and reconfigurable hardware were enable for general-purpose computation, thereby helping to mitigate the effects of dark silicon.
Keywords :
multiprocessing systems; parallel architectures; performance evaluation; ILP; centralized synchronization bottleneck; complex control-flow; control-flow intensive code; control-flow speculation mechanism; custom hardware; dark silicon; dataflow compiler IR; high level languages; high-level synthesis; instruction level parallelism; multicore performance scaling; out-of-order superscalar processors; processor architectures; Computer architecture; Computers; Dynamic scheduling; Field programmable gate arrays; Hardware; Parallel processing; Silicon; Dark Silicon; Dataflow; High-level Synthesis; Instruction Level Parallelism;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures and Compilation Techniques (PACT), 2013 22nd International Conference on
Conference_Location :
Edinburgh
ISSN :
1089-795X
Print_ISBN :
978-1-4799-1018-2
Type :
conf
DOI :
10.1109/PACT.2013.6618841
Filename :
6618841
Link To Document :
بازگشت