Title :
A New Dataflow Compiler IR for Accelerating Control-Intensive Code in Spatial Hardware
Author :
Zaidi, Athar Mohsin ; Greaves, David
Author_Institution :
Comput. Lab., Univ. of Cambridge, Cambridge, UK
Abstract :
While custom (and reconfigurable) computing can provide orders-of-magnitude improvements in energy efficiency and performance for many numeric, data-parallel applications, performance on non-numeric, sequential code is often worse than what is achievable using conventional superscalar processors. This work attempts to address the problem of improving sequential performance in custom hardware by (a) switching from a statically scheduled to a dynamically scheduled (dataflow) execution model, and (b) developing a new compiler IR for high-level synthesis that enables aggressive exposition of ILP even in the presence of complex control flow. This new IR is directly implemented as a static dataflow graph in hardware by our prototype high-level synthesis tool-chain, and shows an average speedup of 1.13× over equivalent hardware generated using LegUp, an existing HLS tool. In addition, our new IR allows us to further trade area and energy for performance, increasing the average speedup to 1.55×, through loop unrolling, with a peak speedup of 4.05×. Our custom hardware is able to approach the sequential cycle counts of an Intel Nehalem Core i7 superscalar processor, while consuming on average only 0.25× the energy of an in-order Altera Nios IIf processor.
Keywords :
data flow computing; data flow graphs; high level synthesis; program compilers; HLS tool; ILP; Intel Nehalem Core i7 superscalar processor; LegUp; accelerating control-intensive code; complex control flow; dataflow compiler IR; dynamically scheduled execution model; high-level synthesis tool-chain; in-order Altera Nios IIf processor; instruction level parallelism; loop unrolling; sequential cycle counts; spatial hardware; static dataflow graph; Benchmark testing; Dynamic scheduling; Hardware; Parallel processing; Process control; Runtime; Silicon; Compilers; Custom Computing; Dark Silicon; High-level Synthesis; Instruction Level Parallelism;
Conference_Titel :
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-4117-9
DOI :
10.1109/IPDPSW.2014.18