Title :
Throughput-oriented kernel porting onto FPGAs
Author :
Papakonstantinou, Alexandros ; Deming Chen ; Wen-mei Hwu ; Cong, J. ; Yun Liang
Author_Institution :
ECE Dept., Univ. of Illinois, Urbana, IL, USA
fDate :
May 29 2013-June 7 2013
Abstract :
Reconfigurable devices are often employed in heterogeneous systems due to their low power and parallel processing advantages. An important usability requirement is the support of a homogeneous programming interface. Nevertheless, homogeneous programming interfaces do not eliminate the need for code tweaking to enable efficient mapping of the computation across heterogeneous architectures. In this work we propose a code optimization framework which analyzes and restructures CUDA kernels that are optimized for GPU devices in order to facilitate synthesis of high-throughput custom accelerators on FPGAs. The proposed framework enables efficient performance porting without manual code tweaking or annotation by the user. A hierarchical region graph in tandem with code motions and graph coloring of array variables is employed to restructure the kernel for high throughput execution on FPGAs.
Keywords :
field programmable gate arrays; graph theory; optimising compilers; parallel architectures; CUDA kernels; FPGA; GPU devices; array variables; code motions; code optimization framework; graph coloring; heterogeneous systems; hierarchical region graph; high throughput execution; high-throughput custom accelerators; homogehomogeneous programming interface; parallel processing advantages; performance porting; reconfigurable devices; throughput-oriented kernel porting; usability requirement; Field programmable gate arrays; Graphics processing units; Kernel; Optimization; Resource management; Synchronization; Throughput;
Conference_Titel :
Design Automation Conference (DAC), 2013 50th ACM/EDAC/IEEE
Conference_Location :
Austin, TX