DocumentCode
703996
Title
Accelerating arithmetic kernels with coherent attached FPGA coprocessors
Author
Giefers, Heiner ; Polig, Raphael ; Hagleitner, Christoph
Author_Institution
IBM Res. - Zurich, Zurich, Switzerland
fYear
2015
fDate
9-13 March 2015
Firstpage
1072
Lastpage
1077
Abstract
The energy efficiency of computer systems can be increased by migrating computational kernels that are known to under-utilize the CPU to an FPGA based coprocessor. In contrast to traditional I/O-based coprocessors that require explicit data movement, coherently attached accelerators can operate on the same virtual address space than the host CPU. A shared memory organization enables widely accepted programming models and helps to deploy energy efficient accelerators in general purpose computing systems. In this paper we study an FFT accelerator on FPGA attached via the Coherent Accelerator Processor Interface (CAPI) to a POWER8 processor. Our results show that the coherent attached accelerator outperforms device driver based approaches in terms of latency. Hardware acceleration delivers a 5× gain in energy efficiency compared to an optimized parallel software FFT running on a 12-core CPU and improves single thread performance by more than 2×. We conclude that the integration of CAPI into heterogeneous programming frameworks such as OpenCL will facilitate latency critical operations and will further enhance programmability of hybrid systems.
Keywords
coprocessors; fast Fourier transforms; field programmable gate arrays; parallel processing; power aware computing; shared memory systems; 12-core CPU; CAPI; CPU; FFT accelerator; FPGA based coprocessor; I/O-based coprocessors; OpenCL; POWER8 processor; arithmetic kernel acceleration; coherent accelerator processor interface; coherent attached FPGA coprocessors; computational kernels; computer system energy efficiency; device driver based approach; heterogeneous programming frameworks; hybrid system programmability; optimized parallel software FFT; programming models; shared memory organization; Computer architecture; Coprocessors; Field programmable gate arrays; Hardware; Kernel; Programming;
fLanguage
English
Publisher
ieee
Conference_Titel
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015
Conference_Location
Grenoble
Print_ISBN
978-3-9815-3704-8
Type
conf
Filename
7092548
Link To Document