DocumentCode
1772601
Title
Analyzing the energy-efficiency of dense linear algebra kernels by power-profiling a hybrid CPU/FPGA system
Author
Giefers, Heiner ; Polig, Raphael ; Hagleitner, Christoph
Author_Institution
IBM Res. - Zurich, Zurich, Switzerland
fYear
2014
fDate
18-20 June 2014
Firstpage
92
Lastpage
99
Abstract
It has been shown that FPGA accelerators can outperform pure CPU systems for highly parallel applications and they are considered as a power-efficient alternative to software programmable processors. However, when using FPGA accelerator cards in a server environment multiple sources of power consumption have to get taken into account in order to rate the systems energy-efficiency. In this paper we study the energy-efficiency of a hybrid CPU/FPGA system for a dense linear algebra kernel. We present an FPGA GEMM accelerator architecture that can be tailored to various data types. The performance and energy consumption is compared against tuned, multi-threaded GEMM functions running on the host CPU. We measure the power consumption with internal current/voltage sensors and break down the power draw to the systems components in order to classify the energy consumed by the processor cores, the memory, the I/O bus system and the FPGA card. Our experimental results show that the FPGA-accelerated DGEMM is less energy-efficient than a multi-threaded software implementation with respect to the full systems power consumption, but the most efficient choice when only the dynamic parts of the power are factored in.
Keywords
field programmable gate arrays; linear algebra; multi-threading; power aware computing; FPGA GEMM accelerator architecture; FPGA accelerator cards; FPGA card; I/O bus system; dense linear algebra kernels; energy efficiency analysis; hybrid CPU/FPGA system; internal current-voltage sensors; multithreaded software implementation; parallel applications; power consumption; power profiling; software programmable processors; Adders; Computer architecture; Field programmable gate arrays; Kernel; Power demand; Servers; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Application-specific Systems, Architectures and Processors (ASAP), 2014 IEEE 25th International Conference on
Conference_Location
Zurich
Type
conf
DOI
10.1109/ASAP.2014.6868642
Filename
6868642
Link To Document