DocumentCode :
258500
Title :
Characterization of OpenCL on a scalable FPGA architecture
Author :
Shanyuan Gao ; Chritz, Jeremy
fYear :
2014
fDate :
8-10 Dec. 2014
Firstpage :
1
Lastpage :
6
Abstract :
The recent release of Altera´s SDK for OpenCL has greatly eased the development of FPGA-based systems. Research have shown performance improvements brought by OpenCL using a single FPGA device. However, to meet the objectives of high performance computing, OpenCL needs to be evaluated using multiple FPGAs. This work has proposed a scalable FPGA architecture for high performance computing. The design includes multiple FPGA modules and a high performance backplane. The modular nature of this architecture supports the combination of different FPGAs, as well as provides for easy hardware updates. FPGA modules based on Stratix V are compatible with Altera´s OpenCL tool flow. The evaluation has tested the native IO performance of the architecture and the results have demonstrated scalability using six FPGAs. The host-to-device peak bandwidth is measured as 13.1 GB/s for read operation and 12.1 GB/s for write operation. The FPGA-to-memory bandwidth is measured as 64.5 GB/s in total. An OpenCL AES kernel is selected to test the scalable multi-FPGA architecture. The test results have shown peak throughput is achiveded when six FPGAs are used. The throughput per watt shows 5× improvement using four FPGAs, over a general-purpose processor.
Keywords :
field programmable gate arrays; memory architecture; parallel processing; Altera SDK; FPGA module; FPGA-based system; FPGA-to-memory bandwidth; OpenCL AES kernel; OpenCL tool flow; Stratix V; general-purpose processor; hardware update; high performance backplane; high performance computing; host-to-device peak bandwidth; multiFPGA architecture; multiple FPGA; native IO performance; peak throughput; read operation; scalable FPGA architecture; single FPGA device; write operation; Backplanes; Bandwidth; Computer architecture; Field programmable gate arrays; Hardware design languages; Kernel; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ReConFigurable Computing and FPGAs (ReConFig), 2014 International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4799-5943-3
Type :
conf
DOI :
10.1109/ReConFig.2014.7032505
Filename :
7032505
Link To Document :
بازگشت