Title :
Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study
Author :
Cope, B. ; Cheung, Peter Y K ; Luk, Wayne ; Howes, Lee
Author_Institution :
Dept. of Electr. & Electron. Eng., Imperial Coll. London, London, UK
fDate :
4/1/2010 12:00:00 AM
Abstract :
A systematic approach to the comparison of the graphics processor (GPU) and reconfigurable logic is defined in terms of three throughput drivers. The approach is applied to five case study algorithms, characterized by their arithmetic complexity, memory access requirements, and data dependence, and two target devices: the nVidia GeForce 7900 GTX GPU and a Xilinx Virtex-4 field programmable gate array (FPGA). Two orders of magnitude speedup, over a general-purpose processor, is observed for each device for arithmetic intensive algorithms. An FPGA is superior, over a GPU, for algorithms requiring large numbers of regular memory accesses, while the GPU is superior for algorithms with variable data reuse. In the presence of data dependence, the implementation of a customized data path in an FPGA exceeds GPU performance by up to eight times. The trends of the analysis to newer and future technologies are analyzed.
Keywords :
computational complexity; computer graphics; coprocessors; field programmable gate arrays; FPGA; Xilinx Virtex-4 field programmable gate array; arithmetic complexity; arithmetic intensive algorithms; data dependence; general-purpose processor; graphics processor; graphics processors; memory access requirements; nVidia GeForce 7900 GTX GPU; reconfigurable logic; target devices:; Algorithm design and analysis; Arithmetic; Field programmable gate arrays; Graphics; Kernel; Motion estimation; Programmable logic arrays; Reconfigurable logic; Signal processing algorithms; Throughput; Graphics processors; performance measures; real-time and embedded systems; reconfigurable hardware; signal processing systems; video.;
Journal_Title :
Computers, IEEE Transactions on
DOI :
10.1109/TC.2009.179