Title :
Mapping a Jacobi Iterative Solver onto a High-Performance Heterogeneous Computer
Author :
Morris, G.R. ; Abed, K.H.
Author_Institution :
Eng. R&D Center, US Army, Vicksburg, MS, USA
Abstract :
High-performance heterogeneous computers that employ field programmable gate arrays (FPGAs) as computational elements are known as high-performance reconfigurable computers (HPRCs). For floating-point applications, these FPGA-based processors must satisfy a variety of heuristics and rules of thumb to achieve a speedup compared with their software counterparts. By way of a simple sparse matrix Jacobi iterative solver, this paper illustrates some of the issues associated with mapping floating-point kernels onto HPRCs. The Jacobi method was chosen based on heuristics developed from earlier research. Furthermore, Jacobi is relatively easy to understand, yet is complex enough to illustrate the mapping issues. This paper is not trying to demonstrate the speedup of a particular application nor is it suggesting that Jacobi is the best way to solve equations. The results demonstrate a nearly threefold wall clock runtime speedup when compared with a software implementation. A formal analysis shows that these results are reasonable. The purpose of this paper is to illuminate the challenging floating-point mapping process while simultaneously showing that such mappings can result in significant speedups. The ideas revealed by research such as this have already been and should continue to be used to facilitate a more automated mapping process.
Keywords :
Jacobian matrices; field programmable gate arrays; floating point arithmetic; iterative methods; mathematics computing; reconfigurable architectures; sparse matrices; FPGA-based processor; HPRC; Jacobi iterative solver; automated mapping process; computational elements; field programmable gate arrays; floating point applications; floating point kernels; floating point mapping process; formal analysis; high-performance heterogeneous computer; high-performance reconfigurable computers; sparse matrix; wall clock runtime speedup; Field programmable gate arrays; Iterative methods; Jacobian matrices; Reconfigurable architectures; Field programmable gate array (FPGA); Jacobi iterative method; high-performance heterogeneous computer (HPHC); high-performance reconfigurable computer (HPRC); reconfigurable computer (RC);
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
DOI :
10.1109/TPDS.2012.121