DocumentCode :
1121316
Title :
High-Performance Designs for Linear Algebra Operations on Reconfigurable Hardware
Author :
Zhuo, Ling ; Prasanna, Viktor K.
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Southern California Univ., Los Angeles, CA
Volume :
57
Issue :
8
fYear :
2008
Firstpage :
1057
Lastpage :
1071
Abstract :
Numerical linear algebra operations are key primitives in scientific computing. Performance optimizations of such operations have been extensively investigated. With the rapid advances in technology, hardware acceleration of linear algebra applications using FPGAs (field programmable gate arrays) has become feasible. In this paper, we propose FPGA-based designs for several basic linear algebra operations, including dot product, matrix-vector multiplication, matrix multiplication and matrix factorization. By identifying the parameters for each operation, we analyze the trade-offs and propose a high-performance design. In the implementations of the designs, the values of the parameters are determined according to the hardware constraints, such as the available chip area, the size of available memory, the memory bandwidth, and the number of I/O pins. The proposed designs are implemented on Xilinx Virtex-II Pro FPGAs. Experimental results show that our designs scale with the available hardware resources. Also, the performance of our designs compares favorably with that of general-purpose processor based designs. We also show that with faster floating-point units and larger devices, the performance of our designs increases accordingly.
Keywords :
field programmable gate arrays; floating point arithmetic; linear algebra; matrix decomposition; reconfigurable architectures; FPGA; Xilinx Virtex-II Pro; available chip area; dot product; field programmable gate arrays; floating-point units; hardware acceleration; hardware constraints; high-performance designs; matrix factorization; matrix-vector multiplication; memory bandwidth; numerical linear algebra operations; reconfigurable hardware; scientific computing; Acceleration; Application software; Bandwidth; Computer architecture; Delay; Field programmable gate arrays; Hardware; Linear algebra; Pins; Scientific computing; Computations on matrices; Parallel algorithms; Reconfigurable hardware;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2008.55
Filename :
4483504
Link To Document :
بازگشت