DocumentCode :
2571198
Title :
Single-precision LU decomposition based on FPGA compared with CPU
Author :
Fang, Baoguang ; Chen, Shuqiang ; Wei, Xulong
Author_Institution :
Sch. of Phys. Electron., Univ. of Electron. Sci. & Technol. China, Chengdu, China
fYear :
2012
fDate :
19-21 Oct. 2012
Firstpage :
302
Lastpage :
305
Abstract :
FPGA (field programmable gate array) has become an effective way to accelerate the numerical computing. In this paper, we proposed an architecture for small matrix single-precision LU decomposition based on FPGA and show the resource utilization of this design. We also present a novel structure for block multiply in block LU decomposition for large size matrix. The speed of this novel structure will be faster than implemented in general matrix multiplication in block LU decomposition. Our implementation on a Xilinx Virtex-6 XC6VLX240T can speed 2x compared with floating-point LU with MKL on Core-i3.
Keywords :
field programmable gate arrays; logic design; matrix multiplication; CPU; Core-i3; FPGA; MKL; Xilinx Virtex-6 XC6VLX240T; block LU decomposition; field programmable gate array; floating-point LU; matrix multiplication; single-precision LU decomposition; Algorithm design and analysis; Computer architecture; Field programmable gate arrays; Matrix decomposition; Random access memory; Vectors; FPGA; LU decompositon; accelerate; architecture; block LU;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Problem-Solving (ICCP), 2012 International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4673-1696-5
Electronic_ISBN :
978-1-4673-1695-8
Type :
conf
DOI :
10.1109/ICCPS.2012.6384247
Filename :
6384247
Link To Document :
بازگشت