Title :
Single-precision LU decomposition based on FPGA compared with CPU
Author :
Fang, Baoguang ; Chen, Shuqiang ; Wei, Xulong
Author_Institution :
Sch. of Phys. Electron., Univ. of Electron. Sci. & Technol. China, Chengdu, China
Abstract :
FPGA (field programmable gate array) has become an effective way to accelerate the numerical computing. In this paper, we proposed an architecture for small matrix single-precision LU decomposition based on FPGA and show the resource utilization of this design. We also present a novel structure for block multiply in block LU decomposition for large size matrix. The speed of this novel structure will be faster than implemented in general matrix multiplication in block LU decomposition. Our implementation on a Xilinx Virtex-6 XC6VLX240T can speed 2x compared with floating-point LU with MKL on Core-i3.
Keywords :
field programmable gate arrays; logic design; matrix multiplication; CPU; Core-i3; FPGA; MKL; Xilinx Virtex-6 XC6VLX240T; block LU decomposition; field programmable gate array; floating-point LU; matrix multiplication; single-precision LU decomposition; Algorithm design and analysis; Computer architecture; Field programmable gate arrays; Matrix decomposition; Random access memory; Vectors; FPGA; LU decompositon; accelerate; architecture; block LU;
Conference_Titel :
Computational Problem-Solving (ICCP), 2012 International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4673-1696-5
Electronic_ISBN :
978-1-4673-1695-8
DOI :
10.1109/ICCPS.2012.6384247