DocumentCode :
3139247
Title :
An FPGA-Based Singular Value Decomposition Processor
Author :
Ma, Weiwei ; Kaye, M.E. ; Luke, D.M. ; Doraiswami, R.
Author_Institution :
Dept. of Electr. & Comput. Eng., New Brunswick Univ., Fredericton, NB
fYear :
2006
fDate :
38838
Firstpage :
1047
Lastpage :
1050
Abstract :
A two-sided rotation Jacobi SVD algorithm is used to compute the SVD and is implemented on a two million gate FPGA. A mesh-connected array structure is proposed based on Brent, Luk, and Van Loan´s idea of an expandable square systolic array of simple 2x2 processors to compute the SVD of a large matrix, so as to shorten the iteration time and thus increase the implementation speed. The array consists of an n/2xn/2 array of 2x2 processor elements to compute the SVD of an nxn matrix. The trigonometric functions and the vector multiplication in the algorithm are tailored to the use of CORDIC (coordinate rotation digital computer) algorithms for hardware-efficient solutions. Two SVD processors, the basic SVD processor and the extended SVD processor, were developed. The algorithms to decompose the matrix were first evaluated in Matlab and then the processors were implemented using the Virtex-II FPGA from Xilinx as the target device. The basic SVD processor utilizes the proposed mesh-connected array structure and CORDIC algorithm. The implementation concentrates on utilizing the features of the FPGA to speed up operations and reduce the area required. In order to compute a large SVD without increasing the size of the FPGA, the extended SVD processor was developed to reuse the SVD array of the basic SVD processor. These two processors were successfully implemented on the FPGA device. Speed data and comparisons are presented
Keywords :
Jacobian matrices; field programmable gate arrays; finite state machines; mathematics computing; mesh generation; singular value decomposition; FPGA; Jacobi SVD algorithm; Matlab; Virtex-II; Xilinx; coordinate rotation digital computer algorithm; field programmable gate array; hardware-efficient solution; mesh-connected array structure; singular value decomposition processor; trigonometric function; vector multiplication; Algebra; Computer architecture; Concurrent computing; Field programmable gate arrays; Jacobian matrices; Matrix decomposition; Parallel processing; Robustness; Singular value decomposition; Systolic arrays; fpga; jacobi; singular value decomposition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
Conference_Location :
Ottawa, Ont.
Print_ISBN :
1-4244-0038-4
Electronic_ISBN :
1-4244-0038-4
Type :
conf
DOI :
10.1109/CCECE.2006.277355
Filename :
4054809
Link To Document :
بازگشت