An FPGA-Based Singular Value Decomposition Processor

Author

Ma, Weiwei ; Kaye, M.E. ; Luke, D.M. ; Doraiswami, R.

Author_Institution

Dept. of Electr. & Comput. Eng., New Brunswick Univ., Fredericton, NB

fYear

2006

fDate

38838

Firstpage

1047

Lastpage

1050

Abstract

A two-sided rotation Jacobi SVD algorithm is used to compute the SVD and is implemented on a two million gate FPGA. A mesh-connected array structure is proposed based on Brent, Luk, and Van Loan´s idea of an expandable square systolic array of simple 2x2 processors to compute the SVD of a large matrix, so as to shorten the iteration time and thus increase the implementation speed. The array consists of an n/2xn/2 array of 2x2 processor elements to compute the SVD of an nxn matrix. The trigonometric functions and the vector multiplication in the algorithm are tailored to the use of CORDIC (coordinate rotation digital computer) algorithms for hardware-efficient solutions. Two SVD processors, the basic SVD processor and the extended SVD processor, were developed. The algorithms to decompose the matrix were first evaluated in Matlab and then the processors were implemented using the Virtex-II FPGA from Xilinx as the target device. The basic SVD processor utilizes the proposed mesh-connected array structure and CORDIC algorithm. The implementation concentrates on utilizing the features of the FPGA to speed up operations and reduce the area required. In order to compute a large SVD without increasing the size of the FPGA, the extended SVD processor was developed to reuse the SVD array of the basic SVD processor. These two processors were successfully implemented on the FPGA device. Speed data and comparisons are presented

Keywords

Jacobian matrices; field programmable gate arrays; finite state machines; mathematics computing; mesh generation; singular value decomposition; FPGA; Jacobi SVD algorithm; Matlab; Virtex-II; Xilinx; coordinate rotation digital computer algorithm; field programmable gate array; hardware-efficient solution; mesh-connected array structure; singular value decomposition processor; trigonometric function; vector multiplication; Algebra; Computer architecture; Concurrent computing; Field programmable gate arrays; Jacobian matrices; Matrix decomposition; Parallel processing; Robustness; Singular value decomposition; Systolic arrays; fpga; jacobi; singular value decomposition;

fLanguage

English

Publisher

ieee

Conference_Titel

Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on

Conference_Location

Ottawa, Ont.

Print_ISBN

1-4244-0038-4

Electronic_ISBN

1-4244-0038-4

Type

conf

DOI

10.1109/CCECE.2006.277355

Filename

4054809