• DocumentCode
    3139247
  • Title

    An FPGA-Based Singular Value Decomposition Processor

  • Author

    Ma, Weiwei ; Kaye, M.E. ; Luke, D.M. ; Doraiswami, R.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., New Brunswick Univ., Fredericton, NB
  • fYear
    2006
  • fDate
    38838
  • Firstpage
    1047
  • Lastpage
    1050
  • Abstract
    A two-sided rotation Jacobi SVD algorithm is used to compute the SVD and is implemented on a two million gate FPGA. A mesh-connected array structure is proposed based on Brent, Luk, and Van Loan´s idea of an expandable square systolic array of simple 2x2 processors to compute the SVD of a large matrix, so as to shorten the iteration time and thus increase the implementation speed. The array consists of an n/2xn/2 array of 2x2 processor elements to compute the SVD of an nxn matrix. The trigonometric functions and the vector multiplication in the algorithm are tailored to the use of CORDIC (coordinate rotation digital computer) algorithms for hardware-efficient solutions. Two SVD processors, the basic SVD processor and the extended SVD processor, were developed. The algorithms to decompose the matrix were first evaluated in Matlab and then the processors were implemented using the Virtex-II FPGA from Xilinx as the target device. The basic SVD processor utilizes the proposed mesh-connected array structure and CORDIC algorithm. The implementation concentrates on utilizing the features of the FPGA to speed up operations and reduce the area required. In order to compute a large SVD without increasing the size of the FPGA, the extended SVD processor was developed to reuse the SVD array of the basic SVD processor. These two processors were successfully implemented on the FPGA device. Speed data and comparisons are presented
  • Keywords
    Jacobian matrices; field programmable gate arrays; finite state machines; mathematics computing; mesh generation; singular value decomposition; FPGA; Jacobi SVD algorithm; Matlab; Virtex-II; Xilinx; coordinate rotation digital computer algorithm; field programmable gate array; hardware-efficient solution; mesh-connected array structure; singular value decomposition processor; trigonometric function; vector multiplication; Algebra; Computer architecture; Concurrent computing; Field programmable gate arrays; Jacobian matrices; Matrix decomposition; Parallel processing; Robustness; Singular value decomposition; Systolic arrays; fpga; jacobi; singular value decomposition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 2006. CCECE '06. Canadian Conference on
  • Conference_Location
    Ottawa, Ont.
  • Print_ISBN
    1-4244-0038-4
  • Electronic_ISBN
    1-4244-0038-4
  • Type

    conf

  • DOI
    10.1109/CCECE.2006.277355
  • Filename
    4054809