• DocumentCode
    103777
  • Title

    High-Performance Architecture for the Conjugate Gradient Solver on FPGAs

  • Author

    Guiming Wu ; Xianghui Xie ; Yong Dou ; Miao Wang

  • Author_Institution
    Key Lab. of Math. Eng. & Adv. Comput., Wuxi, China
  • Volume
    60
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    791
  • Lastpage
    795
  • Abstract
    The conjugate gradient (CG) solver is an important algorithm for solving the symmetric positive define systems. However, existing CG architectures on field-programmable gate arrays (FPGAs) either need aggressive zero padding or can only be applied for small matrices and particular matrix sparsity patterns. This brief proposes a high-performance architecture for the CG solver on FPGAs, which can handle sparse linear systems with arbitrary size and sparsity pattern. Furthermore, it does not need aggressive zero padding. Our CG architecture mainly consists of a high-throughput sparse matrix-vector multiplication design including a multi-output adder tree, a reduction circuit, and a sum sequencer. Our experimental results demonstrate that our CG architecture can achieve speedup of 4.62X-9.24X on a Virtex5-330 FPGA, relative to a software implementation.
  • Keywords
    adders; conjugate gradient methods; field programmable gate arrays; matrix multiplication; sparse matrices; trees (mathematics); CG architectures; CG solver; FPGA; aggressive zero padding; conjugate gradient solver; field-programmable gate arrays; high-performance architecture; high-throughput sparse matrix-vector multiplication design; matrix sparsity patterns; multioutput adder tree; reduction circuit; software implementation; sparse linear systems; sum sequencer; symmetric positive define systems; Adders; Clocks; Computer architecture; Field programmable gate arrays; Random access memory; Sparse matrices; Vectors; Conjugate gradient (CG) solver; field-programmable gate array (FPGA); sparse matrix-vector multiplication (SpMV);
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems II: Express Briefs, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1549-7747
  • Type

    jour

  • DOI
    10.1109/TCSII.2013.2278111
  • Filename
    6587761