• DocumentCode
    2499189
  • Title

    A floating-point solver for band structured linear equations

  • Author

    Lopes, Antonio Roldao ; Constantinides, George A. ; Kerrigan, Eric C.

  • Author_Institution
    Electr. & Electron. Eng. Dept., Imperial Coll. London, London
  • fYear
    2008
  • fDate
    8-10 Dec. 2008
  • Firstpage
    353
  • Lastpage
    356
  • Abstract
    Field programmable gate arrays (FPGAs) have gradually been increasing their capacities and started to incorporate optimized coarse-grained modules such as BlockRAMs, multipliers, and even processors. These developments have extended their field of applications and one field that has been gaining significant interest is the acceleration of floating-point scientific computing. In this field, a recurring subtask is the solution of systems of linear equations. One well studied method that has proven to be very efficient in software and robust at finding such solutions is the conjugate gradient (CG) algorithm. In this paper we present a hardware CG method which takes advantage of the banded structure present in many common problems. With the flexibility provided by FPGAs, this implementation employs wide-parallelization to convert the per iteration computation time for an order n matrix with band width w from Theta(nw) clock cycles for a software implementation to Theta(n) in hardware. It also explores deep-pipelining so that solutions to P problems are produced every Theta(n) cycles opposed to every Theta(Pnw) cycles in software. Results demonstrate that performances up to 32 GFLOPs are achievable on a Virtex5-330T FPGA and a software comparison reports significant speed-ups in relation to high-end CPUs.
  • Keywords
    conjugate gradient methods; field programmable gate arrays; floating point arithmetic; BlockRAM; Virtex5-330T FPGA; band structured linear equations; banded structure; conjugate gradient algorithm; field programmable gate arrays; floating-point scientific computing; floating-point solver; optimized coarse-grained modules; Acceleration; Application software; Character generation; Equations; Field programmable gate arrays; Hardware; Matrix converters; Robustness; Scientific computing; Software algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    ICECE Technology, 2008. FPT 2008. International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4244-3783-2
  • Electronic_ISBN
    978-1-4244-2796-3
  • Type

    conf

  • DOI
    10.1109/FPT.2008.4762416
  • Filename
    4762416