DocumentCode
2499189
Title
A floating-point solver for band structured linear equations
Author
Lopes, Antonio Roldao ; Constantinides, George A. ; Kerrigan, Eric C.
Author_Institution
Electr. & Electron. Eng. Dept., Imperial Coll. London, London
fYear
2008
fDate
8-10 Dec. 2008
Firstpage
353
Lastpage
356
Abstract
Field programmable gate arrays (FPGAs) have gradually been increasing their capacities and started to incorporate optimized coarse-grained modules such as BlockRAMs, multipliers, and even processors. These developments have extended their field of applications and one field that has been gaining significant interest is the acceleration of floating-point scientific computing. In this field, a recurring subtask is the solution of systems of linear equations. One well studied method that has proven to be very efficient in software and robust at finding such solutions is the conjugate gradient (CG) algorithm. In this paper we present a hardware CG method which takes advantage of the banded structure present in many common problems. With the flexibility provided by FPGAs, this implementation employs wide-parallelization to convert the per iteration computation time for an order n matrix with band width w from Theta(nw) clock cycles for a software implementation to Theta(n) in hardware. It also explores deep-pipelining so that solutions to P problems are produced every Theta(n) cycles opposed to every Theta(Pnw) cycles in software. Results demonstrate that performances up to 32 GFLOPs are achievable on a Virtex5-330T FPGA and a software comparison reports significant speed-ups in relation to high-end CPUs.
Keywords
conjugate gradient methods; field programmable gate arrays; floating point arithmetic; BlockRAM; Virtex5-330T FPGA; band structured linear equations; banded structure; conjugate gradient algorithm; field programmable gate arrays; floating-point scientific computing; floating-point solver; optimized coarse-grained modules; Acceleration; Application software; Character generation; Equations; Field programmable gate arrays; Hardware; Matrix converters; Robustness; Scientific computing; Software algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
ICECE Technology, 2008. FPT 2008. International Conference on
Conference_Location
Taipei
Print_ISBN
978-1-4244-3783-2
Electronic_ISBN
978-1-4244-2796-3
Type
conf
DOI
10.1109/FPT.2008.4762416
Filename
4762416
Link To Document