Title :
Non-Preconditioned Conjugate Gradient on Cell and FPGA Based Hybrid Supercomputer Nodes
Author :
DuBois, David ; DuBois, Andrew ; Boorman, Thomas ; Connor, Carolyn
Author_Institution :
Los Alamos Nat. Lab., Los Alamos, NM, USA
Abstract :
This work presents a detailed implementation of a double precision, non-preconditioned, conjugate gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes utilize the Cell Broadband Engine Architecturetrade in conjunction with x86 Opterontrade processors from AMD. We implement a common conjugate gradient algorithm, on a variety of systems, to compare and contrast performance. Implementation results are presented for the Roadrunner hybrid supercomputer, SRC Computers, Inc. MAPStation SRC-6 FPGA enhanced hybrid supercomputer, and AMD Opteron only. In all hybrid implementations wall clock time is measured, including all transfer overhead and compute timings.
Keywords :
conjugate gradient methods; field programmable gate arrays; mainframes; FPGA based hybrid supercomputer nodes; Roadrunner heterogeneous supercomputer node; cell broadband engine architecture; conjugate gradient algorithm; Blades; Character generation; Computer architecture; Field programmable gate arrays; Iterative methods; Kernel; Linear systems; Sparse matrices; Supercomputers; Vectors;
Conference_Titel :
Field Programmable Custom Computing Machines, 2009. FCCM '09. 17th IEEE Symposium on
Conference_Location :
Napa, CA
Print_ISBN :
978-0-7695-3716-0
DOI :
10.1109/FCCM.2009.26