DocumentCode
451260
Title
Merrimac: Supercomputing with Streams
Author
Dally, William J. ; Labonté, François ; Das, Abhishek ; Hanrahan, Patrick ; Ahn, Jung-Ho ; Gummaraju, Jayanth ; Erez, Mattan ; Jayasena, Nuwan ; Buck, Ian ; Knight, Timothy J. ; Kapasi, Ujval J.
fYear
2003
fDate
15-21 Nov. 2003
Firstpage
35
Lastpage
35
Abstract
Merrimac uses stream architecture and advanced interconnection networks to give an order of magnitude more performance per unit cost than cluster-based scientific computers built from the same technology. Organizing the computation into streams and exploiting the resulting locality using a register hierarchy enables a stream architecture to reduce the memory bandwidth required by representative applications by an order of magnitude or more. Hence a processing node with a fixed bandwidth (expensive) can support an order of magnitude more arithmetic units (inexpensive). This in turn allows a given level of performance to be achieved with fewer nodes (a 1-PFLOPS machine, for example, with just 8,192 nodes) resulting in greater reliability, and simpler system management. We sketch the design of Merrimac, a streaming scientific computer that can be scaled from a $20K 2 TFLOPS workstation to a $20M 2 PFLOPS supercomputer and present the results of some initial application experiments on this architecture.
Keywords
Application software; Arithmetic; Bandwidth; Computer architecture; Computer networks; Costs; Multiprocessor interconnection networks; Organizing; Registers; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, 2003 ACM/IEEE Conference
Print_ISBN
1-58113-695-1
Type
conf
DOI
10.1109/SC.2003.10043
Filename
1592938
Link To Document