Title : 
Merrimac: Supercomputing with Streams
         
        
            Author : 
Dally, William J. ; Labonté, François ; Das, Abhishek ; Hanrahan, Patrick ; Ahn, Jung-Ho ; Gummaraju, Jayanth ; Erez, Mattan ; Jayasena, Nuwan ; Buck, Ian ; Knight, Timothy J. ; Kapasi, Ujval J.
         
        
        
        
        
        
            Abstract : 
Merrimac uses stream architecture and advanced interconnection networks to give an order of magnitude more performance per unit cost than cluster-based scientific computers built from the same technology. Organizing the computation into streams and exploiting the resulting locality using a register hierarchy enables a stream architecture to reduce the memory bandwidth required by representative applications by an order of magnitude or more. Hence a processing node with a fixed bandwidth (expensive) can support an order of magnitude more arithmetic units (inexpensive). This in turn allows a given level of performance to be achieved with fewer nodes (a 1-PFLOPS machine, for example, with just 8,192 nodes) resulting in greater reliability, and simpler system management. We sketch the design of Merrimac, a streaming scientific computer that can be scaled from a $20K 2 TFLOPS workstation to a $20M 2 PFLOPS supercomputer and present the results of some initial application experiments on this architecture.
         
        
            Keywords : 
Application software; Arithmetic; Bandwidth; Computer architecture; Computer networks; Costs; Multiprocessor interconnection networks; Organizing; Registers; Workstations;
         
        
        
        
            Conference_Titel : 
Supercomputing, 2003 ACM/IEEE Conference
         
        
            Print_ISBN : 
1-58113-695-1
         
        
        
            DOI : 
10.1109/SC.2003.10043