Title : 
A. novel approach to reduce L2 miss latency in shared-memory multiprocessors
         
        
            Author : 
Acacio, M.E. ; Gonzalez, J. ; Garcia, J.M. ; Duato, J.
         
        
            Author_Institution : 
Dpto. Ing. y Tecnologia de Computadores, Murcia Univ., Spain
         
        
        
        
            Abstract : 
Recent technology improvements allow multiprocessor designers to put some key components inside the processor chip, such as the memory controller, the coherence hardware and the network interface/router. In this work we exploit such integration scale, presenting a novel node architecture aimed at reducing the long L2 miss latencies and the memory overhead of using directories that characterize cc-NUMA machines and limit their scalability. Our proposal replaces the traditional directory with a novel three-level directory architecture and adds a small shared data cache to each of the nodes of a multiprocessor system. Due to their small size, the first-level directory and the shared data cache are integrated into the processor chip in every node. A taxonomy of the L2 misses, according to the actions performed by the directory to satisfy them is also presented. Using execution-driven simulations, we show significant L2 miss latency reductions (more than 60% in some cases). These important improvements translate into reductions of more than 30% in the application execution time in some cases.
         
        
            Keywords : 
cache storage; parallel architectures; performance evaluation; shared memory systems; L2 miss latency reduction; cc-NUMA machines; coherence hardware; execution-driven simulations; memory controller; memory overhead; network interface; node architecture; scalability; shared data cache; shared-memory multiprocessors; three-level directory architecture; Computer interfaces; Computer networks; Delay; Hardware; Hip; Network interfaces; Process design; Protocols; Scalability; Taxonomy;
         
        
        
        
            Conference_Titel : 
Parallel and Distributed Processing Symposium., Proceedings International, IPDPS 2002, Abstracts and CD-ROM
         
        
            Conference_Location : 
Ft. Lauderdale, FL
         
        
            Print_ISBN : 
0-7695-1573-8
         
        
        
            DOI : 
10.1109/IPDPS.2002.1015554