• DocumentCode
    299697
  • Title

    Dynamic self-invalidation: reducing coherence overhead in shared-memory multiprocessors

  • Author

    Lebeck, Alvin R. ; Wood, David A.

  • Author_Institution
    Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA
  • fYear
    1995
  • fDate
    22-24 June 1995
  • Firstpage
    48
  • Lastpage
    59
  • Abstract
    The paper introduces dynamic self-invalidation (DSI), a new technique for reducing cache coherence overhead in shared-memory multiprocessors. DSI eliminates invalidation messages by having a processor automatically invalidate its local copy of a cache block before a conflicting access by another processor. Eliminating invalidation overhead is particularly important under sequential consistency: where the latency of invalidating outstanding copies can increase a program´s critical path. DSI is applicable to software, hardware, and hybrid coherence schemes. We evaluate DSI in the context of hardware directory-based write-invalidate coherence protocols. Our results show that DSI reduces execution time of a sequentially consistent full-map coherence protocol by as much as 41%. This is comparable to an implementation of weak consistency that uses a coalescing write-buffer to allow up to 16 outstanding requests for exclusive blocks. When used in conjunction with weak consistency DSI can exploit tear-off blocks-which eliminate both invalidation and acknowledgment messages-for a total reduction in messages of up to 26%.
  • Keywords
    cache storage; protocols; shared memory systems; software performance evaluation; acknowledgment message elimination; cache coherence overhead reduction; coalescing write-buffer; conflicting access; dynamic self-invalidation; exclusive blocks; execution time; hardware; hardware directory-based write-invalidate coherence protocols; hybrid coherence schemes; invalidation message elimination; local cache block copy; outstanding copies; outstanding requests; program critical path; sequential consistency; sequentially consistent full-map coherence protocol; shared-memory multiprocessors; software; tear-off blocks; Access protocols; Aerospace electronics; Bandwidth; Cache memory; Delay; Distributed computing; Government; Hardware; Permission;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 1995. Proceedings., 22nd Annual International Symposium on
  • Conference_Location
    Santa Margherita Ligure, Italy
  • ISSN
    1063-6897
  • Print_ISBN
    0-89791-698-0
  • Type

    conf

  • Filename
    524548