• DocumentCode
    3548196
  • Title
    Hybrid limited-pointer linked-list cache directory and cache coherence protocol
  • Author
    Mahmoud, Mohamed ; Wassal, A.
  • Author_Institution
    Comput. Eng. Dept., Cairo Univ., Cairo, Egypt
  • fYear
    2013
  • fDate
    17-19 Dec. 2013
  • Firstpage
    77
  • Lastpage
    82
  • Abstract
    The rise of chip multiprocessors (CMPs) as a promising trend in state-of-the-art high-performance processor design has made scalable cache directory organization, together with a simple cache coherence protocol, a hot research area. While thousands of cores are expected to fit on a single chip soon, previously proposed cache directory schemes still lack the scalability to accommodate more than tens of cores. The inefficiencies of these directory schemes take the form of unaffordable memory overhead, excessive coherence traffic that degrades performance because sharers are represented inexactly, and very complex coherence protocols. In this paper we introduce a new cache directory scheme for many-core CMPs. The proposed scheme retains, and actually improves on, the scalability and low coherence traffic of cache-based linked-list directory schemes while avoiding their completely sequential operation by exploiting the parallel operation of limited-pointer directory schemes. We compare the proposed organization with these two earlier schemes on CMP configurations ranging from 4 to 32 cores. We show that the proposed scheme avoids one third of the excess broadcast invalidation messages and two thirds of the extraneous acks that limited-pointer schemes incur on directory pointer overflow. Compared with the completely sequential cache-based linked-list directory, the proposed scheme achieves around 10% better performance while reducing the number of invalidation messages per invalidation event by 24%.
  • Keywords
    cache storage; multiprocessing systems; parallel processing; CMP configurations; cache coherence protocol; chip-multiprocessors; completely sequential cache-based linked list directory; directory pointer overflows; hybrid limited-pointer linked-list cache directory; invalidation event; invalidation messages; low coherence traffic; many core CMP; parallel operation; scalability; Benchmark testing; Coherence; Computers; Program processors; Protocols; Sea measurements; Vectors; Cache directory; cache coherence protocol; chip multiprocessor; coherence traffic; computer architecture; high performance processors; many-core CMP; on-chip network traffic;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electronics, Communications and Computers (JEC-ECC), 2013 Japan-Egypt International Conference on
  • Conference_Location
    6th of October City
  • Type
    conf
  • DOI
    10.1109/JEC-ECC.2013.6766389
  • Filename
    6766389