Title :
Hybrid limited-pointer linked-list cache directory and cache coherence protocol
Author :
Mahmoud, Mohamed ; Wassal, A.
Author_Institution :
Comput. Eng. Dept., Cairo Univ., Cairo, Egypt
Abstract :
The rise of Chip-Multiprocessors (CMPs) as a promising trend for the state of the art high-performance processors design raised the need for a scalable cache directory organization along with a simple cache coherence protocol as a hot research area. While thousands of cores are expected to fit on a single chip soon, the previously proposed cache directory schemes still lacks the scalability to accommodate more than tens of cores. The inefficiencies of these directory schemes come in the form of unaffordable memory overhead, excessive coherence traffic leading to performance degradation due to inexact representation of sharers and very complex coherence protocols. In this paper we introduce a new cache directory scheme for many core CMPs. The proposed scheme acquires, and actually improves, the scalability and low coherence traffic of cache-based linked list directory schemes while avoiding its completely sequential operation by exploiting the parallel operation of limited pointer directory schemes. We compare the proposed organization with these two previously proposed ones on different CMP configurations starting with a 4-core CMP and ending with a 32-core CMP. We show that the proposed scheme can avoid one third of the excessive broadcasted invalidation messages and two thirds of the extraneous acks in case of directory pointer overflows in limited pointer schemes. On the other hand, the proposed scheme achieves around 10% better performance than that of the completely sequential cache-based linked list directory while reducing the number of invalidation messages per invalidation event by 24%.
Keywords :
cache storage; multiprocessing systems; parallel processing; CMP configurations; cache coherence protocol; chip-multiprocessors; completely sequential cache-based linked list directory; directory pointer overflows; hybrid limited-pointer linked-list cache directory; invalidation event; invalidation messages; low coherence traffic; many core CMP; parallel operation; scalability; Benchmark testing; Coherence; Computers; Program processors; Protocols; Sea measurements; Vectors; Cache directory; cache coherence protocol; chip multiprocessor; coherence traffic; computer architecture; high performance processors; many-core CMP; on-chip network traffic;
Conference_Titel :
Electronics, Communications and Computers (JEC-ECC), 2013 Japan-Egypt International Conference on
Conference_Location :
6th of October City
DOI :
10.1109/JEC-ECC.2013.6766389