DocumentCode :
235161
Title :
Performance enhancement in shared-memory multiprocessors using dynamically classified sharing information
Author :
Ferdous, Nilufar ; Byeong Kil Lee ; John, Eugene
Author_Institution :
Dept. Electr. & Comput. Eng., Univ. of Texas San Antonio, San Antonio, TX, USA
fYear :
2014
fDate :
5-7 Dec. 2014
Firstpage :
1
Lastpage :
8
Abstract :
Advances in process technology has enabled the integration of many cores on a single die. The advent of many core systems has led to a commensurate increase in cache coherence complexity. As a solution to this problem, researches have proposed directory based protocols, which are scalable alternatives to snoop-based protocols. Although write-invalidation based directory protocols enhance the performance of large-scale multiprocessors, coherence misses are intrinsic impediments in such systems. Write-update protocols were proposed as a means to reduce these coherence misses. However, previous researches have shown that pure write-update protocol is highly undesirable because of the heavy traffic caused by the aggressive updates. In order to remedy these limitations, we propose a performance-aware mechanism which dynamically classifies the sharers of each cache block, either as a weak-sharing-group or an efficient-sharing-group and exploit this dynamic classification as a metric for seamless dynamic adaptation between write-invalidate and write-update strategy on a per block basis. Exploitation of the dynamic adaptation of the protocol, based on the sharing-group speculation, reduces unnecessary accesses to the shared last level cache and hence reduces the traffic caused by coherence misses and directory accesses. Simulation results on a 64-core CMP show that our proposed method can achieve 15 % (average) speedup over the baseline directory-based MOESI cache coherence protocol with PARSEC workloads. Our proposed work also reduces the L1 cache miss rate by 17 %(average). The network traffic caused by directory accesses and L1 read misses are also reduced by 16% and 17% respectively.
Keywords :
cache storage; protocols; shared memory systems; L1 cache miss rate; MOESI cache coherence protocol; PARSEC workload; cache block; cache coherence complexity; coherence miss; directory access; directory based protocols; information sharing; large-scale multiprocessors; last level cache; many core systems; performance enhancement; performance-aware mechanism; shared-memory multiprocessors; sharing-group speculation; snoop-based protocols; write-invalidate strategy; write-invalidation based directory protocols; write-update protocol; write-update strategy; Bandwidth; Coherence; Equations; Multicore processing; Protocols; Telecommunication traffic; Time-frequency analysis; MOESI; Multicore systems; cache coherence; competent-sharing based speculated update policy; core locality; efficient-sharing-group; weak-sharing-group;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Computing and Communications Conference (IPCCC), 2014 IEEE International
Conference_Location :
Austin, TX
Type :
conf
DOI :
10.1109/PCCC.2014.7017063
Filename :
7017063
Link To Document :
بازگشت