Performance enhancement in shared-memory multiprocessors using dynamically classified sharing information

Author

Ferdous, Nilufar ; Byeong Kil Lee ; John, Eugene

Author_Institution

Dept. Electr. & Comput. Eng., Univ. of Texas San Antonio, San Antonio, TX, USA

fYear

2014

fDate

5-7 Dec. 2014

Firstpage

Lastpage

Abstract

Advances in process technology has enabled the integration of many cores on a single die. The advent of many core systems has led to a commensurate increase in cache coherence complexity. As a solution to this problem, researches have proposed directory based protocols, which are scalable alternatives to snoop-based protocols. Although write-invalidation based directory protocols enhance the performance of large-scale multiprocessors, coherence misses are intrinsic impediments in such systems. Write-update protocols were proposed as a means to reduce these coherence misses. However, previous researches have shown that pure write-update protocol is highly undesirable because of the heavy traffic caused by the aggressive updates. In order to remedy these limitations, we propose a performance-aware mechanism which dynamically classifies the sharers of each cache block, either as a weak-sharing-group or an efficient-sharing-group and exploit this dynamic classification as a metric for seamless dynamic adaptation between write-invalidate and write-update strategy on a per block basis. Exploitation of the dynamic adaptation of the protocol, based on the sharing-group speculation, reduces unnecessary accesses to the shared last level cache and hence reduces the traffic caused by coherence misses and directory accesses. Simulation results on a 64-core CMP show that our proposed method can achieve 15 % (average) speedup over the baseline directory-based MOESI cache coherence protocol with PARSEC workloads. Our proposed work also reduces the L1 cache miss rate by 17 %(average). The network traffic caused by directory accesses and L1 read misses are also reduced by 16% and 17% respectively.

Keywords

cache storage; protocols; shared memory systems; L1 cache miss rate; MOESI cache coherence protocol; PARSEC workload; cache block; cache coherence complexity; coherence miss; directory access; directory based protocols; information sharing; large-scale multiprocessors; last level cache; many core systems; performance enhancement; performance-aware mechanism; shared-memory multiprocessors; sharing-group speculation; snoop-based protocols; write-invalidate strategy; write-invalidation based directory protocols; write-update protocol; write-update strategy; Bandwidth; Coherence; Equations; Multicore processing; Protocols; Telecommunication traffic; Time-frequency analysis; MOESI; Multicore systems; cache coherence; competent-sharing based speculated update policy; core locality; efficient-sharing-group; weak-sharing-group;

fLanguage

English

Publisher

ieee

Conference_Titel

Performance Computing and Communications Conference (IPCCC), 2014 IEEE International

Conference_Location

Austin, TX

Type

conf

DOI

10.1109/PCCC.2014.7017063

Filename

7017063

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=235161