• DocumentCode
    235161
  • Title

    Performance enhancement in shared-memory multiprocessors using dynamically classified sharing information

  • Author

    Ferdous, Nilufar ; Byeong Kil Lee ; John, Eugene

  • Author_Institution
    Dept. Electr. & Comput. Eng., Univ. of Texas San Antonio, San Antonio, TX, USA
  • fYear
    2014
  • fDate
    5-7 Dec. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Advances in process technology has enabled the integration of many cores on a single die. The advent of many core systems has led to a commensurate increase in cache coherence complexity. As a solution to this problem, researches have proposed directory based protocols, which are scalable alternatives to snoop-based protocols. Although write-invalidation based directory protocols enhance the performance of large-scale multiprocessors, coherence misses are intrinsic impediments in such systems. Write-update protocols were proposed as a means to reduce these coherence misses. However, previous researches have shown that pure write-update protocol is highly undesirable because of the heavy traffic caused by the aggressive updates. In order to remedy these limitations, we propose a performance-aware mechanism which dynamically classifies the sharers of each cache block, either as a weak-sharing-group or an efficient-sharing-group and exploit this dynamic classification as a metric for seamless dynamic adaptation between write-invalidate and write-update strategy on a per block basis. Exploitation of the dynamic adaptation of the protocol, based on the sharing-group speculation, reduces unnecessary accesses to the shared last level cache and hence reduces the traffic caused by coherence misses and directory accesses. Simulation results on a 64-core CMP show that our proposed method can achieve 15 % (average) speedup over the baseline directory-based MOESI cache coherence protocol with PARSEC workloads. Our proposed work also reduces the L1 cache miss rate by 17 %(average). The network traffic caused by directory accesses and L1 read misses are also reduced by 16% and 17% respectively.
  • Keywords
    cache storage; protocols; shared memory systems; L1 cache miss rate; MOESI cache coherence protocol; PARSEC workload; cache block; cache coherence complexity; coherence miss; directory access; directory based protocols; information sharing; large-scale multiprocessors; last level cache; many core systems; performance enhancement; performance-aware mechanism; shared-memory multiprocessors; sharing-group speculation; snoop-based protocols; write-invalidate strategy; write-invalidation based directory protocols; write-update protocol; write-update strategy; Bandwidth; Coherence; Equations; Multicore processing; Protocols; Telecommunication traffic; Time-frequency analysis; MOESI; Multicore systems; cache coherence; competent-sharing based speculated update policy; core locality; efficient-sharing-group; weak-sharing-group;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Computing and Communications Conference (IPCCC), 2014 IEEE International
  • Conference_Location
    Austin, TX
  • Type

    conf

  • DOI
    10.1109/PCCC.2014.7017063
  • Filename
    7017063