• DocumentCode
    75502
  • Title

    CLU: Co-Optimizing Locality and Utility in Thread-Aware Capacity Management for Shared Last Level Caches

  • Author

    Dongyuan Zhan ; Hong Jiang ; Seth, Sharad C.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Nebraska-Lincoln, Lincoln, NE, USA
  • Volume
    63
  • Issue
    7
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    1656
  • Lastpage
    1667
  • Abstract
    Most chip-multiprocessors nowadays adopt a large shared last-level cache (SLLC). This paper is motivated by our analysis and evaluation of state-of-the-art cache management proposals which reveal a common weakness. That is, the existing alternative replacement policies and cache partitioning schemes, targeted at optimizing either locality or utility of co-scheduled threads, cannot deliver consistently the best performance under a variety of workloads. Therefore, we propose a novel adaptive scheme, called CLU, to interactively co-optimize the locality and utility of co-scheduled threads in thread-aware SLLC capacity management. CLU employs lightweight monitors to dynamically profile the LRU (least recently used) and BIP (bimodal insertion policy) hit curves of individual threads on runtime, enabling the scheme to co-optimize the locality and utility of concurrent threads and thus adapt to more diverse workloads than the existing approaches. We provide results from extensive execution-driven simulation experiments to demonstrate the feasibility and efficacy of CLU over the existing approaches (TADIP, NUCACHE, TA-DRRIP, UCP, and PIPP).
  • Keywords
    cache storage; multiprocessing systems; BIP; CLU scheme; LRU; NUCACHE approach; PIPP approach; SLLC capacity management; TA-DRRIP approach; TADIP approach; UCP approach; bimodal insertion policy; cache management; cache partitioning schemes; chip-multiprocessors; least recently used; replacement policies; shared last level caches; thread-aware capacity management; Benchmark testing; Instruction sets; Message systems; Monitoring; Resource management; Runtime; Throughput; Capacity management; chip multiprocessors; locality and utility co-optimization; shared last level caches;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2012.277
  • Filename
    6361381