• DocumentCode
    2174273
  • Title

    Adaptive prefetching for shared cache based chip multiprocessors

  • Author

    Kandemir, Mahmut ; Zhang, Yuanrui ; Ozturk, Ozcan

  • Author_Institution
    Comput. Sci. & Eng. Dept., Pennsylvania State Univ., University Park, PA
  • fYear
    2009
  • fDate
    20-24 April 2009
  • Firstpage
    773
  • Lastpage
    778
  • Abstract
    Chip multiprocessors (CMPs) present a unique scenario for software data prefetching with subtle tradeoffs between memory bandwidth and performance. In a shared L2 based CMP, multiple cores compete for the shared on-chip cache space and limited off-chip pin bandwidth. Purely software based prefetching techniques tend to increase this contention, leading to degradation in performance. In some cases, prefetches can become harmful by kicking out useful data from the shared cache whose next usage is earlier than the prefetched data, and the fraction of such harmful prefetches usually increases when we increase the number of cores used for executing a multi-threaded application code. In this paper, we propose two complementary techniques to address the problem of harmful prefetches in the context of shared L2 based CMPs. These techniques, namely, suppressing select data prefetches (if they are found to be harmful) and pinning select data in the L2 cache (if they are found to be frequent victim of harmful prefetches), are evaluated in this paper using two embedded application codes. Our experiments demonstrate that these two techniques are very effective in mitigating the impact of harmful prefetches, and as a result, we extract significant benefits from software prefetching even with large core counts.
  • Keywords
    cache storage; multi-threading; multiprocessing systems; storage management; complementary techniques; embedded application codes; multithreaded application code; off-chip pin bandwidth; on-chip cache space; shared cache based chip multiprocessors; software based prefetching techniques; software data prefetching; Application software; Bandwidth; Data engineering; Data mining; Degradation; Prefetching; Processor scheduling; Scheduling algorithm; Software algorithms; Software performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Design, Automation & Test in Europe Conference & Exhibition, 2009. DATE '09.
  • Conference_Location
    Nice
  • ISSN
    1530-1591
  • Print_ISBN
    978-1-4244-3781-8
  • Type

    conf

  • DOI
    10.1109/DATE.2009.5090768
  • Filename
    5090768