• DocumentCode
    900608
  • Title

    Evaluation of hardware-based stride and sequential prefetching in shared-memory multiprocessors

  • Author

    Dahlgren, Fredrik ; Stenström, Per

  • Author_Institution
    Dept. of Comput. Eng., Lund Univ., Sweden
  • Volume
    7
  • Issue
    4
  • fYear
    1996
  • fDate
    4/1/1996 12:00:00 AM
  • Firstpage
    385
  • Lastpage
    398
  • Abstract
    We study the efficiency of previously proposed stride and sequential prefetching-two promising hardware-based prefetching schemes to reduce read-miss penalties in shared-memory multiprocessors. Although stride accesses dominate in four out of six of the applications we study, we find that sequential prefetching does as well as and in same cases even better than stride prefetching for five applications. This is because 1) most strides are shorter than the block size (we assume 32 byte blocks), which means that sequential prefetching is as effective for these stride accesses, and 2) sequential prefetching also exploits the locality of read misses with nonstride accesses. However, since stride prefetching in general results in fewer useless prefetches, it offers the extra advantage of consuming less memory-system bandwidth
  • Keywords
    parallel architectures; performance evaluation; shared memory systems; latency tolerance; performance evaluation; prefetching schemes; read-miss penalties; relaxed memory consistency; sequential prefetching; shared-memory multiprocessors; stride and sequential prefetching; stride prefetching; Application software; Bandwidth; Coherence; Computer Society; Counting circuits; Delay; Hardware; Multiprocessor interconnection networks; Prefetching; Read-write memory;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.494633
  • Filename
    494633