DocumentCode :
2562210
Title :
Effectiveness of hardware-based stride and sequential prefetching in shared-memory multiprocessors
Author :
Dahlgren, Fredrik ; Stenstrom, Per
Author_Institution :
Dept. of Comput. Eng., Lund Univ., Sweden
fYear :
1995
fDate :
1995
Firstpage :
68
Lastpage :
77
Abstract :
We study the relative efficiency of previously proposed stride and sequential prefetching-two promising hardware-based prefetching schemes to reduce read-miss penalties in shared-memory multiprocessors. Although stride accesses dominate in four out of six of the applications we study, we find that sequential prefetching does better than stride prefetching for three applications. This is because (i) most strides are shorter than the block size (we assume 32 byte blocks), which means that sequential prefetching is as effective for stride accesses, and (ii) sequential prefetching also exploits the locality of read misses for non-stride accesses. However we find that since stride prefetching causes fewer useless prefetches, it consumes less memory-system bandwidth
Keywords :
cache storage; communication complexity; multiprocessor interconnection networks; shared memory systems; storage management; hardware-based sequential prefetching; hardware-based stride prefetching; memory-system bandwidth; read misses; read-miss penalties; shared-memory multiprocessors; Application software; Bandwidth; Coherence; Contracts; Counting circuits; Delay; Hardware; Large-scale systems; Prefetching; Read-write memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 1995. Proceedings., First IEEE Symposium on
Conference_Location :
Raleigh, NC
Print_ISBN :
0-8186-6445-2
Type :
conf
DOI :
10.1109/HPCA.1995.386554
Filename :
386554
Link To Document :
بازگشت