DocumentCode
451097
Title
Measuring Memory Hierarchy Performance of Cache-Coherent Multiprocessors Using Micro Benchmarks
Author
Hristea, Cristina ; Lenoski, Daniel ; Keen, John
Author_Institution
Massachusetts Institute of Technology
fYear
1997
fDate
15-21 Nov. 1997
Firstpage
45
Lastpage
45
Abstract
Even with today´s large caches, the increasing performance gap between processors and memory systems imposes a memory bottleneck for many important scientific and com mercial applications. This bottleneck is intensified in shared-memory multiprocessors by contention and the ef fects of cache coherency. Under heavy memory contention, the memory latency may increase two or three times. Nonethless, as more sophisticated techniques are used to hide latency and increase bandwidth, measuring memory performance has become increasingly difficult. Previous simple methods to measure memory performance can overestimate unipro cessor memory latency and underestimate bandwidth by tens of percent. We introduce a micro benchmark suite that measures memory hierarchy performance in light of both uniprocessor optimizations and the contention and coherence effects of multiprocessors. The benchmark suite has been used to improve the memory system performance of the SGI Origin multiprocessor.
Keywords
Bandwidth; Delay effects; Filling; Graphics; Microprocessors; Prefetching; Read-write memory; Registers; Silicon; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
Supercomputing, ACM/IEEE 1997 Conference
Print_ISBN
0-89791-985-8
Type
conf
DOI
10.1109/SC.1997.10042
Filename
1592626
Link To Document