DocumentCode :
1829578
Title :
Complexity/performance tradeoffs with non-blocking loads
Author :
Farkas, Keith I. ; Jouppi, Norman P.
Author_Institution :
Dept. of Electr. & Comput. Eng., Toronto Univ., Ont., Canada
fYear :
1994
fDate :
18-21 Apr 1994
Firstpage :
211
Lastpage :
222
Abstract :
Non-blocking loads are a very effective technique for tolerating the cache-miss latency on data cache references. The authors describe several methods for implementing non-blocking loads. A range of resulting hardware complexity/performance tradeoffs are investigated using an object-code translation and instrumentation system. The authors investigate the SPEC92 benchmarks and have found that for the integer benchmarks, a simple hit-under-miss implementation achieves almost all of the available performance improvement for relatively little cost. However, for most of the numeric benchmarks, more expensive implementations are worthwhile. The results also point out the importance of using a compiler capable of scheduling load instructions for cache misses rather than cache hits in non-blocking systems
Keywords :
buffer storage; memory architecture; performance evaluation; program compilers; SPEC92 benchmarks; cache misses; cache-miss latency; compiler; complexity/performance tradeoffs; data cache references; load instructions; non-blocking loads; numeric benchmarks; performance improvement; Buffer storage; Costs; Delay; Educational institutions; Hardware; Instruments; Microprocessors; Processor scheduling; Registers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 1994., Proceedings the 21st Annual International Symposium on
Conference_Location :
Chicago, IL
Print_ISBN :
0-8186-5510-0
Type :
conf
DOI :
10.1109/ISCA.1994.288148
Filename :
288148
Link To Document :
بازگشت