DocumentCode :
2279931
Title :
Cache performance in vector supercomputers
Author :
Kontothanassis, L.I. ; Sugumar, R.A. ; Faanes, G.J. ; Smith, J.E. ; Scott, M.L.
Author_Institution :
Dept. of Comput. Sci., Rochester Univ., NY, USA
fYear :
1994
fDate :
14-18 Nov 1994
Firstpage :
255
Lastpage :
264
Abstract :
Traditional supercomputers use a flat multi-bank SRAM memory organization to supply high bandwidth at low latency. Most other computers use a hierarchical organization with a small SRAM cache and a slower, cheaper DRAM for the main memory. Such systems rely heavily on data locality for achieving optimum performance. This paper evaluates cache-based memory systems for vector supercomputers. We develop a simulation model for a cache-based version of the Cray Research C90 and use the NAS parallel benchmarks to provide a large-scale workload. We show that while caches reduce memory traffic and improve the performance of plain DRAM memory, they still lag behind cacheless SRAM. We identify the performance bottlenecks in DRAM-based memory systems and quantify their contribution to program performance degradation. We find the data fetch strategy to be a significant parameter affecting performance, we evaluate the performance of several fetch policies, and we show that small fetch sizes improve performance by maximizing the use of available memory bandwidth
Keywords :
DRAM chips; SRAM chips; cache storage; parallel machines; performance evaluation; vector processor systems; virtual machines; Cray C90; DRAM main memory; NAS parallel benchmarks; SRAM cache; cache performance; cache-based memory systems; cacheless SRAM; data fetch strategy; data locality; fetch policies; fetch sizes; flat multi-bank SRAM memory organization; hierarchical organization; large-scale workload; memory bandwidth; memory latency; memory traffic; optimum performance; performance bottlenecks; program performance degradation; simulation model; vector supercomputers; Application software; Bandwidth; Computer science; Costs; Delay; Large-scale systems; Multiprocessor interconnection networks; Neck; Random access memory; Supercomputers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing '94., Proceedings
Conference_Location :
Washington, DC
Print_ISBN :
0-8186-6605-6
Type :
conf
DOI :
10.1109/SUPERC.1994.344285
Filename :
344285
Link To Document :
بازگشت