DocumentCode :
869914
Title :
Cache replacement algorithms with nonuniform miss costs
Author :
Jeong, Jaeheon ; Dubois, Michel
Author_Institution :
Intel Corp., Hillsboro, OR, USA
Volume :
55
Issue :
4
fYear :
2006
fDate :
4/1/2006 12:00:00 AM
Firstpage :
353
Lastpage :
365
Abstract :
Cache replacement algorithms originally developed in the context of uniprocessors executing one instruction at a time implicitly assume that all cache misses have the same cost. However, in modern systems, some cache misses are more expensive than others. The cost may be latency, penalty, power consumption, bandwidth consumption, or any other ad hoc numerical property attached to a miss. We call the class of replacement algorithms designed to minimize a nonuniform miss cost function "cost-sensitive replacement algorithms". In this paper, we first introduce and analyze an optimum cost-sensitive replacement algorithm (CSOPT) in the context of multiple nonuniform miss costs. CSOPT can significantly improve the cost function over OPT (the replacement algorithm minimizing miss count) in large regions of the design space. Although CSOPT is an offline and unrealizable replacement policy, it serves as a lower bound on the achievable cost by realistic cost-sensitive replacement algorithms. Using the practical example of latency cost in CC-NUMA multiprocessors, we demonstrate that there is a lot of room left to improve current replacement algorithms in many situations beyond the promise of OPT. Next, we introduce three practical extensions of LRU inspired by CSOPT and we compare their performance to LRU, OPT, and CSOPT. Finally, as a practical application, we evaluate these realizable cost-sensitive replacement algorithms in the context of the second-level caches of a CC-NUMA multiprocessor with superscalar processors, using the miss latency as the cost function. By applying simple replacement policies sensitive to the latency of misses, we can improve the execution time of some parallel applications by up to 18 percent.
Keywords :
cache storage; multiprocessing systems; CC-NUMA multiprocessor; CSOPT; cache replacement algorithm; nonuniform miss cost function; optimum cost-sensitive replacement algorithm; superscalar processors; Aggregates; Algorithm design and analysis; Bandwidth; Context modeling; Cost function; Delay; Energy consumption; Multiprocessing systems; Optimized production technology; Cache; latency; memory system; power; replacement policy; trace-driven simulations.;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2006.50
Filename :
1607999
Link To Document :
بازگشت