DocumentCode :
3010065
Title :
A Prediction Based CMP Cache Migration Policy
Author :
Hao, Song ; Du, Zhihui ; Bader, David ; Wang, Man
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing
fYear :
2008
fDate :
25-27 Sept. 2008
Firstpage :
374
Lastpage :
381
Abstract :
The large L2 cache´s access latency, which is mainly caused by wire delay, is a critical problem to improve the performance of CMP (Chip Multi-Processor) in NUCA (Non-Uniform Cache Architecture). A CMP L2 cache accessing performance model is provided first to analyze and evaluate the L2 access efficiency in this paper. The total L2 cache access latency problem is formalized as an optimal problem and the lower bound of L2 cache access latency is given based on this model. A novel PBM (Prediction based L2 cache data Migration) algorithm, which employs the sequential prediction technology to identify the data to be accessed in the near future, is designed to migrate the data to be accessed toward their users in early and this method can enable the cores to perform their accesses to the L2 cache in close banks. The analysis results show that this active data migration algorithm can take advantage of the principle of locality to reduce the data access latency much more than the traditional lazy data migration policy. To evaluate the theoretic analysis results, the HMTT toolkit is used to capture the complete memory trace of the SPEC 2000 benchmark running on an SMP computer. The memory trace shows that our prediction technology can work well and at the same time, an L2 cache access simulator is developed to deal with the memory trace data. The simulation experiments show that both the shorter block transfer distance and the lower average access latency can be achieved in the PBM policy. The average block transfer distance can be reduced by up to 16.9%, and the average L2 access latency can be reduced by up to 8.4%.
Keywords :
cache storage; memory architecture; microprocessor chips; multiprocessing systems; CMP; L2 cache access latency; NUCA; cache migration policy; chip multiprocessor; nonuniform cache architecture; performance evaluation; sequential prediction; Algorithm design and analysis; Computational modeling; Computer architecture; Computer science; Delay; High performance computing; Information science; Performance analysis; Predictive models; Wire; Chip Multi-Processor; L2-cache; Migration policy; Non-Uniform Cache Architecture; sequential prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communications, 2008. HPCC '08. 10th IEEE International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-0-7695-3352-0
Type :
conf
DOI :
10.1109/HPCC.2008.83
Filename :
4637721
Link To Document :
بازگشت