DocumentCode
2010759
Title
Effects of MSHR and Prefetch Mechanisms on an On-Chip Cache of the Vector Architecture
Author
Musa, Akihiro ; Sato, Yoshiei ; Soga, Takashi ; Egawa, Ryusuke ; Takizawa, Hiroyuki ; Okabe, Koki ; Kobayashi, Hiroaki
Author_Institution
Tohoku Univ., NEC Corp. Sendai, Sendai, Japan
fYear
2008
fDate
10-12 Dec. 2008
Firstpage
335
Lastpage
342
Abstract
Vector supercomputers have been encountering the memory wall problem and their memory bandwidth per flop/s rate has decreased. To cover the insufficient memory bandwidth per flop/s rate, an on-chip vector cache has been proposed for the vector processors. Although vector caching is effective to increase the sustained performance to a certain degree, it still needs software and hardware supporting mechanisms to extract its potential. To this end, we propose miss status handling registers (MSHR) and a prefetch mechanism. This paper evaluates the performance of the vector cache with the MSHR and the prefetch mechanism on the vector supercomputer across three leading scientific applications. The MSHR is an effective mechanism for handling subsequent vector loads of the same data, which frequently appear in different schemes. The experimental results indicate that the MSHR can improve the computational performance of scientific applications by 1.45Ã. Moreover, we examine the performance of the prefetch mechanism on the vector cache. The prefetch mechanism increases the computational performance by 1.6Ã. Accordingly, the MSHR and the prefetching mechanism are very effective optimization options for vector caching of future vector supercomputers even if the vector supercomputers cannot maintain the current memory bandwidth per flop/s rate.
Keywords
cache storage; parallel machines; MSHR; memory bandwidth; memory wall problem; miss status handling registers; onchip cache; prefetch mechanisms; vector architecture; vector supercomputers; Bandwidth; Delay; Distributed processing; Hardware; National electric code; Prefetching; Registers; Supercomputers; System-on-a-chip; Vector processors; MSHR; Memory system; Performance characterization; Prefetch; Vector architecture; Vector cache;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing with Applications, 2008. ISPA '08. International Symposium on
Conference_Location
Sydney, NSW
Print_ISBN
978-0-7695-3471-8
Type
conf
DOI
10.1109/ISPA.2008.100
Filename
4725165
Link To Document