• DocumentCode
    2010759
  • Title

    Effects of MSHR and Prefetch Mechanisms on an On-Chip Cache of the Vector Architecture

  • Author

    Musa, Akihiro ; Sato, Yoshiei ; Soga, Takashi ; Egawa, Ryusuke ; Takizawa, Hiroyuki ; Okabe, Koki ; Kobayashi, Hiroaki

  • Author_Institution
    Tohoku Univ., NEC Corp. Sendai, Sendai, Japan
  • fYear
    2008
  • fDate
    10-12 Dec. 2008
  • Firstpage
    335
  • Lastpage
    342
  • Abstract
    Vector supercomputers have been encountering the memory wall problem and their memory bandwidth per flop/s rate has decreased. To cover the insufficient memory bandwidth per flop/s rate, an on-chip vector cache has been proposed for the vector processors. Although vector caching is effective to increase the sustained performance to a certain degree, it still needs software and hardware supporting mechanisms to extract its potential. To this end, we propose miss status handling registers (MSHR) and a prefetch mechanism. This paper evaluates the performance of the vector cache with the MSHR and the prefetch mechanism on the vector supercomputer across three leading scientific applications. The MSHR is an effective mechanism for handling subsequent vector loads of the same data, which frequently appear in different schemes. The experimental results indicate that the MSHR can improve the computational performance of scientific applications by 1.45×. Moreover, we examine the performance of the prefetch mechanism on the vector cache. The prefetch mechanism increases the computational performance by 1.6×. Accordingly, the MSHR and the prefetching mechanism are very effective optimization options for vector caching of future vector supercomputers even if the vector supercomputers cannot maintain the current memory bandwidth per flop/s rate.
  • Keywords
    cache storage; parallel machines; MSHR; memory bandwidth; memory wall problem; miss status handling registers; onchip cache; prefetch mechanisms; vector architecture; vector supercomputers; Bandwidth; Delay; Distributed processing; Hardware; National electric code; Prefetching; Registers; Supercomputers; System-on-a-chip; Vector processors; MSHR; Memory system; Performance characterization; Prefetch; Vector architecture; Vector cache;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing with Applications, 2008. ISPA '08. International Symposium on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3471-8
  • Type

    conf

  • DOI
    10.1109/ISPA.2008.100
  • Filename
    4725165