• DocumentCode
    1902135
  • Title

    Avoiding store misses to fully modified cache blocks

  • Author

    Hu, Shiwen ; John, Lizy

  • Author_Institution
    Networking & Comput. Syst. Group, Freescale Semicond., Inc., Austin, TX
  • fYear
    2006
  • fDate
    10-12 April 2006
  • Lastpage
    296
  • Abstract
    Memory bandwidth limitation is one of the major impediments to high-performance microprocessors. This paper investigates a class of store misses that can be eliminated to reduce data traffic. Those store misses fetch cache blocks whose original data is never used. If fully overwritten by subsequent stores, those blocks can be installed directly in the cache without accessing lower levels of the memory hierarchy, eliminating the corresponding data traffic. Our results indicate that for a 1 MB data cache, 28% of cache misses are avoidable across SPEC CPU INT 2000 benchmarks. We propose a simple hardware mechanism, the store fill buffer (SFB), which directly installs blocks for store misses, and substantially reduces the data traffic. A 16-entry SFB eliminates 16% of overall misses to a 64 KB data cache, resulting in 6% speedup. This mechanism enables other bandwidth-hungry techniques to further improve system performance
  • Keywords
    benchmark testing; cache storage; microprocessor chips; SPEC CPU INT 2000 benchmark; bandwidth-hungry technique; data traffic; microprocessor; modified cache block; Bandwidth; Buffer storage; Computer architecture; Computer networks; Delay; Hardware; Impedance; Laboratories; Microprocessors; System performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance, Computing, and Communications Conference, 2006. IPCCC 2006. 25th IEEE International
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    1-4244-0198-4
  • Type

    conf

  • DOI
    10.1109/.2006.1629419
  • Filename
    1629419