• DocumentCode
    855984
  • Title

    Efficient Runahead Execution: Power-Efficient Memory Latency Tolerance

  • Author

    Mutlu, Onur ; Kim, Hyesoon ; Patt, Yale N.

  • Author_Institution
    Texas Univ., Austin, TX
  • Volume
    26
  • Issue
    1
  • fYear
    2006
  • Firstpage
    10
  • Lastpage
    20
  • Abstract
    Today´s high-performance processors face main-memory latencies on the order of hundreds of processor clock cycles. As a result, even the most aggressive processors spend a significant portion of their execution time stalling and waiting for main-memory accesses to return data to the execution core. Runahead execution is a promising way to tolerate long main-memory latencies because it has modest hardware cost and doesn´t significantly increase processor complexity. Runahead execution improves a processors performance by speculatively pre-executing the application program while the processor services a long-latency (1,2) data cache miss, instead of stalling the processor for the duration of the L2 miss. For runahead execution to be efficiently implemented in current or future high-performance processors which will be energy-constrained, processor designers must develop techniques to reduce these extra instructions. Our solution to this problem includes both hardware and software mechanisms that are simple, implementable, and effective
  • Keywords
    cache storage; instruction sets; memory architecture; power consumption; data cache miss; high-performance processors; main-memory latencies; power-efficient memory latency tolerance; runahead execution; Buffer storage; Clocks; Costs; Delay; Energy consumption; Hardware; Out of order; Prefetching; Process design; Registers; Runahead execution; memory latency tolerance; processors;
  • fLanguage
    English
  • Journal_Title
    Micro, IEEE
  • Publisher
    ieee
  • ISSN
    0272-1732
  • Type

    jour

  • DOI
    10.1109/MM.2006.10
  • Filename
    1603492