DocumentCode
855984
Title
Efficient Runahead Execution: Power-Efficient Memory Latency Tolerance
Author
Mutlu, Onur ; Kim, Hyesoon ; Patt, Yale N.
Author_Institution
Texas Univ., Austin, TX
Volume
26
Issue
1
fYear
2006
Firstpage
10
Lastpage
20
Abstract
Today´s high-performance processors face main-memory latencies on the order of hundreds of processor clock cycles. As a result, even the most aggressive processors spend a significant portion of their execution time stalling and waiting for main-memory accesses to return data to the execution core. Runahead execution is a promising way to tolerate long main-memory latencies because it has modest hardware cost and doesn´t significantly increase processor complexity. Runahead execution improves a processors performance by speculatively pre-executing the application program while the processor services a long-latency (1,2) data cache miss, instead of stalling the processor for the duration of the L2 miss. For runahead execution to be efficiently implemented in current or future high-performance processors which will be energy-constrained, processor designers must develop techniques to reduce these extra instructions. Our solution to this problem includes both hardware and software mechanisms that are simple, implementable, and effective
Keywords
cache storage; instruction sets; memory architecture; power consumption; data cache miss; high-performance processors; main-memory latencies; power-efficient memory latency tolerance; runahead execution; Buffer storage; Clocks; Costs; Delay; Energy consumption; Hardware; Out of order; Prefetching; Process design; Registers; Runahead execution; memory latency tolerance; processors;
fLanguage
English
Journal_Title
Micro, IEEE
Publisher
ieee
ISSN
0272-1732
Type
jour
DOI
10.1109/MM.2006.10
Filename
1603492
Link To Document