DocumentCode :
3663933
Title :
Cost-effective speculative scheduling in high performance processors
Author :
Arthur Perais;André Seznec;Pierre Michaud;Andreas Sembrant;Erik Hagersten
Author_Institution :
IRISA/INRIA, France
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
247
Lastpage :
259
Abstract :
To maximize peiformance, out-of-order execution processors sometimes issue instructions without having the guarantee that operands will be available in time; e.g. loads are typically assumed to hit in the LI cache and dependent instructions are issued accordingly. This form of speculation - that we refer to as speculative scheduling - has been used for two decades in real processors, but has received little attention from the research community. In particular, as pipeline depth grows, and the distance between the Issue and the Execute stages increases, it becomes critical to issue instructions dependent on variable-latency instructions as soon as possible rather than wait for the actual cycle at which the result becomes available. Unfortunately, due to the uncertain nature of speculative scheduling, the scheduler may wrongly issue an instruction that will not have its source( s) available on the bypass network when it reaches the Execute stage. In that event, the instruction is canceled and replayed, potentially impairing peiformance and increasing energy consumption. In this work, we do not present a new replay mechanism. Rather, we focus on ways to reduce the number of replays that are agnostic of the replay scheme. First, we propose an easily implementable, low-cost solution to reduce the number of replays caused by Ll bank conflicts. Schedule shifting always assumes that, given a dual-load issue capacity, the second load issued in a given cycle will be delayed because of a bank conflict. Its dependents are thus always issued with the corresponding delay. Second, we also improve on existing Ll hit/miss prediction schemes by taking into account instruction criticality. That is, for some criterion of criticality and for loads whose hit/miss behavior is hard to predict, we show that it is more cost-effective to stall dependents if the load is not predicted critical.
Keywords :
"Legged locomotion","Microarchitecture"
Publisher :
ieee
Conference_Titel :
Computer Architecture (ISCA), 2015 ACM/IEEE 42nd Annual International Symposium on
Type :
conf
DOI :
10.1145/2749469.2749470
Filename :
7284070
Link To Document :
بازگشت