DocumentCode :
3306944
Title :
Store vectors for scalable memory dependence prediction and scheduling
Author :
Subramaniam, Samantika ; Loh, Gabriel H.
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2006
fDate :
11-15 Feb. 2006
Firstpage :
65
Lastpage :
76
Abstract :
Allowing loads to issue out-of-order with respect to earlier unresolved store addresses is very important for extracting parallelism in large-window superscalar processors. Blindly allowing all loads to issue as soon as their addresses are ready can lead to a net performance loss due to a large number of load-store ordering violations. Previous research has proposed memory dependence prediction algorithms to prevent only loads with true memory dependencies from issuing in the presence of unresolved stores. Techniques such as load-store pair identification and store sets have been very successful in achieving performance levels close to that attained by an oracle dependence predictor. These techniques tend to employ relatively complex CAM-based designs, which we believe have been obstacles to the industrial adoption of these algorithms. In this paper, we use the idea of dependency vectors from matrix schedulers for non-memory instructions, and adapt them to implement a new dependence prediction algorithm. For applications that experience frequent memory ordering violations, our "store vector" prediction algorithm delivers an 8.4% speedup over blind speculation (compared to 8.5% for perfect dependence prediction), achieves better performance than store sets (8.1%), and the store vector algorithm\´s matrix implementation is considerably simpler.
Keywords :
instruction sets; multiprocessing systems; scheduling; storage allocation; dependency vectors; load-store pair identification; matrix schedulers; nonmemory instructions; scalable memory dependence prediction; scheduling; store vector prediction algorithm; superscalar processors; Computer aided instruction; Educational institutions; Job shop scheduling; Microarchitecture; Out of order; Parallel processing; Pipelines; Prediction algorithms; Processor scheduling; Read-write memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High-Performance Computer Architecture, 2006. The Twelfth International Symposium on
ISSN :
1530-0897
Print_ISBN :
0-7803-9368-6
Type :
conf
DOI :
10.1109/HPCA.2006.1598113
Filename :
1598113
Link To Document :
بازگشت