Title :
Selection of the Register File Size and the Resource Allocation Policy on SMT Processors
Author :
Alastruey, Jesus ; Monreal, Teresa ; Cazorla, Francisco ; Vinals, Victor ; Valero, Mateo
Author_Institution :
DIIS-I3A, Univ. de Zaragoza, Zaragoza
fDate :
Oct. 29 2008-Nov. 1 2008
Abstract :
The performance impact of the Physical Register File(PRF) size on Simultaneous Multithreading processors has not been extensively studied in spite of being a critical shared resource. In this paper we analyze the effect on performance of the PRF size for a broad set of resource allocation policies (Icount, Stall, Flush, Flush++, Static,Dcra and Hill-climbing) and evaluate them under two metrics: instructions per second (IPS) for throughput and harmonic mean of weighted IPCs (Hmean-wIPC) for fairness. We have found that resource allocation policy and PRF size should be considered together in order to obtain the best score in the proposed metrics. For instance, for the analyzed 2 and 4-threaded SPEC CPU2000 workloads,small PRFs are best managed by Flush, whereas for larger PRFs, Hill-climbing and Static lead to the best values for the throughput and fairness metrics, respectively.The second contribution of this work is a simple procedure that, for a given resource allocation policy, selects the PRF size that maximizes IPS and obtains for Hmean-wIPC a value close to its maximum. According to our results, Hill-climbing with a 320-entry PRF achieves the best figures for 2-threaded workloads. When executing 4-threaded workloads, Hill-Climbing with a 384-entry PRF achieves the best throughput whereas Static obtains the best throughput-fairness balance.
Keywords :
file organisation; multi-threading; multiprocessing systems; performance evaluation; resource allocation; surface mount technology; SMT processors; best throughput-fairness balance; fairness metrics; harmonic mean; instructions per second; physical register file; register file size; resource allocation policies; resource allocation policy; simultaneous multithreading processor; throughput metrics; weighted IPC; Computer architecture; Frequency; Multithreading; Performance analysis; Process design; Registers; Resource management; Surface-mount technology; Throughput; Yarn; Simultaneous multithreading (SMT); fetch policies; register file; resource allocation policies;
Conference_Titel :
Computer Architecture and High Performance Computing, 2008. SBAC-PAD '08. 20th International Symposium on
Conference_Location :
Campo Grande, MS
Print_ISBN :
978-0-7695-3423-7
DOI :
10.1109/SBAC-PAD.2008.17