DocumentCode :
1093476
Title :
Hierarchical execution to speed up pipeline interlock in mainframe computers
Author :
Shintani, Yooichi ; Shonai, Toru ; Kurokawa, Hiroshi ; Kuriyama, Kazunori ; Yamaoka, Akira
Author_Institution :
Res. & Dev. Div., Hitachi America Ltd., San Jose, CA, USA
Volume :
45
Issue :
5
fYear :
1996
fDate :
5/1/1996 12:00:00 AM
Firstpage :
589
Lastpage :
599
Abstract :
This paper introduces a methodology, called hierarchical execution, which reduces stalls caused by pipeline interlocks such as data and control dependencies. Since a lot of software has been accumulated in mainframe computer systems as object code, it is important to improve performance without having to recompile the code for optimization. Our methodology consists of a simple pre-ALU that generates results, with shorter latency than the main ALU, asynchronously, which reduces the overhead especially for address generation interlocks and branch instructions. This method was implemented in Hitachi´s mainframe processors, M-680 and M-880. In M-680, the pre-ALU, together with the instruction decoder, processes instructions in superpipelined fashion, which further improves performance. The aggregate effect of hierarchical execution on CPU time, for evaluated benchmarks, is 10% on average, with only a 1.6% increase in hardware. Therefore, we can roughly say that the hierarchical execution method improved cost performance by 8%
Keywords :
parallel architectures; performance evaluation; pipeline processing; benchmark; code optimization; compiler; hierarchical execution; mainframe computers; performance; pipeline interlock; pipeline interlocks; pre-ALU; Aggregates; Central Processing Unit; Costs; Decoding; Delay; Dynamic scheduling; Hardware; Optimizing compilers; Pipelines; Software performance;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.509910
Filename :
509910
Link To Document :
بازگشت