DocumentCode :
3147559
Title :
Squeezing more CPU performance out of a Cray-2 by vector block scheduling
Author :
Eisenbeis, C. ; Jalby, W. ; Lichnewsky, A.
Author_Institution :
INRIA, Le Chesnay, France
fYear :
1988
fDate :
14-18 Nov 1988
Firstpage :
237
Lastpage :
245
Abstract :
Compile-time scheduling of vector activities on the Cray 2 is studied using a simplified model of the vector instruction stream. An approach based on experience with an array-processor microde scheduling by the authors is shown to be practical. It calls for a pass of loop scheduling followed by a pass of resource allocation. Actual benchmarks of the resulting code are shown, exhibiting speedups as large as 50% over the current CFT77 compiler. The results also give a novel perspective on vector chaining vs. nonchaining processor architectures
Keywords :
parallel processing; performance evaluation; CFT77 compiler; CPU performance; Cray-2; benchmarks; compile time scheduling; resource allocation; vector block scheduling; vector instruction stream; Bandwidth; Computer architecture; Concurrent computing; Hardware; Parallel processing; Processor scheduling; Reduced instruction set computing; Resource management; Scheduling algorithm; VLIW;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing '88. [Vol.1]., Proceedings.
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-0882-X
Type :
conf
DOI :
10.1109/SUPERC.1988.44659
Filename :
44659
Link To Document :
بازگشت