DocumentCode :
3219511
Title :
Compiler support for data forwarding in scalable shared-memory multiprocessors
Author :
Koufaty, David ; Torrellas, Josep
Author_Institution :
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
fYear :
1999
fDate :
1999
Firstpage :
181
Lastpage :
190
Abstract :
As the difference in speed between processor and memory system continues to increase, it is becoming crucial to develop and refine techniques that enhance the effectiveness of cache hierarchies. One promising technique in the context of scalable shared-memory multiprocessors is data forwarding. Forwarding hides the latency of communication-induced misses by having producer processors send data to the caches of potential consumer processors in advance. Forwarding can hide the latency effectively, has low instruction overhead, and uses few machine resources. This paper presents a complete implementation of a data forwarding pass in an industrial-strength parallelizing compiler. Complete Fortran applications are analyzed for dependences and, based on the analysis, automatically annotated with forwarding directives. We propose a forwarding framework that includes 4 new instructions: write-forward, write-broadcast, write-update, and write-through. New microarchitectural support is proposed. In our analysis, we assume that the assignment of loop iterations to processors is known. We perform simulations of multiprocessors with different cache, memory, machine sharing, and process migration parameters. We conclude that data forwarding delivers large speedups (six 32-processor applications ran an average of 40% faster), gets close to the upper bound in performance, and needs compiler support of only medium complexity
Keywords :
digital simulation; parallelising compilers; shared memory systems; cache hierarchies; communication-induced misses; compiler support; data forwarding; loop iterations; microarchitectural support; parallelizing compiler; scalable shared-memory multiprocessors; simulations; Context; Delay; Microarchitecture; Radio access networks; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 1999. Proceedings. 1999 International Conference on
Conference_Location :
Aizu-Wakamatsu City
ISSN :
0190-3918
Print_ISBN :
0-7695-0350-0
Type :
conf
DOI :
10.1109/ICPP.1999.797403
Filename :
797403
Link To Document :
بازگشت