DocumentCode :
3459686
Title :
Performance Portable Optimizations for Loops Containing Communication Operations
Author :
Iancu, Costin ; Chen, Wei ; Yelick, Katherine
Author_Institution :
Lawrence Berkeley Nat. Lab., Berkeley
fYear :
2007
fDate :
15-19 Sept. 2007
Firstpage :
411
Lastpage :
411
Abstract :
As high end computing systems continue to scale in CPU computational power and overall node count, optimization techniques that can reduce communication overhead have proven important. We present a loop optimization framework designed to achieve both efficient communication/computation overlap and performance portability. The framework has been implemented in the Berkeley UPC compiler and uses a combination of compile time analysis and runtime mechanisms. We extend the compiler to perform message vectorization and message strip mining optimizations. At compile time loop nests are analyzed, their communication requirements are determined, and the computation overhead is estimated. The compiler passes analysis information to the runtime and performance portability is achieved by decoupling data movement from local computation. We generate template code that uses the transferred data without making any assumptions about the communication mechanism.
Keywords :
optimising compilers; program control structures; communication operations; compile time analysis; message strip mining optimizations; message vectorization; portable loop optimization framework; runtime mechanisms; Communication system control; High performance computing; Laboratories; Optimizing compilers; Performance analysis; Portable computers; Productivity; Programming profession; Quality of service; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architecture and Compilation Techniques, 2007. PACT 2007. 16th International Conference on
Conference_Location :
Brasov
ISSN :
1089-795X
Print_ISBN :
978-0-7695-2944-8
Type :
conf
DOI :
10.1109/PACT.2007.4336239
Filename :
4336239
Link To Document :
بازگشت