DocumentCode
3189652
Title
Polynomial-time nested loop fusion with full parallelism
Author
Sha, Edwin H M ; Lang, Chenhua ; Passos, Nelson L.
Author_Institution
Dept. of Comput. Sci. & Eng., Notre Dame Univ., IN, USA
Volume
3
fYear
1996
fDate
12-16 Aug 1996
Firstpage
9
Abstract
Data locality and synchronization overhead are two important factors that affect the performance of applications on multiprocessors. Loop fusion is an effective way of reducing synchronization and improving data locality. Traditional fusion techniques, however either cannot address the case when fusion-preventing dependence exists in nested loops, or cannot achieve good parallelism after fusion. This paper gives a significant improvement by presenting several efficient polynomial-time algorithms to solve these problems. These algorithms combined with the retiming technique allow nested loop fusion in the existence of outmost loop-carried dependence as in the presence of fusion-preventing dependence. Furthermore, the technique is proved to achieve fully parallel execution of the fused loops
Keywords
computational complexity; multiprocessing systems; parallel algorithms; parallel programming; program control structures; programming theory; software performance evaluation; synchronisation; application performance; data locality; fusion-preventing dependence; multiprocessors; outmost loop-carried dependence; parallel program; polynomial-time algorithms; polynomial-time nested loop fusion; retiming technique; synchronization overhead; Application software; Computer science; Concurrent computing; Fuses; High performance computing; Law; Legal factors; Parallel processing; Polynomials; Weather forecasting;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 1996. Vol.3. Software., Proceedings of the 1996 International Conference on
Conference_Location
Ithaca, NY
ISSN
0190-3918
Print_ISBN
0-8186-7623-X
Type
conf
DOI
10.1109/ICPP.1996.538554
Filename
538554
Link To Document