DocumentCode :
1384932
Title :
On supernode transformation with minimized total running time
Author :
Hodzic, Edin ; Shang, Weijia
Author_Institution :
AT&T Lab., San Jose, CA, USA
Volume :
9
Issue :
5
fYear :
1998
fDate :
5/1/1998 12:00:00 AM
Firstpage :
417
Lastpage :
428
Abstract :
With the objective of minimizing the total execution time of a parallel program on a distributed memory parallel computer, this paper discusses how to find an optimal supernode size and optimal supernode relative side lengths of a supernode transformation (also known as tiling). We identify three parameters of supernode transformation: supernode size, relative side lengths, and cutting hyperplane directions. For algorithms with perfectly nested loops and uniform dependencies, for sufficiently large supernodes and number of processors, and for the case where multiple supernodes are mapped to a single processor, we give an order n polynomial whose real positive roots include the optimal supernode size. For two special cases, 1) two-dimensional algorithm problems and 2) n-dimensional algorithm problems, where the communication cost is dominated by the startup penalty and, therefore, can be approximated by a constant, we give a closed form expression for the optimal supernode size, which is independent of the supernode relative side lengths and cutting hyperplanes. For the case where the algorithm iteration index space and the supernodes are hyperrectangular, we give closed form expressions for the optimal supernode relative side lengths. Our experiment shows a good match of the closed form expressions with experimental data
Keywords :
distributed memory systems; parallel architectures; parallelising compilers; cutting hyperplanes; distributed memory multicomputer; distributed memory parallel computer; minimizing running time; optimal supernode size; parallel program; parallelizing compilers; supernode partitioning; supernode transformation; tiling; total execution time; Concurrent computing; Cost function; Delay effects; Distributed computing; Partitioning algorithms; Polynomials;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.679213
Filename :
679213
Link To Document :
بازگشت