Title :
On supernode transformation with minimized total running time
Author :
Hodzic, Edin ; Shang, Weijia
Author_Institution :
AT&T Lab., San Jose, CA, USA
fDate :
5/1/1998 12:00:00 AM
Abstract :
With the objective of minimizing the total execution time of a parallel program on a distributed memory parallel computer, this paper discusses how to find an optimal supernode size and optimal supernode relative side lengths of a supernode transformation (also known as tiling). We identify three parameters of supernode transformation: supernode size, relative side lengths, and cutting hyperplane directions. For algorithms with perfectly nested loops and uniform dependencies, for sufficiently large supernodes and number of processors, and for the case where multiple supernodes are mapped to a single processor, we give an order n polynomial whose real positive roots include the optimal supernode size. For two special cases, 1) two-dimensional algorithm problems and 2) n-dimensional algorithm problems, where the communication cost is dominated by the startup penalty and, therefore, can be approximated by a constant, we give a closed form expression for the optimal supernode size, which is independent of the supernode relative side lengths and cutting hyperplanes. For the case where the algorithm iteration index space and the supernodes are hyperrectangular, we give closed form expressions for the optimal supernode relative side lengths. Our experiment shows a good match of the closed form expressions with experimental data
Keywords :
distributed memory systems; parallel architectures; parallelising compilers; cutting hyperplanes; distributed memory multicomputer; distributed memory parallel computer; minimizing running time; optimal supernode size; parallel program; parallelizing compilers; supernode partitioning; supernode transformation; tiling; total execution time; Concurrent computing; Cost function; Delay effects; Distributed computing; Partitioning algorithms; Polynomials;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on