DocumentCode :
288992
Title :
Automatic data and computation decomposition for distributed memory machines
Author :
Ning, Qi ; Van Dongen, Vincent ; Gao, Guang R.
Author_Institution :
Centre de Recherche Inf. de Montreal, Que., Canada
Volume :
2
fYear :
1995
fDate :
3-6 Jan 1995
Firstpage :
103
Abstract :
We have developed an automatic compile-time computation and data decomposition technique for distributed memory machines. Our method can handle complex programs containing perfect and nonperfect loop nests with or without loop-carried dependences. Applying our decomposition algorithms, a program is divided into collections (called clusters) of loop nests, such that data redistributions are allowed only between the clusters. Within each cluster of loop nests, decomposition and data locality constraints are formulated as a system of homogeneous linear equations which is solved by polynomial time algorithms. Our algorithm can selectively relax data locality constraints within a cluster to achieve a balance between parallelism and data locality. Such relaxations are guided by exploiting the hierarchical program nesting structures from outer to inner nesting levels to keep the communications at an outer-most level possible. This work is central to the on-going compiler development effort under the EPPP (Environment for Portable Parallel Programming) project. A brief discussion of the current implementation is included
Keywords :
computational complexity; data handling; distributed memory systems; equations; parallel programming; parallelising compilers; program control structures; relaxation; software portability; EPPP project; Environment for Portable Parallel Programming; automatic computation decomposition; automatic data decomposition; compile-time technique; compiler development; complex programs; data locality constraint relaxation; data redistributions; distributed memory machines; hierarchical program nesting structures; homogeneous linear equations; loop nest clusters; loop-carried dependences; nesting levels; nonperfect loop nests; parallelism; perfect loop nests; polynomial time algorithms; Clustering algorithms; Computer science; Distributed computing; Equations; Matrix decomposition; Parallel processing; Parallel programming; Polynomials; Program processors; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 1995. Proceedings of the Twenty-Eighth Hawaii International Conference on
Conference_Location :
Wailea, HI
Print_ISBN :
0-8186-6930-6
Type :
conf
DOI :
10.1109/HICSS.1995.375472
Filename :
375472
Link To Document :
بازگشت