Title :
A unified framework for optimizing communication in data-parallel programs
Author :
Gupta, Manish ; Schonberg, Edith ; Srinivasan, Harini
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Date :
7/1/1996
Abstract :
This paper presents a framework, based on global array data-flow analysis, to reduce communication costs in a program being compiled for a distributed memory machine. We introduce the available section descriptor, a novel representation of communication involving array sections. This representation allows us to apply techniques for partial redundancy elimination to obtain powerful communication optimizations. With a single framework, we are able to capture optimizations such as (1) vectorizing communication, (2) eliminating communication that is redundant on any control flow path, (3) reducing the amount of data being communicated, (4) reducing the number of processors to which data must be communicated, and (5) moving communication earlier to hide latency and to subsume previous communication. We show that the bidirectional problem of eliminating partial redundancies can be decomposed into simpler unidirectional problems even in the context of an array section representation, which makes the analysis procedure more efficient. We present results from a preliminary implementation of this framework; they are extremely encouraging and demonstrate the effectiveness of this analysis in improving the performance of programs.
Keywords :
optimisation; parallel programming; performance evaluation; program compilers; software performance evaluation; communication costs; communication optimisation; data-parallel programs; distributed memory machine; global array data-flow analysis; partial redundancies; partial redundancy elimination; section descriptor; unified framework; Availability; Buildings; Communication system control; Context; Cost function; Data analysis; Delay; Memory architecture; Optimization methods; Performance analysis
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on