• DocumentCode
    1083450
  • Title

    A unified framework for optimizing communication in data-parallel programs

  • Author

    Gupta, Manish ; Schonberg, Edith ; Srinivasan, Harini

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    7
  • Issue
    7
  • fYear
    1996
  • fDate
    7/1/1996 12:00:00 AM
  • Firstpage
    689
  • Lastpage
    704
  • Abstract
    This paper presents a framework, based on global array data-flow analysis, to reduce communication costs in a program being compiled for a distributed memory machine. We introduce available section descriptor, a novel representation of communication involving array sections. This representation allows us to apply techniques for partial redundancy elimination to obtain powerful communication optimizations. With a single framework, we are able to capture optimizations like (1) vectorizing communication, (2) eliminating communication that is redundant on any control flow path, (3) reducing the amount of data being communicated, (4) reducing the number of processors to which data must be communicated, and (5) moving communication earlier to hide latency, and to subsume previous communication. We show that the bidirectional problem of eliminating partial redundancies can be decomposed into simpler unidirectional problems even in the context of an array section representation, which makes the analysis procedure more efficient. We present results from a preliminary implementation of this framework, which are extremely encouraging, and demonstrate the effectiveness of this analysis in improving the performance of programs
  • Keywords
    optimisation; parallel programming; performance evaluation; program compilers; software performance evaluation; communication costs; communication optimisation; data-parallel programs; distributed memory machine; global array data-flow analysis; partial redundancies; partial redundancy elimination; section descriptor; unified framework; Availability; Buildings; Communication system control; Context; Cost function; Data analysis; Delay; Memory architecture; Optimization methods; Performance analysis;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.508249
  • Filename
    508249