Title :
Processor mapping techniques toward efficient data redistribution
Author :
Kalns, Edgar T. ; Ni, Lionel M.
Author_Institution :
POWERparallel Syst., IBM Corp., Poughkeepsie, NY, USA
fDate :
12/1/1995 12:00:00 AM
Abstract :
Run-time data redistribution can enhance algorithm performance in distributed-memory machines. Explicit redistribution of data can be performed between algorithm phases when a different data decomposition is expected to deliver increased performance for a subsequent phase of computation. Redistribution, however, represents increased program overhead as algorithm computation is discontinued while data are exchanged among processor memories. In this paper, we present a technique that minimizes the amount of data exchange for BLOCK to CYCLIC(c) (or vice-versa) redistributions of arbitrary number of dimensions. Preserving the semantics of the target (destination) distribution pattern, the technique manipulates the data to logical processor mapping of the target pattern. When implemented on an IBM SP, the mapping technique demonstrates redistribution performance improvements of approximately 40% over traditional data to processor mapping. Relative to the traditional mapping technique, the proposed method affords greater flexibility in specifying precisely which data elements are redistributed and which elements remain on-processor
Keywords :
FORTRAN; distributed memory systems; parallel languages; parallel programming; processor scheduling; High Performance Fortran; algorithm computation; algorithm performance; algorithm phases; data decomposition; data redistribution; data-parallel programming; distributed-memory architectures; distributed-memory machines; efficient data redistribution; logical processor mapping; processor mapping; processor mapping techniques; program overhead; redistribution performance improvements; run-time data redistribution; target pattern; traditional mapping technique; Computer Society; Computer science; Costs; Runtime; System performance;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on