DocumentCode :
1084953
Title :
Processor mapping techniques toward efficient data redistribution
Author :
Kalns, Edgar T. ; Ni, Lionel M.
Author_Institution :
POWERparallel Syst., IBM Corp., Poughkeepsie, NY, USA
Volume :
6
Issue :
12
fYear :
1995
fDate :
12/1/1995 12:00:00 AM
Firstpage :
1234
Lastpage :
1247
Abstract :
Run-time data redistribution can enhance algorithm performance in distributed-memory machines. Explicit redistribution of data can be performed between algorithm phases when a different data decomposition is expected to deliver increased performance for a subsequent phase of computation. Redistribution, however, represents increased program overhead as algorithm computation is discontinued while data are exchanged among processor memories. In this paper, we present a technique that minimizes the amount of data exchange for BLOCK to CYCLIC(c) (or vice-versa) redistributions of arbitrary number of dimensions. Preserving the semantics of the target (destination) distribution pattern, the technique manipulates the data to logical processor mapping of the target pattern. When implemented on an IBM SP, the mapping technique demonstrates redistribution performance improvements of approximately 40% over traditional data to processor mapping. Relative to the traditional mapping technique, the proposed method affords greater flexibility in specifying precisely which data elements are redistributed and which elements remain on-processor
Keywords :
FORTRAN; distributed memory systems; parallel languages; parallel programming; processor scheduling; High Performance Fortran; algorithm computation; algorithm performance; algorithm phases; data decomposition; data redistribution; data-parallel programming; distributed-memory architectures; distributed-memory machines; efficient data redistribution; logical processor mapping; processor mapping; processor mapping techniques; program overhead; redistribution performance improvements; run-time data redistribution; target pattern; traditional mapping technique; Computer Society; Computer science; Costs; Runtime; System performance;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.476166
Filename :
476166
Link To Document :
بازگشت