DocumentCode
2031379
Title
Automatic Partitioning of Parallel Loops for Cache-Coherent Multiprocessors
Author
Agarwal, Anant ; Kranz, David ; Natarajan, Venkat
Author_Institution
Massachusetts Institute of Technology
Volume
1
fYear
1993
fDate
16-20 Aug. 1993
Firstpage
2
Lastpage
11
Abstract
This paper presents a theoretical framework for automatically partitioning parallel loops to minimize cache coherency traffic on shared-memory multiprocessors. While several previous papers have looked at hyperplane partitioning of iteration spaces to reduce communication traffic, the problem of deriving the optimal tiling parameters for minimal communication in loops with general affine index expressions and multiple arrays has remained open. Our paper solves this open problem by presenting a method for deriving an optimal hyperparallelepiped tiling of iteration spaces for minimal communication in multiprocessors with caches. We show that the same theoretical framework can also be used to determine optimal tiling parameters for data and loop partitioning in distributed memory multiprocessors without caches. Like previous papers, our framework uses matrices to represent iteration and data space mappings and the notion of uniformly intersecting references to capture temporal locality in array references. We introduce the notion of data footprints to estimate the communication traffic between processors and use lattice theory to compute precisely the size of data footprints. We have implemented a subset of this framework in a compiler for the Alewife machine.
Keywords
Computer science; Laboratories; Lattices; Parallel processing; Partitioning algorithms; Program processors; Programming profession; Shape; Space technology; Tiles;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing, 1993. ICPP 1993. International Conference on
Conference_Location
Syracuse, NY, USA
ISSN
0190-3918
Print_ISBN
0-8493-8983-6
Type
conf
DOI
10.1109/ICPP.1993.49
Filename
4134106
Link To Document