DocumentCode
159143
Title
Symbolic inner loop parallelisation for massively parallel processor arrays
Author
Tanase, Alexandru ; Witterauf, Michael ; Teich, Jurgen ; Hannig, Frank
Author_Institution
Dept. of Comput. Sci., Univ. of Erlangen-Nurnberg (FAU), Erlangen, Germany
fYear
2014
fDate
19-21 Oct. 2014
Firstpage
219
Lastpage
228
Abstract
This paper presents a first solution to the unsolved problem of symbolically scheduling a given loop nest with uniform data dependences using inner loop parallelization, in particular, the locally parallel, globally sequential (LPGS) mapping technique. This technique is needed in the case of loop program specifications for which the iterations shall be scheduled on a processor array of unknown size at compile time while keeping the local memory consumption independent of the problem size of the mapped loop nest. We show that it is possible to derive such parameterized LPGS schedules statically by proposing a mixed compile-/runtime approach: At compile time, we first determine the set of all schedule candidates, each latency-optimal for a different scanning order of the loop nest. Then we devise an exact parameterized formula for determining the latency of the resulting symbolic schedules, thus making each schedule fully predictable. At runtime, once the size of the processor array becomes known, a simple prolog selects the overall latency-optimal schedule that is then dynamically activated and executed on the processor array. Hence, our approach avoids any further runtime optimization and expensive re-compilations while achieving the same results as computing an optimal static schedule for each possible combination of array and problem size.
Keywords
formal specification; parallel processing; program compilers; LPGS mapping technique; data dependence; locally parallel globally sequential mapping technique; loop nest; loop program specifications; massively parallel processor arrays; mixed compile-/runtime approach; symbolic inner loop parallelisation; symbolic schedules; Finite impulse response filters; Optimal scheduling; Processor scheduling; Runtime; Schedules; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Formal Methods and Models for Codesign (MEMOCODE), 2014 Twelfth ACM/IEEE International Conference on
Conference_Location
Lausanne
Type
conf
DOI
10.1109/MEMCOD.2014.6961865
Filename
6961865
Link To Document