Symbolic inner loop parallelisation for massively parallel processor arrays

Author

Tanase, Alexandru ; Witterauf, Michael ; Teich, Jurgen ; Hannig, Frank

Author_Institution

Dept. of Comput. Sci., Univ. of Erlangen-Nurnberg (FAU), Erlangen, Germany

fYear

2014

fDate

19-21 Oct. 2014

Firstpage

219

Lastpage

228

Abstract

This paper presents a first solution to the unsolved problem of symbolically scheduling a given loop nest with uniform data dependences using inner loop parallelization, in particular, the locally parallel, globally sequential (LPGS) mapping technique. This technique is needed in the case of loop program specifications for which the iterations shall be scheduled on a processor array of unknown size at compile time while keeping the local memory consumption independent of the problem size of the mapped loop nest. We show that it is possible to derive such parameterized LPGS schedules statically by proposing a mixed compile-/runtime approach: At compile time, we first determine the set of all schedule candidates, each latency-optimal for a different scanning order of the loop nest. Then we devise an exact parameterized formula for determining the latency of the resulting symbolic schedules, thus making each schedule fully predictable. At runtime, once the size of the processor array becomes known, a simple prolog selects the overall latency-optimal schedule that is then dynamically activated and executed on the processor array. Hence, our approach avoids any further runtime optimization and expensive re-compilations while achieving the same results as computing an optimal static schedule for each possible combination of array and problem size.

Keywords

formal specification; parallel processing; program compilers; LPGS mapping technique; data dependence; locally parallel globally sequential mapping technique; loop nest; loop program specifications; massively parallel processor arrays; mixed compile-/runtime approach; symbolic inner loop parallelisation; symbolic schedules; Finite impulse response filters; Optimal scheduling; Processor scheduling; Runtime; Schedules; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Formal Methods and Models for Codesign (MEMOCODE), 2014 Twelfth ACM/IEEE International Conference on

Conference_Location

Lausanne

Type

conf

DOI

10.1109/MEMCOD.2014.6961865

Filename

6961865