• DocumentCode
    159143
  • Title

    Symbolic inner loop parallelisation for massively parallel processor arrays

  • Author

    Tanase, Alexandru ; Witterauf, Michael ; Teich, Jurgen ; Hannig, Frank

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Erlangen-Nurnberg (FAU), Erlangen, Germany
  • fYear
    2014
  • fDate
    19-21 Oct. 2014
  • Firstpage
    219
  • Lastpage
    228
  • Abstract
    This paper presents a first solution to the unsolved problem of symbolically scheduling a given loop nest with uniform data dependences using inner loop parallelization, in particular, the locally parallel, globally sequential (LPGS) mapping technique. This technique is needed in the case of loop program specifications for which the iterations shall be scheduled on a processor array of unknown size at compile time while keeping the local memory consumption independent of the problem size of the mapped loop nest. We show that it is possible to derive such parameterized LPGS schedules statically by proposing a mixed compile-/runtime approach: At compile time, we first determine the set of all schedule candidates, each latency-optimal for a different scanning order of the loop nest. Then we devise an exact parameterized formula for determining the latency of the resulting symbolic schedules, thus making each schedule fully predictable. At runtime, once the size of the processor array becomes known, a simple prolog selects the overall latency-optimal schedule that is then dynamically activated and executed on the processor array. Hence, our approach avoids any further runtime optimization and expensive re-compilations while achieving the same results as computing an optimal static schedule for each possible combination of array and problem size.
  • Keywords
    formal specification; parallel processing; program compilers; LPGS mapping technique; data dependence; locally parallel globally sequential mapping technique; loop nest; loop program specifications; massively parallel processor arrays; mixed compile-/runtime approach; symbolic inner loop parallelisation; symbolic schedules; Finite impulse response filters; Optimal scheduling; Processor scheduling; Runtime; Schedules; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Formal Methods and Models for Codesign (MEMOCODE), 2014 Twelfth ACM/IEEE International Conference on
  • Conference_Location
    Lausanne
  • Type

    conf

  • DOI
    10.1109/MEMCOD.2014.6961865
  • Filename
    6961865