• DocumentCode
    3146324
  • Title

    An Extended Work-Stealing Framework for Mixed-Mode Parallel Applications

  • Author

    Wimmer, Martin ; Träff, Jesper Larsson

  • Author_Institution
    Dept. of Sci. Comput., Univ. of Vienna, Vienna, Austria
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    1683
  • Lastpage
    1690
  • Abstract
    Parallelizing complex applications even for well-behaved parallel systems often calls for different parallelization approaches within the same application. In this paper we discuss three applications from the literature that for both reasons of efficiency and expressive convenience benefit from a mixture of task and more tightly coupled data parallelism. These three applications, namely Quick sort, list ranking, and LU factorization with partial pivoting, are paradigms for recursive, mixed-mode parallel algorithms that can neither easily nor efficiently be expressed in either a purely data-parallel or a purely task-parallel fashion. As a solution we present a shared-memory programming framework that allows tasks to dynamically spawn subtasks with a given degree of parallelism for implementing tightly coupled parallel parts of the algorithm. All three paradigmatic applications can naturally be expressed in this framework, which in turn can be supported by an extended, non-conventional work-stealing scheduler, which we also briefly sketch. Using our new algorithm for work-stealing with deterministic team-building we are able to show, beyond the improved, more natural implementability, in many cases better scalability and sometimes absolute performance than with less natural implementations based on pure task-parallelism executed with conventional work-stealing. Detailed performance results using an Intel 32-core system substantiate our claims.
  • Keywords
    parallel algorithms; shared memory systems; LU factorization; deterministic team-building; extended work-stealing framework; list ranking; mixed-mode parallel algorithm; mixed-mode parallel applications; parallel systems; parallelization approach; partial pivoting; quick sort; shared-memory programming framework; tightly coupled data parallelism; Arrays; Dynamic scheduling; Heuristic algorithms; Instruction sets; Parallel processing; Scalability; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.325
  • Filename
    6009034