Title :
An Extended Work-Stealing Framework for Mixed-Mode Parallel Applications
Author :
Wimmer, Martin ; Träff, Jesper Larsson
Author_Institution :
Dept. of Sci. Comput., Univ. of Vienna, Vienna, Austria
Abstract :
Parallelizing complex applications even for well-behaved parallel systems often calls for different parallelization approaches within the same application. In this paper we discuss three applications from the literature that for both reasons of efficiency and expressive convenience benefit from a mixture of task and more tightly coupled data parallelism. These three applications, namely Quick sort, list ranking, and LU factorization with partial pivoting, are paradigms for recursive, mixed-mode parallel algorithms that can neither easily nor efficiently be expressed in either a purely data-parallel or a purely task-parallel fashion. As a solution we present a shared-memory programming framework that allows tasks to dynamically spawn subtasks with a given degree of parallelism for implementing tightly coupled parallel parts of the algorithm. All three paradigmatic applications can naturally be expressed in this framework, which in turn can be supported by an extended, non-conventional work-stealing scheduler, which we also briefly sketch. Using our new algorithm for work-stealing with deterministic team-building we are able to show, beyond the improved, more natural implementability, in many cases better scalability and sometimes absolute performance than with less natural implementations based on pure task-parallelism executed with conventional work-stealing. Detailed performance results using an Intel 32-core system substantiate our claims.
Keywords :
parallel algorithms; shared memory systems; LU factorization; deterministic team-building; extended work-stealing framework; list ranking; mixed-mode parallel algorithm; mixed-mode parallel applications; parallel systems; parallelization approach; partial pivoting; quick sort; shared-memory programming framework; tightly coupled data parallelism; Arrays; Dynamic scheduling; Heuristic algorithms; Instruction sets; Parallel processing; Scalability; Synchronization;
Conference_Titel :
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.325