Title :
On the Merits of Distributed Work-Stealing on Selective Locality-Aware Tasks
Author :
Paudel, Jeeva ; Tardieu, Olivier ; Amaral, Jose Nelson
Author_Institution :
Comput. Sc., Univ. of Alberta, Edmonton, AB, Canada
Abstract :
Improving the performance of work-stealing load-balancing algorithms in distributed shared-memory systems is challenging. These algorithms need to overcome high costs of contention among workers, communication and remote data-references between nodes, and their impact on the locality preferences of tasks. Prior research focus on stealing from a victim that best exploits data locality, and on using special deques that minimize the contention between local and remote workers. This work explores the selection of tasks that are favourable for migration across nodes in a distributed memory cluster, a lesser-explored dimension to distributed work-stealing. The selection of tasks is guided by the application-level task locality rather than hardware memory topology as is the norm in the literature. The prototype for the performance evaluation of these ideas is implemented in X10, a realization of the asynchronous partitioned global address space programming model. This evaluation reveals the applicability of this new approach on several real-world applications chosen from the Cowichan and the Lone star suites. On a cluster of 128 processors, the new work-stealing strategy demonstrates a speedup between 12% and 31% over X10´s existing scheduler. Moreover, the new strategy does not degrade the performance of any of the applications studied.
Keywords :
distributed shared memory systems; performance evaluation; resource allocation; scheduling; Cowichan suites; Lonestar suites; X10; application-level task locality; asynchronous partitioned global address space programming model; distributed memory cluster; distributed shared-memory systems; distributed work-stealing; migration across nodes; performance evaluation; processor cluster; scheduler; selective locality-aware tasks; work-stealing load-balancing algorithms; work-stealing strategy; Distributed databases; Instruction sets; Parallel processing; Runtime; Sociology; Statistics; APGAS; Distributed Work Stealing; Performance; X10;
Conference_Titel :
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location :
Lyon
DOI :
10.1109/ICPP.2013.19