• DocumentCode
    656143
  • Title

    On the Merits of Distributed Work-Stealing on Selective Locality-Aware Tasks

  • Author

    Paudel, Jeeva ; Tardieu, Olivier ; Amaral, Jose Nelson

  • Author_Institution
    Comput. Sc., Univ. of Alberta, Edmonton, AB, Canada
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    100
  • Lastpage
    109
  • Abstract
    Improving the performance of work-stealing load-balancing algorithms in distributed shared-memory systems is challenging. These algorithms need to overcome high costs of contention among workers, communication and remote data-references between nodes, and their impact on the locality preferences of tasks. Prior research focus on stealing from a victim that best exploits data locality, and on using special deques that minimize the contention between local and remote workers. This work explores the selection of tasks that are favourable for migration across nodes in a distributed memory cluster, a lesser-explored dimension to distributed work-stealing. The selection of tasks is guided by the application-level task locality rather than hardware memory topology as is the norm in the literature. The prototype for the performance evaluation of these ideas is implemented in X10, a realization of the asynchronous partitioned global address space programming model. This evaluation reveals the applicability of this new approach on several real-world applications chosen from the Cowichan and the Lone star suites. On a cluster of 128 processors, the new work-stealing strategy demonstrates a speedup between 12% and 31% over X10´s existing scheduler. Moreover, the new strategy does not degrade the performance of any of the applications studied.
  • Keywords
    distributed shared memory systems; performance evaluation; resource allocation; scheduling; Cowichan suites; Lonestar suites; X10; application-level task locality; asynchronous partitioned global address space programming model; distributed memory cluster; distributed shared-memory systems; distributed work-stealing; migration across nodes; performance evaluation; processor cluster; scheduler; selective locality-aware tasks; work-stealing load-balancing algorithms; work-stealing strategy; Distributed databases; Instruction sets; Parallel processing; Runtime; Sociology; Statistics; APGAS; Distributed Work Stealing; Performance; X10;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2013 42nd International Conference on
  • Conference_Location
    Lyon
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2013.19
  • Filename
    6687343