DocumentCode
1960285
Title
Asynchronous Work Stealing on Distributed Memory Systems
Author
Shigang Li ; Jingyuan Hu ; Xin Cheng ; Chongchong Zhao
Author_Institution
Sch. of Comput. & Commun. Eng., Univ. of Sci. & Technol. Beijing, Beijing, China
fYear
2013
fDate
Feb. 27 2013-March 1 2013
Firstpage
198
Lastpage
202
Abstract
Work stealing is a popular policy for dynamic load balancing of irregular applications. However, communication overhead incurred by work stealing may make it less efficient, especially on distributed memory systems. In this work we propose an asynchronous work stealing (AsynchWS) strategy which exploits opportunities to overlap communication with local residual tasks. Profiling information is collected locally to optimize task granularity and guide the asynchronous work stealing. AsynchWS is implemented in Unified Parallel C (UPC), which effectively supports non-blocking one-sided communication and facilitates the implementation. Experiments are conducted on a 32 nodes Xeon X5650 cluster using a set of irregular applications. Results show that up to 16% better performance than the state-of-the-art strategies on distributed memory.
Keywords
C language; distributed memory systems; parallel programming; resource allocation; 32 nodes Xeon X5650 cluster; AsynchWS strategy; UPC; asynchronous work stealing strategy; distributed memory systems; dynamic load balancing; irregular applications; local residual tasks; nonblocking one-sided communication; profiling information; state-of-the-art strategies; unified parallel C; Grain size; Instruction sets; Kernel; Libraries; Load management; Parallel processing; Runtime; UPC; asynchronous work stealing; distributed memory; task granularity;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on
Conference_Location
Belfast
ISSN
1066-6192
Print_ISBN
978-1-4673-5321-2
Electronic_ISBN
1066-6192
Type
conf
DOI
10.1109/PDP.2013.35
Filename
6498552
Link To Document