DocumentCode :
668144
Title :
Model-driven multisite workflow scheduling
Author :
Maheshwari, Ketan ; Eun-Sung Jung ; Jiayuan Meng ; Vishwanath, Venkatram ; Kettimuthu, Rajkumar
Author_Institution :
Argonne Nat. Lab., Argonne, IL, USA
fYear :
2013
fDate :
23-27 Sept. 2013
Firstpage :
1
Lastpage :
5
Abstract :
Workflows continue to play an important role in expressing and deploying scientific applications. In recent years, a wide variety of computational sites have emerged with shared access to users. A user may not be able to complete a complex workflow at a single site. It is thus beneficial to run different tasks of a workflow on different sites. For such cases, judicious scheduling strategy is required in order to map tasks in the workflow to resources at multiple sites so that the workload is balanced among sites and the overhead is minimized in data transfer. The key challenge is that the data transfer rate among sites varies based on the network capacity and load. We propose a workflow scheduling technique that tackles the multi-site task distribution challenge by using data movement performance modeling. We applied this technique to schedule an earth observation science workflow over three sites. Executed via the Swift parallel scripting paradigm, we augmented its default schedule and improved the time-to-completion by up to 52%.
Keywords :
geophysics computing; parallel processing; scheduling; workflow management software; Swift parallel scripting paradigm; data movement performance modeling; data transfer; earth observation science workflow; judicious scheduling strategy; model-driven multisite workflow scheduling; multisite task distribution; network capacity; network load; scientific applications; workload balancing; Processor scheduling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
Type :
conf
DOI :
10.1109/CLUSTER.2013.6702647
Filename :
6702647
Link To Document :
بازگشت