DocumentCode :
2960541
Title :
Improving Scientific Workflow Performance Using Policy Based Data Placement
Author :
Amer, Muhammad Ali ; Chervenak, Ann ; Chen, Weiwei
Author_Institution :
Univ. of Southern California, Los Angeles, CA, USA
fYear :
2012
fDate :
16-18 July 2012
Firstpage :
86
Lastpage :
93
Abstract :
I/O-intensive jobs such as stage-in, stage-out or data clean-up jobs account for a significant share of the execution time of scientific workflows. Workflow managers typically add these data management operations as supporting jobs to computational tasks, with scheduling emphasis on compute jobs only. We present the integration of the Pegasus Workflow Management System with a Policy Based Data Placement Service (PDPS) to reduce overall workflow execution time. Pegasus delegates all data staging jobs to PDPS, which schedules and executes stage-in jobs based on selected data placement policies and simply executes stage-out and clean-up jobs independently of the workflow execution state. We measure the impact of using PDPS with Pegasus, first with the Montage workflow and then with a synthetic workflow. We enforce two policies and demonstrate the advantage of using PDPS for asynchronous data placement in scientific workflows. Our results show that the influence of PDPS on overall workflow runtimes depends on the data characteristics of the executable workflow and the data placement policy being enforced.
Keywords :
natural sciences computing; scheduling; workflow management software; I/O intensive jobs; PDPS; Pegasus workflow management system; computational tasks; compute jobs; data clean-up jobs; data management operations; Montage workflow; policy based data placement; policy based data placement service; scientific workflow performance; stage-in jobs; stage-out jobs; synthetic workflow; workflow managers; Abstracts; Handheld computers; Processor scheduling; Runtime; Schedules; Servers; Distributed data management; Policy; Workflows
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Policies for Distributed Systems and Networks (POLICY), 2012 IEEE International Symposium on
Conference_Location :
Chapel Hill, NC
Print_ISBN :
978-1-4673-1993-5
Type :
conf
DOI :
10.1109/POLICY.2012.8
Filename :
6268005