DocumentCode :
3203104
Title :
A Physical and Virtual Compute Cluster Resource Load Balancing Approach to Data-Parallel Scientific Workflow Scheduling
Author :
Wang, Jianwu ; Korambath, Prakashan ; Altintas, Ilkay
Author_Institution :
San Diego Supercomput. Center, UCSD, San Diego, CA, USA
fYear :
2011
fDate :
4-9 July 2011
Firstpage :
212
Lastpage :
215
Abstract :
To execute workflows on a compute cluster resource, workflow engines can work with cluster resource manager software to distribute jobs into compute nodes on the cluster. We discuss how to interact with traditional Oracle Grid Engine and recent Hadoop cluster resource managers using a dataflow-based scheduling approach to balance compute resource load for data-parallel workflow execution. Our experiments show that: 1) The presented approach can balance computational resource load well by interacting with the resource managers and provides good execution performance on both physical and virtual clusters, 2) Oracle Grid Engine outperforms Hadoop for CPU-intensive applications on small-scale clusters.
Keywords :
data flow computing; grid computing; natural sciences computing; resource allocation; scheduling; workflow management software; CPU-intensive applications; Hadoop cluster resource manager; Oracle Grid Engine; computational resource load; compute nodes; data-parallel scientific workflow scheduling; dataflow-based scheduling; job distribution; physical cluster; virtual compute cluster resource load balancing; workflow engine; workflow execution; Biochemistry; Cloud computing; Engines; File systems; Load management; Processor scheduling; Servers; cluster resource manager comparison; data-parallel workflow scheduling; load balancing; virtual cluster;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Services (SERVICES), 2011 IEEE World Congress on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4577-0879-4
Electronic_ISBN :
978-0-7695-4461-8
Type :
conf
DOI :
10.1109/SERVICES.2011.50
Filename :
6012715
Link To Document :
بازگشت