Title :
Scheduling of Scientific Workflows on Data Grids
Author :
Pandey, Suraj ; Buyya, Rajkumar
Author_Institution :
Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Melbourne, VIC
Abstract :
Selection of resources for execution of scientific workflows in data grids becomes challenging with the exponential growth of files as a result of the distribution of scientific experiments around the world. With more runs of these experiments, the huge number of data files produced can be made available from numerous resources. There is a lack of work on optimal selection of data hosts and compute resources in the presence of replicated files for scientific workflows. Foreseeing this, the thesis work aims at proposing novel workflow scheduling algorithms on data grids with a large number of replicated files that incorporate practical constraints in heterogeneous environments such as grids. In this paper, we define the workflow scheduling problem statement in the context of data grids, supported by motivating applications; list research issues arising from practical constraints; propose two algorithms for experimenting with the problem; and report simulation results obtained from preliminary studies. The results are promising enough to motivate further research on the stated problem.
Keywords :
grid computing; natural sciences computing; scheduling; workflow management software; data files; data grid; replicated files; scientific experiment; scientific workflow; workflow scheduling; Computer science; Costs; Distributed computing; Grid computing; Laboratories; Large Hadron Collider; Observatories; Processor scheduling; Scheduling algorithm; Software engineering; Data-intensive Scheduling; Grid; Scheduling Workflows; Workflows;
Conference_Title :
Cluster Computing and the Grid, 2008. CCGRID '08. 8th IEEE International Symposium on
Conference_Location :
Lyon
Print_ISBN :
978-0-7695-3156-4
Electronic_ISBN :
978-0-7695-3156-4
DOI :
10.1109/CCGRID.2008.32