Title :
A Service Framework for Scientific Workflow Management in the Cloud
Author :
Yong Zhao ; Youfu Li ; Raicu, Ioan ; Shiyong Lu ; Cui Lin ; Yanzhe Zhang ; Wenhong Tian ; Ruini Xue
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
Cloud computing is an emerging computing paradigm that can offer unprecedented scalability and resources on demand, and is getting more and more adoption in the science community, while scientific workflow management systems provide essential support such as management of data and task dependencies, job scheduling and execution, provenance tracking, etc., to scientific computing. As we are entering into a “big data” era, it is imperative to migrate scientific workflow management systems into the cloud to manage the ever increasing data scale and analysis complexity. We propose a reference service framework for integrating scientific workflow management systems into various cloud platforms, which consists of eight major components, including Cloud Workflow Management Service, Cloud Resource Manager, etc., and six interfaces between them. We also present a reference framework for the implementation of Cloud Resource Manager, which is responsible for the provisioning and management of virtual resources in the cloud. We discuss our implementation of the framework by integrating the Swift scientific workflow management system with the OpenNebula and Eucalyptus cloud platforms, and demonstrate the capability of the solution using a NASA MODIS image processing workflow and a production deployment on the Science@Guoshi network with support for the Montage image mosaic workflow.
Keywords :
Big Data; cloud computing; data analysis; image processing; natural sciences computing; resource allocation; workflow management software; Eucalyptus cloud platform; Montage image mosaic workflow; NASA MODIS image processing workflow; OpenNebula cloud platform; Science@Guoshi network; Swift scientific workflow management system; big data era; cloud computing; cloud resource manager; cloud workflow management service; data analysis complexity; data management; data scale; job scheduling; production deployment; provenance tracking; reference service framework; science community; scientific computing; scientific workflow management systems; task dependencies; virtual resource management; virtual resource provisioning; Cloud computing; Computer architecture; Processor scheduling; Resource management; Scalability; Virtual machining; Cloud workflow; cloud resource management; reference service framework; swift; virtual cluster provisioning; workflow-as-a-service;
Journal_Title :
Services Computing, IEEE Transactions on
DOI :
10.1109/TSC.2014.2341235