DocumentCode
2995763
Title
GreenPipe: A Hadoop Based Workflow System on Energy-efficient Clouds
Author
Mao, Yaokuan ; Wu, Wenjun ; Zhang, Hui ; Luo, Liang
Author_Institution
State Key Software Dev. Environ. Lab., Beihang Univ., Beijing, China
fYear
2012
fDate
21-25 May 2012
Firstpage
2211
Lastpage
2219
Abstract
Cloud computing is increasingly becoming a popular solution to massive data analysis in bioinformatics. In order to enable scientists to harness the computing power provided by Cloud platforms, we designed Green Pipe, a scalable computational workflow system, which runs jobs as MapReduce tasks on virtual Hadoop clusters. This paper introduces a power-aware scheduling algorithm in the workflow engine to optimize workflow execution in terms of running time and energy consumption. Experimental results demonstrate the performance improvement in Green Pipe.
Keywords
bioinformatics; cloud computing; data analysis; data flow analysis; power aware computing; workflow management software; GreenPipe; Hadoop based workflow system; MapReduce tasks; bioinformatics; cloud computing; computational workflow system; energy consumption; energy-efficient clouds; massive data analysis; power-aware scheduling algorithm; time consumption; virtual Hadoop clusters; workflow engine; workflow execution optimization; Bioinformatics; Cloud computing; Connectors; Databases; Green products; Proteins; XML; cloud computing; schedule; virtual machine; workflow;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location
Shanghai
Print_ISBN
978-1-4673-0974-5
Type
conf
DOI
10.1109/IPDPSW.2012.273
Filename
6270584
Link To Document