DocumentCode :
2446899
Title :
MapReduce in the Clouds for Science
Author :
Gunarathne, Thilina ; Wu, Tak-Lon ; Qiu, Judy ; Fox, Geoffrey
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
fYear :
2010
fDate :
Nov. 30 2010-Dec. 3 2010
Firstpage :
565
Lastpage :
572
Abstract :
The utility computing model introduced by cloud computing, combined with the rich set of cloud infrastructure services, offers a viable alternative to traditional servers and computing clusters. The MapReduce distributed data processing architecture has become the weapon of choice for data-intensive analyses in the clouds and in commodity clusters due to its excellent fault tolerance features, scalability, and ease of use. Currently, there are several options for using MapReduce in cloud environments, such as using MapReduce as a service, setting up one's own MapReduce cluster on cloud instances, or using specialized cloud MapReduce runtimes that take advantage of cloud infrastructure services. In this paper, we introduce Azure MapReduce, a novel MapReduce runtime built using the Microsoft Azure cloud infrastructure services. The Azure MapReduce architecture successfully leverages the high-latency, eventually consistent, yet highly scalable Azure infrastructure services to provide an efficient, on-demand alternative to traditional MapReduce clusters. Further, we evaluate the use and performance of MapReduce frameworks, including Azure MapReduce, in cloud environments for scientific applications, using sequence assembly and sequence alignment as use cases.
Keywords :
cloud computing; distributed processing; fault tolerant computing; MapReduce; cloud infrastructure services; commodity clusters; data intensive analyses; distributed data processing; fault tolerance; utility computing; Availability; Fault tolerant systems; Processor scheduling; Runtime; Scalability; AzureMapReduce; Elastic MapReduce; Hadoop
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-9405-7
Electronic_ISBN :
978-0-7695-4302-4
Type :
conf
DOI :
10.1109/CloudCom.2010.107
Filename :
5708501