Title :
MapReduce in the Clouds for Science
Author :
Gunarathne, Thilina ; Wu, Tak-Lon ; Qiu, Judy ; Fox, Geoffrey
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
Date :
Nov. 30 2010-Dec. 3 2010
Abstract :
The utility computing model introduced by cloud computing, combined with the rich set of cloud infrastructure services, offers a viable alternative to traditional servers and computing clusters. The MapReduce distributed data processing architecture has become the weapon of choice for data-intensive analyses in the clouds and in commodity clusters due to its excellent fault tolerance features, scalability, and ease of use. Currently, there are several options for using MapReduce in cloud environments, such as using MapReduce as a service, setting up one's own MapReduce cluster on cloud instances, or using specialized cloud MapReduce runtimes that take advantage of cloud infrastructure services. In this paper, we introduce Azure MapReduce, a novel MapReduce runtime built using the Microsoft Azure cloud infrastructure services. The Azure MapReduce architecture successfully leverages the high-latency, eventually consistent, yet highly scalable Azure infrastructure services to provide an efficient, on-demand alternative to traditional MapReduce clusters. Further, we evaluate the use and performance of MapReduce frameworks, including Azure MapReduce, in cloud environments for scientific applications, using sequence assembly and sequence alignment as use cases.
Keywords :
cloud computing; distributed processing; fault tolerant computing; MapReduce; cloud infrastructure services; commodity clusters; data intensive analyses; distributed data processing; fault tolerance; utility computing; Availability; Fault tolerant systems; Processor scheduling; Runtime; Scalability; AzureMapReduce; Elastic MapReduce; Hadoop
Conference_Title :
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-9405-7
Electronic_ISBN :
978-0-7695-4302-4
DOI :
10.1109/CloudCom.2010.107