Title :
A performance analysis of MapReduce applications on big data in cloud based Hadoop
Author :
Gohil, Parth ; Garg, Dweepna ; Panchal, Bakul
Author_Institution :
Dept. of Comput. Sci. & Eng., Gov. Eng. Coll., Modasa, India
Abstract :
MapReduce is one of the most popular programming model for big data analysis in Distributed and Parallel Computing Environment. It is used for implementing parallel applications. With the growing development of mobile Internet and cloud computing, the issues related to big data have been a matter of concern in both industry and academy. There are several platforms for users to develop their applications based on MapReduce framework such as Hadoop. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. This paper discusses various MapReduce applications like Wordcount, Pi, TeraSort, Grep in Cloud based Hadoop. We have shown experimental results of these applications on Amazon EC2 using two types of Ubuntu instances. In this paper, performance of above application has been shown with respect to execution time and number of nodes. We find in our research study that as the number of nodes increases the execution time decreases and performance increases.
Keywords :
Big Data; cloud computing; parallel processing; software performance evaluation; Amazon EC2; Grep; Java-based programming framework; MapReduce applications; Pi; TeraSort; Ubuntu instances; Wordcount; big data analysis; cloud based Hadoop; distributed computing environment; performance analysis; Big data; Cloud computing; Educational institutions; File systems; Hardware; Programming; Servers; Amazon EC2; Big Data; Cloud Computing; HDFS; Hadoop; MapReduce;
Conference_Titel :
Information Communication and Embedded Systems (ICICES), 2014 International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4799-3835-3
DOI :
10.1109/ICICES.2014.7033791