• DocumentCode
    3579267
  • Title

    An approach to discover the best-fit factors for the optimal performance of Hadoop map reduce in virtualized environment

  • Author

    Vellaipandiyan, Solaimurugan ; Srikrishnan, V.

  • Author_Institution
    Centre for Development of Advanced Computing, Chennai, India
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    MapReduce, pioneered by Google, is mainly employed in Big Data analytics. In a MapReduce environment, most algorithms are reused for mining data. Predicting the execution time and system overhead of a MapReduce job is vital, because performance can be ascertained from them. Cloud computing is widely used as a computing platform in business and academic communities. Performance plays a major role when a user runs an application in the cloud, and the user may want to estimate the application's execution time (latency) before submitting a task or job. For the experiment, Hadoop clusters were deployed in a cloud environment, and system overhead was determined by running MapReduce jobs on these clusters. During the experiment, metrics such as network I/O, CPU and swap utilization, job completion time, resident set size (RSS), and virtual memory size (VSZ) were captured and evaluated to diagnose how Hadoop performance is influenced by changing the block size and the split size relative to the block size.
  • Keywords
    Big data; Cloud computing; Conferences; Measurement; Random access memory; Virtualization; Big Data; Distributed framework; Hadoop; MapReduce; Performance; VM;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
  • Print_ISBN
    978-1-4799-3974-9
  • Type

    conf

  • DOI
    10.1109/ICCIC.2014.7238471
  • Filename
    7238471