• DocumentCode
    688227
  • Title

    A Hadoop MapReduce Performance Prediction Method

  • Author

    Ge Song ; Zide Meng ; Huet, Fabrice ; Magoules, Frederic ; Lei Yu ; Xuelian Lin

  • Author_Institution
    Univ. of Nice Sophia Antipolis, Nice, France
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    820
  • Lastpage
    825
  • Abstract
    More and more Internet companies rely on large scale data analysis as part of their core services for tasks such as log analysis, feature extraction or data filtering. Map-Reduce, through its Hadoop implementation, has proved to be an efficient model for dealing with such data. One important challenge when performing such analysis is to predict the performance of individual jobs. In this paper, we propose a simple framework to predict the performance of Hadoop jobs. It is composed of a dynamic light-weight Hadoop job analyzer, and a prediction module using locally weighted regression methods. Our framework makes some theoretical cost models more practical, and also well fits for the diversification of the jobs and clusters. It can also help those users who want to predict the cost when applying for an on-demand cloud service. At the end, we do some experiments to verify our framework.
  • Keywords
    Internet; parallel algorithms; regression analysis; Hadoop MapReduce performance prediction method; Internet companies; core services; data analysis; data filtering; feature extraction; locally weighted regression methods; log analysis; Accuracy; Complexity theory; Correlation; Data conversion; History; Predictive models; Training; Hadoop; Job Analyzer; Locally Weighted Regression; MapReduce; Performance Prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.118
  • Filename
    6832000