• DocumentCode
    119393
  • Title

    An Adaptive Auto-configuration Tool for Hadoop

  • Author

    ChangLong Li ; Hang Zhuang ; Kun Lu ; MingMing Sun ; Jinhong Zhou ; Dong Dai ; Xuehai Zhou

  • Author_Institution
    Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2014
  • fDate
    4-7 Aug. 2014
  • Firstpage
    69
  • Lastpage
    72
  • Abstract
    With the coming concept of ´big data´, the ability to handle large datasets has become a critical consideration for the success of industrial organizations such as Google, Amazon, Yahoo! and Facebook. As an important Cloud Computing framework for bulk data processing, Hadoop is widely used in these organizations. However, the performance of MapReduce is seriously limited by its stiff configuration strategy. Even for a single simple job in Hadoop, a large number of tuning parameters have to be set by users. This may easily lead to performance loss due to some misconfigurations. In this paper, we present an adaptive automatic configuration tool (AACT) for Hadoop to achieve performance optimization. To achieve this goal, we propose a mathematical model which will accurately learn the relationship between system performance and configuration parameters, then configure Hadoop system based on this mathematical model. With the help of AACT, Hadoop is able to adapt the hardware and software configurations dynamically and drive the system to an optimal configuration in acceptable time. Experimental results show its efficiency and adaptability, and that it is ten times faster compared with default configuration.
  • Keywords
    Big Data; cloud computing; social networking (online); AACT; Amazon; Facebook; Google; MapReduce; Yahoo!; adaptive auto-configuration to; adaptive automatic configuration tool; big data; bulk data processing; cloud computing framework; hadoop; hardware configuration; industrial organizations; mathematical model; optimal configuration; performance optimization; software configuration; tuning parameters; Benchmark testing; Cloud computing; Computers; Hardware; Mathematical model; Optimization; System performance; Auto-Configuration; Hadoop; Self-Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering of Complex Computer Systems (ICECCS), 2014 19th International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-1-4799-5481-0
  • Type

    conf

  • DOI
    10.1109/ICECCS.2014.17
  • Filename
    6923119