• DocumentCode
    3588375
  • Title

    Performance optimization of Hadoop cluster using linux services

  • Author

    Ahmed, Hameeza ; Ismail, Muhammad Ali ; Hyder, Muhammad Faraz

  • Author_Institution
    Dept. of Comput. & Inf. Syst. Eng., NED Univ. of Eng. & Technol., Karachi, Pakistan
  • fYear
    2014
  • Firstpage
    167
  • Lastpage
    172
  • Abstract
    Hadoop is an open source tool. It enables the processing and distributed storage of big data sets using commodity cluster computing. With Hadoop occupying a core status in the current processing era, its performance optimization is also being heavily studied. This paper introduces one such method to improve Hadoop cluster performance by using a Remote Procedure Call (RPC), rpcbind service of the Linux system. The comparison is done by executing multiple Hadoop benchmarks on a configured multi-node Hadoop cluster. The final outcome turns in rpcbind favor depicting how the service improves the cluster performance by reducing the elapsed time of the benchmark executed.
  • Keywords
    Big Data; Linux; optimisation; parallel processing; public domain software; remote procedure calls; workstation clusters; Big Data sets; Hadoop cluster performance; Linux services; Linux system; RPC; commodity cluster computing; distributed storage; multinode Hadoop cluster; open source tool; performance optimization; remote procedure call; rpcbind service; Benchmark testing; Distributed databases; File systems; Java; Linux; Measurement; Servers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multi-Topic Conference (INMIC), 2014 IEEE 17th International
  • Print_ISBN
    978-1-4799-5754-5
  • Type

    conf

  • DOI
    10.1109/INMIC.2014.7097331
  • Filename
    7097331