Title :
Performance optimization of Hadoop cluster using linux services
Author :
Ahmed, Hameeza ; Ismail, Muhammad Ali ; Hyder, Muhammad Faraz
Author_Institution :
Dept. of Comput. & Inf. Syst. Eng., NED Univ. of Eng. & Technol., Karachi, Pakistan
Abstract :
Hadoop is an open source tool. It enables the processing and distributed storage of big data sets using commodity cluster computing. With Hadoop occupying a core status in the current processing era, its performance optimization is also being heavily studied. This paper introduces one such method to improve Hadoop cluster performance by using a Remote Procedure Call (RPC), rpcbind service of the Linux system. The comparison is done by executing multiple Hadoop benchmarks on a configured multi-node Hadoop cluster. The final outcome turns in rpcbind favor depicting how the service improves the cluster performance by reducing the elapsed time of the benchmark executed.
Keywords :
Big Data; Linux; optimisation; parallel processing; public domain software; remote procedure calls; workstation clusters; Big Data sets; Hadoop cluster performance; Linux services; Linux system; RPC; commodity cluster computing; distributed storage; multinode Hadoop cluster; open source tool; performance optimization; remote procedure call; rpcbind service; Benchmark testing; Distributed databases; File systems; Java; Linux; Measurement; Servers;
Conference_Titel :
Multi-Topic Conference (INMIC), 2014 IEEE 17th International
Print_ISBN :
978-1-4799-5754-5
DOI :
10.1109/INMIC.2014.7097331