DocumentCode
3588375
Title
Performance optimization of Hadoop cluster using Linux services
Author
Ahmed, Hameeza ; Ismail, Muhammad Ali ; Hyder, Muhammad Faraz
Author_Institution
Dept. of Comput. & Inf. Syst. Eng., NED Univ. of Eng. & Technol., Karachi, Pakistan
fYear
2014
Firstpage
167
Lastpage
172
Abstract
Hadoop is an open-source tool that enables distributed storage and processing of big data sets on commodity clusters. With Hadoop occupying a central position in current data processing, its performance optimization is being heavily studied. This paper introduces one such method, which improves Hadoop cluster performance by using rpcbind, a Remote Procedure Call (RPC) service of the Linux system. The comparison is done by executing multiple Hadoop benchmarks on a configured multi-node Hadoop cluster. The results favor rpcbind, showing that the service improves cluster performance by reducing the elapsed time of the benchmarks executed.
Keywords
Big Data; Linux; optimisation; parallel processing; public domain software; remote procedure calls; workstation clusters; Big Data sets; Hadoop cluster performance; Linux services; Linux system; RPC; commodity cluster computing; distributed storage; multinode Hadoop cluster; open source tool; performance optimization; remote procedure call; rpcbind service; Benchmark testing; Distributed databases; File systems; Java; Measurement; Servers
fLanguage
English
Publisher
IEEE
Conference_Titel
Multi-Topic Conference (INMIC), 2014 IEEE 17th International
Print_ISBN
978-1-4799-5754-5
Type
conf
DOI
10.1109/INMIC.2014.7097331
Filename
7097331