DocumentCode :
1781105
Title :
A load balance algorithm based on nodes performance in Hadoop cluster
Author :
Zhipeng Gao ; Dangpeng Liu ; Yang Yang ; Jingchen Zheng ; Yuwen Hao
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2014
fDate :
17-19 Sept. 2014
Firstpage :
1
Lastpage :
4
Abstract :
MapReduce is an important distributed programming model for large-scale data-parallel applications like web indexing, data mining, and scientific simulation. Hadoop is an open-source implementation of MapReduce and it is often applied to short jobs for which low response time is critical. When the cluster nodes are homogeneous, Hadoop has a good performance. In practice, the homogeneity assumptions do not always hold. In heterogeneous environment, there are various devices which vary greatly in the capacities of computation, communication, architectures, memories and power. When different nodes process the same amount of data, load balancing problem occurs. In this paper we address the problem of how to assign data after Map phase to balance the execution time of each Reduce task by proposing a novel load balancing algorithm based on nodes performance (LBNP), in which the input data of poor performance nodes are decreased. Simulation results indicate that all the Reduce tasks can be completed in the same time which shortens the whole Reduce phase. Thus the efficiency of MapReduce is improved.
Keywords :
data handling; distributed programming; parallel processing; pattern clustering; resource allocation; software performance evaluation; Hadoop cluster; LBNP; Web indexing; data mining; distributed programming model; heterogeneous environment; homogeneity assumptions; large-scale data-parallel applications; load balancing algorithm; node performance; open-source MapReduce implementation; scientific simulation; Algorithm design and analysis; Data models; Indexes; Load management; Load modeling; Silicon; Tin; Hadoop; Heterogeneous environment; Load balance; MapReduce; Nodes performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Operations and Management Symposium (APNOMS), 2014 16th Asia-Pacific
Conference_Location :
Hsinchu
Type :
conf
DOI :
10.1109/APNOMS.2014.6996555
Filename :
6996555
Link To Document :
بازگشت