Title :
A load balance algorithm based on nodes performance in Hadoop cluster
Author :
Zhipeng Gao ; Dangpeng Liu ; Yang Yang ; Jingchen Zheng ; Yuwen Hao
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
MapReduce is an important distributed programming model for large-scale data-parallel applications such as web indexing, data mining, and scientific simulation. Hadoop is an open-source implementation of MapReduce and is often applied to short jobs for which low response time is critical. Hadoop performs well when the cluster nodes are homogeneous. In practice, the homogeneity assumption does not always hold. In a heterogeneous environment, devices vary greatly in computation capacity, communication capacity, architecture, memory, and power. When such nodes each process the same amount of data, a load balancing problem occurs, because slower nodes take longer to finish. In this paper we address the problem of how to assign data after the Map phase so as to balance the execution time of each Reduce task. We propose a novel load balancing algorithm based on nodes performance (LBNP), in which the input data assigned to poorly performing nodes is decreased. Simulation results indicate that all the Reduce tasks can be completed in the same amount of time, which shortens the whole Reduce phase and thus improves the efficiency of MapReduce.
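The idea of giving slower nodes less Reduce input can be illustrated with a minimal sketch. It is not the LBNP algorithm itself, whose details the abstract does not give. It assumes each node's performance is summarized by a single positive score (records processed per unit time) and that Reduce time is simply input size divided by that score. The names `proportional_shares`, `reduce_times`, and the node labels are hypothetical.

```python
from fractions import Fraction
from math import floor
from typing import Dict


def proportional_shares(total_records: int, perf: Dict[str, float]) -> Dict[str, int]:
    """Split total_records across nodes in proportion to their performance score.

    Assumption (not from the paper): performance is one positive number per node.
    """
    if total_records < 0:
        raise ValueError("total_records must be non-negative")
    if not perf or any(p <= 0 for p in perf.values()):
        raise ValueError("every node needs a positive performance score")

    # Exact rational arithmetic avoids rounding surprises when flooring.
    scores = {node: Fraction(p) for node, p in perf.items()}
    total_perf = sum(scores.values())
    exact = {node: total_records * s / total_perf for node, s in scores.items()}

    # Give each node the integer part of its ideal share first.
    shares = {node: floor(x) for node, x in exact.items()}
    leftover = total_records - sum(shares.values())

    # Hand the remaining records to the nodes with the largest fractional parts.
    by_fraction = sorted(exact, key=lambda n: exact[n] - shares[n], reverse=True)
    for node in by_fraction[:leftover]:
        shares[node] += 1
    return shares


def reduce_times(shares: Dict[str, int], perf: Dict[str, float]) -> Dict[str, float]:
    """Estimated Reduce time per node under the simple model time = input / score."""
    return {node: shares[node] / perf[node] for node in shares}


if __name__ == "__main__":
    perf = {"fast": 4.0, "mid": 2.0, "slow": 1.0}  # hypothetical scores
    shares = proportional_shares(700, perf)
    print(shares)                      # {'fast': 400, 'mid': 200, 'slow': 100}
    print(reduce_times(shares, perf))  # every node: 100.0
```

Under this simplified model, an even split of 700 records (about 233 per node) would leave the slowest node running for roughly 233 time units, while the proportional split finishes every node in 100.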
Keywords :
data handling; distributed programming; parallel processing; pattern clustering; resource allocation; software performance evaluation; Hadoop cluster; LBNP; Web indexing; data mining; distributed programming model; heterogeneous environment; homogeneity assumptions; large-scale data-parallel applications; load balancing algorithm; node performance; open-source MapReduce implementation; scientific simulation; Algorithm design and analysis; Data models; Indexes; Load management; Load modeling; Silicon; Tin; Hadoop; Heterogeneous environment; Load balance; MapReduce; Nodes performance;
Conference_Title :
Network Operations and Management Symposium (APNOMS), 2014 16th Asia-Pacific
Conference_Location :
Hsinchu
DOI :
10.1109/APNOMS.2014.6996555