DocumentCode :
688318
Title :
Load Balancing in Heterogeneous MapReduce Environments
Author :
Yuanquan Fan ; Weiguo Wu ; Depei Qian ; Yunlong Xu ; Wei Wei
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
fYear :
2013
fDate :
13-15 Nov. 2013
Firstpage :
1480
Lastpage :
1489
Abstract :
MapReduce has emerged as a popular computing model for parallel processing of big data. However, we observe that the native hash partitioning of MapReduce systems leads to frequent uneven data distribution among reduce tasks. The uneven data distribution results in load imbalance among reduce tasks, and thus hampers the performance of MapReduce systems. Moreover, the heterogeneity among cluster nodes exacerbates the negative effects of uneven data distribution due to varying performance of the heterogeneous nodes. To address the above issues, in this paper, we propose a novel load balancing approach with respect to the heterogeneity of clusters. This approach consists of two components: (1) performance estimation for reducers that run on heterogeneous nodes based on history of reduce tasks, and (2) heterogeneity-aware partitioning (HAP), which reallocates the input data for reduce tasks based on the performance estimation for reducers. We implement this approach as a plug-in of current MapReduce system. Experiment results show that our approach improves the performance of MapReduce jobs that run in heterogeneous systems, and incurs little overhead.
Keywords :
parallel programming; resource allocation; Big Data; cluster heterogeneity; data distribution; hash partitioning; heterogeneity-aware partitioning; heterogeneous MapReduce environment; heterogeneous systems; load balancing; parallel processing; performance estimation; Approximation algorithms; Computational modeling; History; Load management; Load modeling; Optimization; Partitioning algorithms; Load Balancing; MapReduce; heterogeneity-aware partitioning; heterogeneous cluster;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
Type :
conf
DOI :
10.1109/HPCC.and.EUC.2013.209
Filename :
6832091
Link To Document :
بازگشت