• DocumentCode
    688318
  • Title

    Load Balancing in Heterogeneous MapReduce Environments

  • Author

    Yuanquan Fan ; Weiguo Wu ; Depei Qian ; Yunlong Xu ; Wei Wei

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    1480
  • Lastpage
    1489
  • Abstract
    MapReduce has emerged as a popular computing model for parallel processing of big data. However, we observe that the native hash partitioning of MapReduce systems leads to frequent uneven data distribution among reduce tasks. The uneven data distribution results in load imbalance among reduce tasks, and thus hampers the performance of MapReduce systems. Moreover, the heterogeneity among cluster nodes exacerbates the negative effects of uneven data distribution due to varying performance of the heterogeneous nodes. To address the above issues, in this paper, we propose a novel load balancing approach with respect to the heterogeneity of clusters. This approach consists of two components: (1) performance estimation for reducers that run on heterogeneous nodes based on history of reduce tasks, and (2) heterogeneity-aware partitioning (HAP), which reallocates the input data for reduce tasks based on the performance estimation for reducers. We implement this approach as a plug-in of current MapReduce system. Experiment results show that our approach improves the performance of MapReduce jobs that run in heterogeneous systems, and incurs little overhead.
  • Keywords
    parallel programming; resource allocation; Big Data; cluster heterogeneity; data distribution; hash partitioning; heterogeneity-aware partitioning; heterogeneous MapReduce environment; heterogeneous systems; load balancing; parallel processing; performance estimation; Approximation algorithms; Computational modeling; History; Load management; Load modeling; Optimization; Partitioning algorithms; Load Balancing; MapReduce; heterogeneity-aware partitioning; heterogeneous cluster;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.209
  • Filename
    6832091