DocumentCode :
3256212
Title :
Towards Energy Efficiency in Heterogeneous Hadoop Clusters by Adaptive Task Assignment
Author :
Dazhao Cheng ; Lama, Palden ; Changjun Jiang ; Xiaobo Zhou
Author_Institution :
Dept. of Comput. Sci., Univ. of Colorado, Colorado Springs, CO, USA
fYear :
2015
fDate :
June 29 2015-July 2 2015
Firstpage :
359
Lastpage :
368
Abstract :
The cost of powering servers, storage platforms and related cooling systems has become a major component of the operational costs in big data deployments. Hence, the design of energy-efficient Hadoop clusters has attracted significant research attentions in recent years. However, existing studies do not consider the impact of the complex interplay between workload and hardware heterogeneity on energy efficiency. In this paper, we find that heterogeneity-oblivious task assignment approaches are detrimental to both performance and energy efficiency of Hadoop clusters. Importantly, we make a counterintuitive observation that even heterogeneity-aware techniques that focus on reducing job completion time do not necessarily guarantee energy efficiency. We propose a heterogeneity-aware task assignment approach, E-Ant, that aims to minimize the overall energy consumption in a heterogeneous Hadoop cluster without sacrificing job performance. It adaptively schedules heterogeneous workloads on energy-efficient machines, without a priori knowledge of the workload properties. Furthermore, it provides the flexibility to trade off energy efficiency and job fairness in a Hadoop cluster. E-Ant employs an ant colony optimization approach that generates task assignment solutions based on the feedback of each task´s energy consumption reported by Hadoop Task Trackers in an agile way. Experimental results on a heterogeneous cluster with varying hardware capabilities show that E-Ant improves the overall energy savings for a synthetic workload from Microsoft by 17% and 12% compared to Fair Scheduler and Tarazu, respectively.
Keywords :
Big Data; ant colony optimisation; data handling; energy conservation; energy consumption; parallel processing; power aware computing; scheduling; Big Data deployments; Hadoop task trackers; adaptive task assignment; ant colony optimization approach; cooling systems; e-ant; energy efficiency; energy-efficient Hadoop cluster design; hardware heterogeneity; heterogeneity-aware task assignment approach; heterogeneous Hadoop clusters; heterogeneous workload scheduling; operational costs; storage platforms; synthetic workload; task energy consumption; Accuracy; Adaptation models; Benchmark testing; Energy consumption; Hardware; Power demand; Servers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems (ICDCS), 2015 IEEE 35th International Conference on
Conference_Location :
Columbus, OH
ISSN :
1063-6927
Type :
conf
DOI :
10.1109/ICDCS.2015.44
Filename :
7164922
Link To Document :
بازگشت