DocumentCode :
265863
Title :
Green MapReduce for heterogeneous data centers
Author :
Cavdar, Derya ; Chen, Lydia Y. ; Alagoz, Fatih
Author_Institution :
IBM Res. Zurich Lab., Zurich, Switzerland
fYear :
2014
fDate :
8-12 Dec. 2014
Firstpage :
1120
Lastpage :
1126
Abstract :
MapReduce has emerged as one of the key workloads in today\´s data centers, which constantly strive for an optimal tradeoff between energy consumption and performance. MapReduce alternates between computation and communication intensive phases with bursty workloads. The challenge to make execution of MapReduce green, lies in controlling server and network resources simultaneously. The related work offers various good solutions for homogenous systems, with the central theme of packing tasks into as small number of servers as possible and thus overlooking the possibility to "sleep" servers and network components. This paper considers a very bursty MapReduce workload with distinct CPU, memory and network requirements executed on heterogenous data centers, where servers have various CPU/memory capacities and execute request in a process-sharing manner. To reduce energy consumption while maintaining a low task response time, we propose an online energy minimization path algorithm, termed GEMS, to schedule MapReduce tasks, in cooperation with sleeping policies on servers as well as the switches. Using Google MapReduce traces, our simulation experiments show that our proposed solution gains a significant energy saving of 35% and meanwhile improves task response times by 35% on heterogenous data centers, compared to policies which are network agnostic or adopt no sleeping schedule. Overall, we achieve greener and faster MapReduce with (surprisingly) only a slightly higher number of servers, by considering energy consumption rather than conventional approach of considering power values only.
Keywords :
computer centres; data handling; green computing; parallel processing; power aware computing; scheduling; CPU; CPU capacities; GEMS; Google MapReduce traces; MapReduce task scheduling; bursty MapReduce workload; communication intensive phase; computation intensive phase; energy consumption; energy consumption reduction; energy saving; green MapReduce; heterogeneous data centers; heterogenous data centers; homogenous systems; memory capacities; memory requirements; network components; network requirements; network resource control; online energy minimization path algorithm; performance analysis; power values; process-sharing; server resource control; sleep servers; sleeping policies; solution gains; task response time; task response time improvement; Bandwidth; Energy consumption; Google; Memory management; Power demand; Servers; Time factors; MapReduce; energy optimization; heterogeneous data centers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Global Communications Conference (GLOBECOM), 2014 IEEE
Conference_Location :
Austin, TX
Type :
conf
DOI :
10.1109/GLOCOM.2014.7036959
Filename :
7036959
Link To Document :
بازگشت