Title :
Improving Resource Utilization in MapReduce
Author :
Guo, Zhenhua ; Fox, Geoffrey ; Zhou, Mo ; Ruan, Yang
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
Abstract :
MapReduce has been adopted widely in both academia and industry to run large-scale data parallel applications. In MapReduce, each slave node hosts a number of task slots to which tasks can be assigned. So they limit the maximum number of tasks that can execute concurrently on each node. When all task slots of a node are not used, the resources “reserved” for idle slots are unutilized. To improve resource utilization, we propose resource stealing to enable running tasks to steal resources reserved for idle slots and give them back proportionally whenever new tasks are assigned. Resource stealing makes the otherwise wasted resources get fully utilized without interfering with normal job scheduling. MapReduce uses speculative execution to improve fault tolerance. Current Hadoop implementation decides whether to run speculative tasks based on the progress rates of running tasks, which does not take into consideration the absolute progress of each task. We propose Benefit Aware Speculative Execution which evaluates the potential benefit of speculative tasks and eliminates unnecessary runs. We implement the proposed algorithms in Hadoop, and our experiments show that our algorithms can significantly shorten job execution time and reduce the number of non-beneficial speculative tasks.
Keywords :
data handling; fault tolerant computing; parallel processing; resource allocation; scheduling; Hadoop implementation; MapReduce; benefit aware speculative execution; fault tolerance; idle slot; job execution time; job scheduling; large-scale data parallel application; resource stealing; resource utilization; slave node; speculative task; task slot; Energy consumption; Hardware; Nickel; Parallel processing; Peer to peer computing; Radio frequency; Resource management; MapReduce; scheduling; speculative execution; utilization;
Conference_Titel :
Cluster Computing (CLUSTER), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2422-9
DOI :
10.1109/CLUSTER.2012.69