Title :
Performance-driven task co-scheduling for MapReduce environments
Author :
Jordà Polo;David Carrera;Yolanda Becerra;Malgorzata Steinder;Ian Whalley
Author_Institution :
Jordi Torres and Eduard Ayguadé, Barcelona Supercomputing Center (BSC) - Technical University of Catalonia (UPC), Spain
Abstract :
MapReduce is a data-driven programming model, proposed by Google in 2004, that is especially well suited to distributed data analytics applications. We consider the management of MapReduce applications in an environment where multiple applications share the same physical resources. Such sharing is in line with recent trends in data center management that aim to consolidate workloads in order to achieve cost and energy savings. In a shared environment, it is necessary to predict and manage the performance of workloads given a set of performance goals defined for them. In this paper, we address this problem by introducing a new task scheduler for a MapReduce framework that allows performance-driven management of MapReduce tasks. The proposed task scheduler dynamically predicts the performance of concurrent MapReduce jobs and adjusts their resource allocation accordingly. It allows applications to meet their performance objectives without over-provisioning physical resources.
Keywords :
"Resource management","File systems","Programming profession","Environmental management","Concurrent computing","Runtime environment","Data analysis","Energy management","Costs","Dynamic scheduling"
Conference_Title :
Network Operations and Management Symposium (NOMS), 2010 IEEE
Print_ISBN :
978-1-4244-5366-5
Electronic_ISBN :
2374-9709
DOI :
10.1109/NOMS.2010.5488494