DocumentCode :
1944641
Title :
Scheduling in mapreduce-like systems for fast completion time
Author :
Chang, Hyunseok ; Kodialam, Murali ; Kompella, Ramana Rao ; Lakshman, T.V. ; Lee, Myungjin ; Mukherjee, Sarit
Author_Institution :
Bell Labs. Alcatel-Lucent, Holmdel, NJ, USA
fYear :
2011
fDate :
10-15 April 2011
Firstpage :
3074
Lastpage :
3082
Abstract :
Large-scale data processing needs of enterprises today are primarily met with distributed and parallel computing in data centers. MapReduce has emerged as an important programming model for these environments. Since today´s data centers run many MapReduce jobs in parallel, it is important to find a good scheduling algorithm that can optimize the completion times of these jobs. While several recent papers focused on optimizing the scheduler, there exists very little theoretical understanding of the scheduling problem in the context of MapReduce. In this paper, we seek to address this problem by first presenting a simplified abstraction of the MapReduce scheduling problem, and then formulate the scheduling problem as an optimization problem.We devise various online and offline algorithms to arrive at a good ordering of jobs to minimize the overall job completion times. Since optimal solutions are hard to compute (NP-hard), we propose approximation algorithms that work within a factor of 3 of the optimal. Using simulations, we also compare our online algorithm with standard scheduling strategies such as FIFO, Shortest Job First and show that our algorithm consistently outperforms these across different job distributions.
Keywords :
computational complexity; optimisation; parallel processing; scheduling; MapReduce scheduling problem; MapReduce-like systems; NP-hard; approximation algorithm; data centers; distributed computing; fast completion time; optimization problem; parallel computing; programming model; scheduling algorithm; standard scheduling strategies; Approximation algorithms; Approximation methods; Equations; Optimal scheduling; Optimized production technology; Processor scheduling; Program processors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INFOCOM, 2011 Proceedings IEEE
Conference_Location :
Shanghai
ISSN :
0743-166X
Print_ISBN :
978-1-4244-9919-9
Type :
conf
DOI :
10.1109/INFCOM.2011.5935152
Filename :
5935152
Link To Document :
بازگشت