DocumentCode :
659405
Title :
HFSP: Size-based scheduling for Hadoop
Author :
Pastorelli, Michele ; Barbuzzi, Antonio ; Carra, Damiano ; Dell´Amico, Matteo ; Michiardi, Pietro
Author_Institution :
EURECOM, Sophia-Antipolis, France
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
51
Lastpage :
59
Abstract :
Size-based scheduling with aging has, for long, been recognized as an effective approach to guarantee fairness and near-optimal system response times. We present HFSP, a scheduler introducing this technique to a real, multi-server, complex and widely used system such as Hadoop. Size-based scheduling requires a priori job size information, which is not available in Hadoop: HFSP builds such knowledge by estimating it on-line during job execution. Our experiments, which are based on realistic workloads generated via a standard benchmarking suite, pinpoint at a significant decrease in system response times with respect to the widely used Hadoop Fair scheduler, and show that HFSP is largely tolerant to job size estimation errors.
Keywords :
distributed processing; resource allocation; scheduling; HFSP scheduler; Hadoop Fair scheduler; fairness; job execution; job size estimation errors; job size information; near-optimal system response times; size-based scheduling; Abstracts; Aging; Estimation error; Processor scheduling; Schedules; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691554
Filename :
6691554
Link To Document :
بازگشت