DocumentCode
623750
Title
A new analytical technique for designing provably efficient MapReduce schedulers
Author
Yousi Zheng ; Shroff, Ness B. ; Sinha, Pradeep
Author_Institution
Dept. of Electr. & Comput. Eng., Ohio State Univ., Columbus, OH, USA
fYear
2013
fDate
14-19 April 2013
Firstpage
1600
Lastpage
1608
Abstract
With the rapid increase in size and number of jobs that are being processed in the MapReduce framework, efficiently scheduling jobs under this framework is becoming increasingly important. We consider the problem of minimizing the total flowtime of a sequence of jobs in the MapReduce framework, where the jobs arrive over time and need to be processed through both Map and Reduce procedures before leaving the system. We show that for this problem for non-preemptive tasks, no on-line algorithm can achieve a constant competitive ratio (defined as the ratio between the completion time of the online algorithm to the completion time of the optimal non-causal off-line algorithm). We then construct a slightly weaker metric of performance called the efficiency ratio. An online algorithm is said to achieve an efficiency ratio of γ when the flow-time incurred by that scheduler divided by the minimum flow-time achieved over all possible schedulers is almost surely less than or equal to γ. Under some weak assumptions, we then show a surprising property that, for the flow-time problem, any work-conserving scheduler has a constant efficiency ratio in both preemptive and nonpreemptive scenarios. More importantly, we are able to develop an online scheduler with a very small efficiency ratio (2), and through simulations we show that it outperforms the state-of-the-art schedulers.
Keywords
data handling; parallel algorithms; parallel programming; scheduling; MapReduce framework; MapReduce schedulers; completion time; constant efficiency ratio; flow-time problem; job scheduling; nonpreemptive tasks; online algorithm; online scheduler; optimal noncausal off-line algorithm; work-conserving scheduler; Algorithm design and analysis; Delays; Schedules; Scheduling algorithms; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2013 Proceedings IEEE
Conference_Location
Turin
ISSN
0743-166X
Print_ISBN
978-1-4673-5944-3
Type
conf
DOI
10.1109/INFCOM.2013.6566956
Filename
6566956
Link To Document