DocumentCode
142178
Title
A usage-aware scheduler for improving MapReduce performance in heterogeneous environments
Author
Hsiao, J.H. ; Kao, S.J.
Author_Institution
Dept. of Comput. Sci. & Eng., Nat. Chung Hsing Univ., Taichung, Taiwan
Volume
3
fYear
2014
fDate
26-28 April 2014
Firstpage
1648
Lastpage
1652
Abstract
Big data cannot be efficiently dealt with using most relational database management systems, as usually it requires parallel execution on a large amount of servers. MapReduce is suitable for processing large data sets, however, most traditional MapReduce schedulers assume that system is homogeneous and all tasks are executed equally in time. In reality, the completion time of a MapReduce job may be delayed due to slower tasks. This paper presents a usage-aware MapReduce scheduler to deal with the system heterogeneity by including task execution time in scheduling. Inspiration from the ideas of both the Fair scheduler and LATE scheduler, our usage-aware scheduler is able to reduce the overall completion time of MapReduce applications. Experimental results show that a reduction of up to 30% of job execution time is attainable.
Keywords
Big Data; parallel processing; relational databases; scheduling; Big data; fair scheduler; heterogeneous environments; late scheduler; relational database management systems; task execution time; usage-aware MapReduce scheduler; Acceleration; Benchmark testing; Hardware; Organizations; Scheduling; Scheduling algorithms; Hadoop; mapreduce; scheduler;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on
Conference_Location
Sapporo
Print_ISBN
978-1-4799-3196-5
Type
conf
DOI
10.1109/InfoSEEE.2014.6946201
Filename
6946201
Link To Document