• DocumentCode
    142178
  • Title

    A usage-aware scheduler for improving MapReduce performance in heterogeneous environments

  • Author

    Hsiao, J.H. ; Kao, S.J.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nat. Chung Hsing Univ., Taichung, Taiwan
  • Volume
    3
  • fYear
    2014
  • fDate
    26-28 April 2014
  • Firstpage
    1648
  • Lastpage
    1652
  • Abstract
    Big data cannot be efficiently dealt with using most relational database management systems, as usually it requires parallel execution on a large amount of servers. MapReduce is suitable for processing large data sets, however, most traditional MapReduce schedulers assume that system is homogeneous and all tasks are executed equally in time. In reality, the completion time of a MapReduce job may be delayed due to slower tasks. This paper presents a usage-aware MapReduce scheduler to deal with the system heterogeneity by including task execution time in scheduling. Inspiration from the ideas of both the Fair scheduler and LATE scheduler, our usage-aware scheduler is able to reduce the overall completion time of MapReduce applications. Experimental results show that a reduction of up to 30% of job execution time is attainable.
  • Keywords
    Big Data; parallel processing; relational databases; scheduling; Big data; fair scheduler; heterogeneous environments; late scheduler; relational database management systems; task execution time; usage-aware MapReduce scheduler; Acceleration; Benchmark testing; Hardware; Organizations; Scheduling; Scheduling algorithms; Hadoop; mapreduce; scheduler;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on
  • Conference_Location
    Sapporo
  • Print_ISBN
    978-1-4799-3196-5
  • Type

    conf

  • DOI
    10.1109/InfoSEEE.2014.6946201
  • Filename
    6946201