• DocumentCode
    558669
  • Title

    Mitigating the negative impact of preemption on heterogeneous MapReduce workloads

  • Author

    Cheng, Lu ; Zhang, Qi ; Boutaba, Raouf

  • Author_Institution
    David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
  • fYear
    2011
  • fDate
    24-28 Oct. 2011
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    Modern production clusters are often shared by multiple types of jobs with different priorities in order to improve resource utilization. Preemption is a common technique employed by MapReduce schedulers to avoid delaying production jobs while allowing the cluster to be shared by other non-production jobs. It also prevents a large job from occupying too many resources and starving others. Recent literature shows that jobs in production MapReduce clusters have a mixture of lengths and sizes spanning many orders of magnitude. In this type of environment, the current preemption policy used by MapReduce schedulers can significantly delay the completion time of long-running tasks, resulting in wasted resources. This paper first discusses the heterogeneous nature of MapReduce jobs and their arrival rates in several production clusters. Second, we characterize the situations where the current preemption policy causes a significant preemption penalty. We then propose a simple mechanism that works in conjunction with existing job schedulers to address this problem. Finally, we evaluate our solution under various types of workloads in Amazon EC2. Experiments show that our method can improve system normalized performance by 15% during busy periods by effectively avoiding unnecessary preemption while preserving fairness.
  • Keywords
    cloud computing; production control; Amazon EC2; MapReduce schedulers; heterogeneous MapReduce workloads; negative impact; preemption policy; production clusters; production jobs; resource utilization; Companies; Facebook; Google; Integrated circuits; Internet; Production; Time factors; Cloud Computing; Hadoop; MapReduce
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Network and Service Management (CNSM), 2011 7th International Conference on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4577-1588-4
  • Electronic_ISBN
    978-3-901882-44-9
  • Type

    conf

  • Filename
    6103959