• DocumentCode
    650576
  • Title

    Improving Multi-job MapReduce Scheduling in an Opportunistic Environment

  • Author

    Yuting Ji ; Lang Tong ; Ting He ; Jian Tan ; Kang-Won Lee ; Li Zhang

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Cornell Univ., Ithaca, NY, USA
  • fYear
    2013
  • fDate
    June 28 2013-July 3 2013
  • Firstpage
    9
  • Lastpage
    16
  • Abstract
    As a state-of-the-art programming model for big data analytics, MapReduce is well suited for parallel processing of large data sets in opportunistic environments. Existing research on MapReduce in opportunistic environment has focused on improving single job performance, the issue of fairness that is critical in the more dominant scenario of multiple concurrent jobs remains unexplored. We address this problem by proposing an opportunistic fair scheduling algorithm, which extends the broadly adopted Fair Scheduler to an environment where nodes are intermittently available with possibly different availability patterns. The proposed scheduler maintains statistics specific to the opportunistic environment, e.g., node availability rates and pairwise availability correlations, and utilizes this information in scheduling decisions to improve fairness. Using a Hadoop-based implementation, we compare our scheduler with the current Hadoop Fair Scheduler on representative benchmarks. Our experiments verify that our scheduler can significantly reduce the variability in job completion times.
  • Keywords
    cloud computing; data analysis; parallel programming; public domain software; scheduling; Hadoop fair scheduler; big data analytics; concurrent jobs; job completion times; multijob MapReduce scheduling; opportunistic environment; opportunistic fair scheduling algorithm; parallel processing; programming model; scheduling decisions; single job performance; Availability; Correlation; Hardware; History; Processor scheduling; Schedules; Scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference on
  • Conference_Location
    Santa Clara, CA
  • Print_ISBN
    978-0-7695-5028-2
  • Type

    conf

  • DOI
    10.1109/CLOUD.2013.84
  • Filename
    6676672