• DocumentCode
    154153
  • Title

    A Batch System with Fair Scheduling for Evolving Applications

  • Author

    Prabhakaran, Suraj ; Iqbal, M. ; Rinke, Sebastian ; Windisch, Christian ; Wolf, Felix

  • Author_Institution
    German Res. Sch. for Simulation Sci., Aachen, Germany
  • fYear
    2014
  • fDate
    9-12 Sept. 2014
  • Firstpage
    351
  • Lastpage
    360
  • Abstract
    Cluster batch systems usually support only static allocation of resources to applications before job start. After job start, applications cannot increase or decrease their resource set. However, some applications unpredictably evolve during execution and thus may require additional resources. If the extra resources cannot be delivered during runtime, those applications may have to run longer to finish, or are not even able to finish when their job's time slice expires. Likewise, a job may have to end without additional resources due to hardware limits being reached, such as the memory available to the compute node. To avoid such scenarios, users have to make large static allocations to account for a potential demand for resources. This leads to wastage of resources as they idle before they might actually be used at an unknown point. In this paper, we propose a batch system with dynamic allocation facilities to support on-the-fly resource allocation to unpredictably evolving jobs based on demand. We present a novel dynamic resource allocation strategy that also accounts for a fair assignment of resources between the usual rigid jobs and the evolving jobs. The results for a CFD production application and a mixed workload of rigid and evolving jobs (based on the widely used ESP benchmark) show that our system not only reduces the job waiting and job turnaround times, but also increases system utilization and system throughput.
  • Keywords
    batch processing (computers); resource allocation; scheduling; CFD production application; cluster batch systems; dynamic resource allocation strategy; fair scheduling; job turnaround time reduction; job waiting time reduction; on-the-fly resource allocation; static resource allocation; system throughput; system utilization; Dynamic scheduling; Heuristic algorithms; Program processors; Resource management; Servers; Torque; batch systems; dynamic resource management; dynamic scheduling;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel Processing (ICPP), 2014 43rd International Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2014.44
  • Filename
    6957244