Title :
A Batch System with Fair Scheduling for Evolving Applications
Author :
Prabhakaran, Suraj ; Iqbal, M. ; Rinke, Sebastian ; Windisch, Christian ; Wolf, Felix
Author_Institution :
German Res. Sch. for Simulation Sci., Aachen, Germany
Abstract :
Cluster batch systems usually support only static allocation of resources to applications before job start. After job start, applications cannot increase or decrease their resource set. However, some applications unpredictably evolve during execution and thus may require additional resources. If the extra resources cannot be delivered during runtime, those applications may have to run longer to finish, or are not even able to finish when their job´s time slice expires. Likewise, a job may have to end without additional resources due to hardware limits being reached, such as the memory available to the compute node. To avoid such scenarios, users have to make large static allocations to account for a potential demand for resources. This leads to wastage of resources as they idle before they might actually be used at an unknown point. In this paper, we propose a batch system with dynamic allocation facilities to support on-the-fly resource allocation to unpredictably evolving jobs based on demand. We present a novel dynamic resource allocation strategy that also accounts for a fair assignment of resources between the usual rigid jobs and the evolving jobs. The results for a CFD production application and a mixed workload of rigid and evolving jobs (based on the widely used ESP benchmark) show that our system not only reduces the job waiting and job turnaround times, but also increases system utilization and system throughput.
Keywords :
batch processing (computers); resource allocation; scheduling; CFD production application; cluster batch systems; dynamic resource allocation strategy; fair scheduling; job turnaround time reduction; job waiting time reduction; on-the-fly resource allocation; static resource allocation; system throughput; system utilization; Dynamic scheduling; Heuristic algorithms; Program processors; Resource management; Servers; Torque; batch systems; dynamic resource management; dynamic scheduling;
Conference_Titel :
Parallel Processing (ICPP), 2014 43rd International Conference on
Conference_Location :
Minneapolis MN
DOI :
10.1109/ICPP.2014.44