Title :
Throughput Enhancement through Selective Time Sharing and Dynamic Grouping
Author :
Junliang Chen ; Bing Bing Zhou ; Chen Wang ; Peng Lu ; Penghao Wang ; Zomaya, Albert Y.
Author_Institution :
Centre for Distrib. & High Performance Comput., Univ. of Sydney, Sydney, NSW, Australia
Abstract :
Space sharing approaches are widely used in job scheduling for HPC systems. The main drawback of these approaches is the blocking of short jobs, which results in low throughput. The research on gang scheduling has shown the potential of time sharing in improving throughput. However, traditional gang scheduling adds jobs for time sharing without selection, which may cause a higher performance degradation of existing running jobs than the performance gain of waiting jobs. Moreover, gang scheduling often adopts a contiguous buddy allocation scheme which has problems of fragmentation and low resource utilization. We design a selective time sharing technique that allows waiting jobs to be co-scheduled with existing running jobs only if the overall throughput can be improved. To alleviate the fragmentation problem, we present a dynamic grouping resource allocation mechanism that relaxes the contiguous allocation requirement imposed on gang scheduling. By integrating these techniques, our new job co-scheduling algorithm is able to simultaneously take system throughput and resource utilization into consideration. The experimental results demonstrate that our approach significantly outperforms both EASY backfilling and traditional gang scheduling in terms of both average turnaround time and bounded slowdown.
Keywords :
parallel processing; resource allocation; scheduling; HPC system; contiguous buddy allocation scheme; dynamic grouping resource allocation mechanism; fragmentation problem; gang scheduling; job scheduling; resource utilization; selective time sharing technique; space sharing approach; throughput enhancement; waiting jobs coscheduling; Australia; Dynamic scheduling; Processor scheduling; Resource management; Runtime; Throughput; job scheduling; resource allocation; time sharing;
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-6066-1
DOI :
10.1109/IPDPS.2013.34