Title :
Production job scheduling for parallel shared memory systems
Author :
Chiang, Su-Hui ; Vernon, Mary K.
Author_Institution :
Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA
Abstract :
This paper addresses open job scheduling questions for the challenge workloads that run on the large scale parallel systems at supercomputer centers. Simulation results for six recent one-month job traces from the NCSA Origin 2000 (O2K) system are used to evaluate (I) the experimentally tuned NCSA LSF* policy, (2) the FCFS-backfill policy, (3) the Priority-backfill policy with alternative priority functions and with limited pre-emption to provide immediate service to each arriving job, and (4) the spatial equipartitioning (EQspatial) policy with an optional modification to reduce the maximum waiting time for the largest jobs in the challenge workloads. Measurements on the O2K validate the simulation results for two of the policies. The priority-backfill policy with immediate service and a starvation-free priority measure that favors short jobs is shown to be the most promising if jobs cannot adapt to changing processor allocations at runtime, but EQspatial provides significantly better 95th percentile waiting time
Keywords :
parallel architectures; processor scheduling; resource allocation; shared memory systems; challenge workloads; job scheduling; parallel shared memory systems; priority-backfill policy; spatial equipartitioning; supercomputer centers; waiting time; Computational modeling; Concurrent computing; High performance computing; Job production systems; Laboratories; Large-scale systems; Processor scheduling; Runtime; Supercomputers; Time measurement;
Conference_Titel :
Parallel and Distributed Processing Symposium., Proceedings 15th International
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7695-0990-8
DOI :
10.1109/IPDPS.2001.924987