Title :
Scheduling Parallel Tasks onto Opportunistically Available Cloud Resources
Author :
He, Ting ; Chen, Shiyao ; Kim, Hyoil ; Tong, Lang ; Lee, Kang-Won
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We consider the problem of opportunistically scheduling low-priority tasks onto underutilized computation resources in the cloud left by high-priority tasks. To avoid conflicts with high-priority tasks, the scheduler must suspend the low-priority tasks (causing waiting), or move them to other underutilized servers (causing migration), if the high-priority tasks resume. The goal of opportunistic scheduling is to schedule the low-priority tasks onto intermittently available server resources while minimizing the combined cost of waiting and migration. Moreover, we aim to support multiple parallel low-priority tasks with synchronization constraints. Under the assumption that servers´ availability to low-priority tasks can be modeled as ON/OFF Markov chains, we have shown that the optimal solution requires solving a Markov Decision Process (MDP) that has exponential complexity, and efficient solutions are known only in the case of homogeneously behaving servers. In this paper, we propose an efficient heuristic scheduling policy by formulating the problem as restless Multi-Armed Bandits (MAB) under relaxed synchronization. We prove the index ability of the problem and provide closed-form formulas to compute the indices. Our evaluation using real data center traces shows that the performance result closely matches the prediction by the Markov chain model, and the proposed index policy achieves consistently good performance under various server dynamics compared with the existing policies.
Keywords :
Markov processes; cloud computing; computational complexity; parallel processing; probability; scheduling; MAB; MDP; Markov decision process; ON-OFF Markov chains; cloud resources; data center traces; exponential complexity; heuristic scheduling policy; high-priority tasks; multiarmed bandits; multiple parallel low-priority tasks scheduling; opportunistic scheduling; parallel task scheduling; server dynamics; synchronization constraints; underutilized computation resources; Availability; Indexes; Markov processes; Optimal scheduling; Processor scheduling; Servers; Synchronization; Markov Decision Process; Multi-armed Bandits; Opportunistic scheduling; parallel task scheduling;
Conference_Titel :
Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4673-2892-0
DOI :
10.1109/CLOUD.2012.15