Title :
Scheduling Jobs With Unknown Duration in Clouds
Author :
Maguluri, Siva Theja ; Srikant, R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana Champaign, Urbana, IL, USA
Abstract :
We consider a stochastic model of jobs arriving at a cloud data center. Each job requests a certain amount of CPU, memory, disk space, etc. Job sizes (durations) are also modeled as random variables, with possibly unbounded support. These jobs need to be scheduled nonpreemptively on servers. The jobs are first routed to one of the servers when they arrive and are queued at the servers. Each server then chooses a set of jobs from its queues so that it has enough resources to serve all of them simultaneously. This problem has been studied previously under the assumption that job sizes are known and upper-bounded, and an algorithm was proposed that stabilizes traffic load in a diminished capacity region. Here, we present a load balancing and scheduling algorithm that is throughput-optimal, without assuming that job sizes are known or are upper-bounded.
Keywords :
cloud computing; computer centres; queueing theory; resource allocation; scheduling; telecommunication network routing; telecommunication traffic; virtual machines; CPU; cloud data center; diminished capacity region; disk space; job arrival; job queue; job routing; known job sizes; load balancing; memory; nonpreemptively scheduled jobs; optimal throughput; random variables; stochastic model; traffic load stabilization; unknown job duration; upper-bounded job sizes; Markov processes; Routing; Schedules; Scheduling; Servers; Throughput; Vectors; Cloud computing; performance evaluation; queueing theory; resource allocation; scheduling;
Journal_Title :
Networking, IEEE/ACM Transactions on
DOI :
10.1109/TNET.2013.2288973