RSRG Seminar
Cloud computing is emerging as an important platform for business, personal and mobile computing applications. We study a stochastic model of cloud computing, where jobs arrive according to a stochastic process and request resources like CPU, memory and storage space. Job sizes (durations) are also modeled as random variables, with possibly unbounded support. These jobs need to be scheduled non preemptively on servers. The jobs are first routed to one of the servers when they arrive and are queued at the servers. Each server then chooses a set of jobs from its queues so that it has enough resources to serve all of them simultaneously. We present a load balancing and scheduling algorithm that is throughput optimal and delay optimal in the heavy traffic limit.