Search CORE

199 research outputs found

Scheduling for today’s computer systems: bridging theory and practice

Author: Wierman Adam
Publication venue
Publication date: 01/05/2007
Field of study

Scheduling is a fundamental technique for improving performance in computer systems. From web servers to routers to operating systems, how the bottleneck device is scheduled has an enormous impact on the performance of the system as a whole. Given the immense literature studying scheduling, it is easy to think that we already understand enough about scheduling. But, modern computer system designs have highlighted a number of disconnects between traditional analytic results and the needs of system designers. In particular, the idealized policies, metrics, and models used by analytic researchers do not match the policies, metrics, and scenarios that appear in real systems. The goal of this thesis is to take a step towards modernizing the theory of scheduling in order to provide results that apply to today’s computer systems, and thus ease the burden on system designers. To accomplish this goal, we provide new results that help to bridge each of the disconnects mentioned above. We will move beyond the study of idealized policies by introducing a new analytic framework where the focus is on scheduling heuristics and techniques rather than individual policies. By moving beyond the study of individual policies, our results apply to the complex hybrid policies that are often used in practice. For example, our results enable designers to understand how the policies that favor small job sizes are affected by the fact that real systems only have estimates of job sizes. In addition, we move beyond the study of mean response time and provide results characterizing the distribution of response time and the fairness of scheduling policies. These results allow us to understand how scheduling affects QoS guarantees and whether favoring small job sizes results in large job sizes being treated unfairly. Finally, we move beyond the simplified models traditionally used in scheduling research and provide results characterizing the effectiveness of scheduling in multiserver systems and when users are interactive. These results allow us to answer questions about the how to design multiserver systems and how to choose a workload generator when evaluating new scheduling designs

Caltech Authors

Performance of the Gittins Policy in the G/G/1 and G/G/k, With and Without Setup Times

Author: Hong Yige
Scully Ziv
Publication venue
Publication date: 25/04/2023
Field of study

How should we schedule jobs to minimize mean queue length? In the preemptive M/G/1 queue, we know the optimal policy is the Gittins policy, which uses any available information about jobs' remaining service times to dynamically prioritize jobs. For models more complex than the M/G/1, optimal scheduling is generally intractable. This leads us to ask: beyond the M/G/1, does Gittins still perform well? Recent results indicate that Gittins performs well in the M/G/k, meaning that its additive suboptimality gap is bounded by an expression which is negligible in heavy traffic. But allowing multiple servers is just one way to extend the M/G/1, and most other extensions remain open. Does Gittins still perform well with non-Poisson arrival processes? Or if servers require setup times when transitioning from idle to busy? In this paper, we give the first analysis of the Gittins policy that can handle any combination of (a) multiple servers, (b) non-Poisson arrivals, and (c) setup times. Our results thus cover the G/G/1 and G/G/k, with and without setup times, bounding Gittins's suboptimality gap in each case. Each of (a), (b), and (c) adds a term to our bound, but all the terms are negligible in heavy traffic, thus implying Gittins's heavy-traffic optimality in all the systems we consider. Another consequence of our results is that Gittins is optimal in the M/G/1 with setup times at all loads.Comment: 41 page

arXiv.org e-Print Archive

Hospitalization admission control of emergency patients using markovian decision processes and discrete event simulation

Author: Augusto Vincent
Prodel Martin
Xie Xiaolan
Publication venue: IEEE Press Piscataway, NJ, USA ©2014
Publication date: 01/12/2014
Field of study

International audienceThis paper addresses the hospitalization admission control policies of patients from an emergency department that should be admitted shortly or transferred. When an emergency patient arrives, depending on his/her health condition, a physician may decide to hospitalize him/her in a specific department. Patient admission depends on the availability of beds, the length of stay (LOS) and the reward of hospitalization which are both patient-class specific. The problem consists in determining patient admission policies in order to maximize the overall gain. We first propose a Markov Decision Process (MDP) Model for determination of the optimal patient admission policy under some restrictive and necessary assumptions such as exponentially distributed LOS. A simulation model is then built to assess MDP admission policies under realistic conditions. We show that MDP policies significantly improve the overall gain for different types of facilities

Analysis of priority queues with session-based arrival streams

Author: Bruneel Herwig
Walraevens Joris
Wittevrongel Sabine
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

In this paper, we analyze a discrete-time priority queue with session-based arrivals. We consider a user population, where each user can start and end sessions. Sessions belong to one of two classes and generate a variable number of fixed-length packets which arrive to the queue at the rate of one packet per slot. The lengths of the sessions are generally distributed. Packets of the first class have transmission priority over the packets of the other class. The model is motivated by a web server handling delay-sensitive and delay-insensitive content. By using probability generating functions, some performance measures of the queue such as the moments of the packet delays of both classes are calculated. The impact of the priority scheduling discipline and of the session nature of the arrival process is shown by some numerical examples