2,977 research outputs found

    Dynamic fluid-based scheduling in a multi-class abandonment queue

    Get PDF
    International audienceWe investigate how to share a common resource among multiple classes of customers in the presence of abandonments. We consider two different models: (1) customers can abandon both while waiting in the queue and while being served, (2) only customers that are in the queue can abandon. Given the complexity of the stochastic optimization problem we propose a fluid model as a deterministic approximation. For the overload case we directly obtain that the c˜”/Ξ rule is optimal. For the underload case we use Pontryagin’s Maximum Principle to obtain the optimal solution for two classes of customers; there exists a switching curve that splits the two-dimensional state-space into two regions such that when the number of customers in both classes is sufficiently small the optimal policy follows the c˜”-rule and when the number of customers is sufficiently large the optimal policy follows the c˜”/Ξ-rule. The same structure is observed in the optimal policy of the stochastic model for an arbitrary number of classes. Based on this we develop a heuristic and by numerical experiments we evaluate its performance and compare it to several index policies. We observe that the suboptimality gap of our solution is small

    Developing effective service policies for multiclass queues with abandonment:asymptotic optimality and approximate policy improvement

    Get PDF
    We study a single server queuing model with multiple classes and impatient customers. The goal is to determine a service policy to maximize the long-run reward rate earned from serving customers net of holding costs and penalties respectively due to customers waiting for and leaving before receiving service. We first show that it is without loss of generality to study a pure-reward model. Since standard methods can usually only compute the optimal policy for problems with up to three customer classes, our focus is to develop a suite of heuristic approaches, with a preference for operationally simple policies with good reward characteristics. One such heuristic is the RΌΞ rule—a priority policy that ranks all customer classes based on the product of reward R, service rate ÎŒ, and abandonment rate Ξ. We show that the RΌΞ rule is asymptotically optimal as customer abandonment rates approach zero and often performs well in cases where the simpler RÎŒ rule performs poorly. The paper also develops an approximate policy improvement method that uses simulation and interpolation to estimate the bias function for use in a dynamic programming recursion. For systems with two or three customer classes, our numerical study indicates that the best of our simple priority policies is near optimal in most cases; when it is not, the approximate policy improvement method invariably tightens up the gap substantially. For systems with five customer classes, our heuristics typically achieve within 4% of an upper bound for the optimal value, which is computed via a linear program that relies on a relaxation of the original system. The computational requirement of the approximate policy improvement method grows rapidly when the number of customer classes or the traffic intensity increases

    Asymptotically optimal index policies for an abandonment queue with convex holding cost.

    Get PDF
    International audienceWe investigate a resource allocation problem in a multi-class server with convex holding costs and user impatience under the average cost criterion. In general, the optimal policy has a complex dependency on all the input parameters and state information. Our main contribution is to derive index policies that can serve as heuristics and are shown to give good performance. Our index policy attributes to each class an index, which depends on the number of customers currently present in that class. The index values are obtained by solving a relaxed version of the optimal stochastic control problem and combining results from restless multi-armed bandits and queueing theory. They can be expressed as a function of the steady-state distribution probabilities of a one-dimensional birth-and-death process. For linear holding cost, the index can be calculated in closed-form and turns out to be independent of the arrival rates and the number of customers present. In the case of no abandonments and linear holding cost, our index coincides with the cÎŒc\mu-rule, which is known to be optimal in this simple setting. For general convex holding cost we derive properties of the index value in limiting regimes: we consider the behavior of the index (i) as the number of customers in a class grows large, which allows us to derive the asymptotic structure of the index policies, (ii) as the abandonment rate vanishes, which allows us to retrieve an index policy proposed for the multi-class M/M/1 queue with convex holding cost and no abandonments, and (iii) as the arrival rate goes to either 0 or ∞\infty, representing light-traffic and heavy-traffic regimes, respectively. We show that Whittle's index policy is asymptotically optimal in both light-traffic and heavy-traffic regimes. To obtain further insights into the index policy, we consider the fluid version of the relaxed problem and derive a closed-form expression for the fluid index. The latter is shown to coincide with the index values for the stochastic model in asymptotic regimes. For arbitrary convex holding cost the fluid index can be seen as the GcÎŒ/ΞGc\mu/\theta-rule, that is, including abandonments into the generalized cÎŒc\mu-rule (GcÎŒGc\mu-rule). Numerical experiments for a wide range of parameters have shown that the Whittle index policy and the fluid index policy perform very well for a broad range of parameters

    Stochastic and fluid index policies for resource allocation problems

    Get PDF
    We develop a unifying framework to obtain efficient index policies for restless multi-armed bandit problems with birth-and-death state evolution. This is a broad class of stochastic resource allocation problems whose objective is to determine efficient policies to share resources among competing projects. In a seminal work, Whittle developed a methodology to derive well-performing (Whittle’s) index policies that are obtained by solving a relaxed version of the original problem. Our first main contribution is the derivation of a closed-form expression for Whittle’s index as a function of the steady-state probabilities. It can be efficiently calculated, however, it requires several technical conditions to be verified, and in addition, it does not provide qualitative insights into Whittle’s index. We therefore formulate a fluid version of the relaxed optimization problem and in our second main contribution we develop a fluid index policy. The latter does provide qualitative insights and is close to Whittle’s index. The applicability of our approach is illustrated by two important problems: optimal class selection and optimal load balancing. Allowing state-dependent capacities we can model important phenomena: e.g. power-aware server-farms and opportunistic scheduling in wireless systems. Numerical simulations show that Whittle’s index and our fluid index policy are both nearly optima

    Statistical Analysis of a Telephone Call Center: A Queueing-Science Perspective

    Get PDF
    A call center is a service network in which agents provide telephone-based services. Customers that seek these services are delayed in tele-queues. This paper summarizes an analysis of a unique record of call center operations. The data comprise a complete operational history of a small banking call center, call by call, over a full year. Taking the perspective of queueing theory, we decompose the service process into three fundamental components: arrivals, customer abandonment behavior and service durations. Each component involves different basic mathematical structures and requires a different style of statistical analysis. Some of the key empirical results are sketched, along with descriptions of the varied techniques required. Several statistical techniques are developed for analysis of the basic components. One of these is a test that a point process is a Poisson process. Another involves estimation of the mean function in a nonparametric regression with lognormal errors. A new graphical technique is introduced for nonparametric hazard rate estimation with censored data. Models are developed and implemented for forecasting of Poisson arrival rates. We then survey how the characteristics deduced from the statistical analyses form the building blocks for theoretically interesting and practically useful mathematical models for call center operations. Key Words: call centers, queueing theory, lognormal distribution, inhomogeneous Poisson process, censored data, human patience, prediction of Poisson rates, Khintchine-Pollaczek formula, service times, arrival rate, abandonment rate, multiserver queues.

    Resource allocation with observable and unobservable environments

    Get PDF
    Cette thĂšse Ă©tudie les problĂšmes d'allocation des ressources dans les rĂ©seaux stochastiques Ă  grande Ă©chelle dans lesquels les paramĂštres fluctuent dans le temps. Nous supposons que l'Ă©tat du systĂšme est formĂ© de deux processus, une partie contrĂŽlable dont l'Ă©volution dĂ©pend de l'action du dĂ©cideur et la partie environnement dont l'Ă©volution est exogĂšne. L'Ă©volution stochastique du processus contrĂŽlable dĂ©pend de l'Ă©tat actuel de l'environnement. Selon que le dĂ©cideur observe l'Ă©tat de l'environnement, nous disons que l'environnement est observable ou non observable. La thĂšse suit trois axes de recherche principaux. Dans le premier problĂšme, nous Ă©tudions le contrĂŽle optimal d'un problĂšme de bandit agitĂ© multi-bras MARBP avec un environnement inobservable. L'objectif est de caractĂ©riser la politique optimale de maĂźtrise du processus contrĂŽlable malgrĂ© le fait que l'environnement ne peut pas ĂȘtre observĂ©. Nous considĂ©rons le rĂ©gime asymptotique Ă  grande Ă©chelle dans lequel le nombre de bandits et la vitesse de l'environnement tendent tous deux Ă  l'infini. Dans notre rĂ©sultat principal, nous Ă©tablissons qu'un ensemble de politiques prioritaires est asymptotiquement optimal. Nous montrons que cet ensemble comprend notamment l'indice de Whittle d'un systĂšme dont les paramĂštres sont moyennĂ©s sur le comportement stationnaire de l'environnement. Dans le second problĂšme, nous considĂ©rons un MARBP avec un environnement observable. L'objectif est de tirer parti des informations sur l'environnement pour dĂ©river une politique optimale pour le processus contrĂŽlable. En supposant que la condition technique d'indexabilitĂ© soit vĂ©rifiĂ©e, nous dĂ©veloppons un algorithme pour calculer numĂ©riquement l'indice de Whittle. Nous appliquons ensuite ce rĂ©sultat au cas particulier d'une file d'attente avec abandon. Nous Ă©tablissons une indexabilitĂ©, et nous obtenons des caractĂ©risations de l'indice de Whittle sous forme fermĂ©e. Dans le troisiĂšme problĂšme, nous considĂ©rons un modĂšle d'allocation de fichiers dans un grand systĂšme de stockage, oĂč il y a des fichiers rĂ©partis sur un ensemble de nƓuds. Chaque nƓud tombe en panne selon une loi qui dĂ©pend de la charge qu'il gĂšre. Chaque fois qu'un nƓud tombe en panne, tous les fichiers qu'il possĂ©dait sont rĂ©allouĂ©s selon une stratĂ©gie d'allocation fixe, et le nƓud redĂ©marre son travail en Ă©tant vide. Nous Ă©tudions l'Ă©volution de la charge d'un nƓud dans le rĂ©gime de champ moyen, lorsque le nombre de fichiers et le nombre de nƓuds deviennent importants. Nous prouvons l'existence et l'unicitĂ© de la mesure de probabilitĂ© stationnaire du processus, et la convergence dans la distribution de cette mesure.This thesis studies resource allocation problems in large-scale stochastic networks. We work on problems where the availability of resources is subject to time fluctuations, a situation that one may encounter, for example, in load balancing systems or in wireless downlink scheduling systems. The time fluctuations are modelled considering two types of processes, controllable processes, whose evolution depends on the action of the decision maker, and environment processes, whose evolution is exogenous. The stochastic evolution of the controllable process depends on the the current state of the environment. Depending on whether the decision maker observes the state of the environment, we say that the environment is observable or unobservable. The mathematical formulation used is the Markov Decision Processes (MDPs). The thesis follows three main research axes. In the first problem we study the optimal control of a Multi-armed restless bandit problem (MARBP) with an unobservable environment. The objective is to characterise the optimal policy for the controllable process in spite of the fact that the environment cannot be observed. We consider the large-scale asymptotic regime in which the number of bandits and the speed of the environment both tend to infinity. In our main result we establish that a set of priority policies is asymptotically optimal. We show that, in particular, this set includes Whittle index policy of a system whose parameters are averaged over the stationary behaviour of the environment. In the second problem, we consider an MARBP with an observable environment. The objective is to leverage information on the environment to derive an optimal policy for the controllable process. Assuming that the technical condition of indexability holds, we develop an algorithm to compute Whittle's index. We then apply this result to the particular case of a queue with abandonments. We prove indexability, and we provide closed-form expressions of Whittle's index. In the third problem we consider a model of a large-scale storage system, where there are files distributed across a set of nodes. Each node breaks down following a law that depends on the load it handles. Whenever a node breaks down, all the files it had are reallocated to other nodes. We study the evolution of the load of a single node in the mean-field regime, when the number of nodes and files grow large. We prove the existence of the process in the mean-field regime. We further show the convergence in distribution of the load in steady state as the average number of files per node tends to infinity

    Dynamic control of stochastic and fluid resource-sharing systems

    Get PDF
    In this thesis we study the dynamic control of resource-sharing systems that arise in various domains: e.g. inventory management, healthcare and communication networks. We aim at efficiently allocating the available resources among competing projects according to a certain performance criteria. These type of problems have a stochastic nature and may be very complex to solve. We therefore focus on developing well-performing heuristics. In Part I, we consider the framework of Restless Bandit Problems, which is a general class of dynamic stochastic optimization problems. Relaxing the sample-path constraint in the optimization problem enables to define an index-based heuristic for the original constrained model, the so-called Whittle index policy. We derive a closed-form expression for the Whittle index as a function of the steady-state probabilities for the case in which bandits (projects) evolve in a birth-and-death fashion. This expression requires several technical conditions to be verified, and in addition, it can only be computed explicitly in specific cases. In the particular case of a multi-class abandonment queue, we further prove that the Whittle index policy is asymptotically optimal in the light-traffic and heavy-traffic regimes. In Part II, we derive heuristics by approximating the stochastic resource-sharing systems with deterministic fluid models. We first formulate a fluid version of the relaxed optimization problem introduced in Part I, and we develop a fluid index policy. The fluid index can always be computed explicitly and hence overcomes the technical issues that arise when calculating the Whittle index. We apply the Whittle index and the fluid index policies to several systems: e.g. power-aware server-farms, opportunistic scheduling in wireless systems, and make-to-stock problems with perishable items. We show numerically that both index policies are nearly optimal. Secondly, we study the optimal scheduling control for the fluid version of a multi-class abandonment queue. We derive the fluid optimal control when there are two classes of customers competing for a single resource. Based on the insights provided by this result we build a heuristic for the general multi-class setting. This heuristic shows near-optimal performance when applied to the original stochastic model for high workloads. In Part III, we further investigate the abandonment phenomena in the context of a content delivery problem. We characterize an optimal grouping policy so that requests, which are impatient, are efficiently transmitted in a multi-cast mode
    • 

    corecore