488 research outputs found
Note on the GI/GI/1 queue with LCFS-PR observed at arbitrary times
Consider the GI/GI/1 queue with the Last-Come First-Served Preemptive-Resume service discipline. We give intuitive explanations for (i) the geometric nature of the stationary queue length distribution and (ii) the mutual independence of the residual service requirements of the customers in the queue, both considered at arbitrary time points. These distributions have previously been established in the literature by either first considering the system at arrival instants or using balance equations. Our direct arguments provide further understanding of (i) and (ii)
Large deviations analysis for the queue in the Halfin-Whitt regime
We consider the FCFS queue in the Halfin-Whitt heavy traffic
regime. It is known that the normalized sequence of steady-state queue length
distributions is tight and converges weakly to a limiting random variable W.
However, those works only describe W implicitly as the invariant measure of a
complicated diffusion. Although it was proven by Gamarnik and Stolyar that the
tail of W is sub-Gaussian, the actual value of was left open. In subsequent work, Dai and He
conjectured an explicit form for this exponent, which was insensitive to the
higher moments of the service distribution.
We explicitly compute the true large deviations exponent for W when the
abandonment rate is less than the minimum service rate, the first such result
for non-Markovian queues with abandonments. Interestingly, our results resolve
the conjecture of Dai and He in the negative. Our main approach is to extend
the stochastic comparison framework of Gamarnik and Goldberg to the setting of
abandonments, requiring several novel and non-trivial contributions. Our
approach sheds light on several novel ways to think about multi-server queues
with abandonments in the Halfin-Whitt regime, which should hold in considerable
generality and provide new tools for analyzing these systems
Combined analysis of transient delay characteristics and delay autocorrelation function in the Geo(X)/G/1 queue
We perform a discrete-time analysis of customer delay in a buffer with batch arrivals. The delay of the kth customer that enters the FIFO buffer is characterized under the assumption that the numbers of arrivals per slot are independent and identically distributed. By using supplementary variables and generating functions, z-transforms of the transient delays are calculated. Numerical inversion of these transforms lead to results for the moments of the delay of the kth customer. For computational reasons k cannot be too large. Therefore, these numerical inversion results are complemented by explicit analytic expressions for the asymptotics for large k. We further show how the results allow us to characterize jitter-related variables, such as the autocorrelation of the delay in steady state
Many-server queues with customer abandonment
Customer call centers with hundreds of agents working in parallel are ubiquitous in many industries. These systems have a large amount of daily traffic that is stochastic in nature. It becomes more and more difficult to manage a call center because of its increasingly large scale and the stochastic variability in arrival and service processes. In call center operations, customer abandonment is a key factor and may significantly impact the system performance. It must be modeled explicitly in order for an operational model to be relevant for decision making.
In this thesis, a large-scale call center is modeled as a queue with many parallel servers. To model the customer abandonment, each customer is assigned a patience time. When his waiting time for service exceeds his patience time, a customer abandons the system without service. We develop analytical and numerical tools for analyzing such a queue.
We first study a sequence of G/G/n+GI queues, where the customer patience times are independent and identically distributed (iid) following a general distribution. The focus is the abandonment and the queue length processes. We prove that under certain conditions, a deterministic relationship holds asymptotically in diffusion scaling between these two stochastic processes, as the number of servers goes to infinity.
Next, we restrict the service time distribution to be a phase-type distribution with d phases. Using the aforementioned asymptotic relationship, we prove limit theorems for G/Ph/n+GI queues in the quality- and efficiency-driven (QED) regime. In particular, the limit process for the customer number in each phase is a d-dimensional piecewise Ornstein-Uhlenbeck (OU) process.
Motivated by the diffusion limit process, we propose two approximate models for a GI/Ph/n+GI queue. In each model, a d-dimensional diffusion process is used to approximate the dynamics of the queue. These two models differ in how the patience time distribution is built into them. The first diffusion model uses the patience time density at zero and the second one uses the entire patience time distribution. We also develop a numerical algorithm to analyze these diffusion models. The algorithm solves the stationary distribution of each model. The computed stationary distribution is used to estimate the queue's performance. A crucial part of this algorithm is to choose an appropriate reference density that controls the convergence of the algorithm. We develop a systematic approach to constructing a reference density. With the proposed reference density, the algorithm is shown to converge quickly in numerical experiments. These experiments also show that the diffusion models are good approximations of queues with a moderate to large number of servers.Ph.D.Committee Chair: Dai, Jiangang; Committee Member: Ayhan, Hayriye; Committee Member: Foley, Robert; Committee Member: Kleywegt, Anton; Committee Member: Tezcan, Tolg
Conditional limit theorems for regulated fractional Brownian motion
We consider a stationary fluid queue with fractional Brownian motion input.
Conditional on the workload at time zero being greater than a large value ,
we provide the limiting distribution for the amount of time that the workload
process spends above level over the busy cycle straddling the origin, as
. Our results can be interpreted as showing that long delays occur
in large clumps of size of order . The conditional limit result
involves a finer scaling of the queueing process than fluid analysis, thereby
departing from previous related literature.Comment: Published in at http://dx.doi.org/10.1214/09-AAP605 the Annals of
Applied Probability (http://www.imstat.org/aap/) by the Institute of
Mathematical Statistics (http://www.imstat.org
- …