2,514 research outputs found
Simple and explicit bounds for multi-server queues with (and sometimes better) scaling
We consider the FCFS queue, and prove the first simple and explicit
bounds that scale as (and sometimes better). Here
denotes the corresponding traffic intensity. Conceptually, our results can be
viewed as a multi-server analogue of Kingman's bound. Our main results are
bounds for the tail of the steady-state queue length and the steady-state
probability of delay. The strength of our bounds (e.g. in the form of tail
decay rate) is a function of how many moments of the inter-arrival and service
distributions are assumed finite. More formally, suppose that the inter-arrival
and service times (distributed as random variables and respectively)
have finite th moment for some Let (respectively )
denote (respectively ). Then
our bounds (also for higher moments) are simple and explicit functions of
, and
only. Our bounds scale gracefully even when the number of
servers grows large and the traffic intensity converges to unity
simultaneously, as in the Halfin-Whitt scaling regime. Some of our bounds scale
better than in certain asymptotic regimes. More precisely,
they scale as multiplied by an inverse polynomial in These results formalize the intuition that bounds should be tighter
in light traffic as well as certain heavy-traffic regimes (e.g. with
fixed and large). In these same asymptotic regimes we also prove bounds for
the tail of the steady-state number in service.
Our main proofs proceed by explicitly analyzing the bounding process which
arises in the stochastic comparison bounds of amarnik and Goldberg for
multi-server queues. Along the way we derive several novel results for suprema
of random walks and pooled renewal processes which may be of independent
interest. We also prove several additional bounds using drift arguments (which
have much smaller pre-factors), and make several conjectures which would imply
further related bounds and generalizations
Stochastic Analysis of Power-Aware Scheduling
Energy consumption in a computer system can be reduced by dynamic speed scaling, which adapts the processing speed to the current load. This paper studies the optimal way to adjust speed to balance mean response time and mean energy consumption, when jobs arrive as a Poisson process and processor sharing scheduling is used. Both bounds and asymptotics for the optimal speeds are provided. Interestingly, a simple scheme that halts when the system is idle and uses a static rate while the system is busy provides nearly the same performance as the optimal dynamic speed scaling. However, dynamic speed scaling which allocates a higher speed when more jobs are present significantly improves robustness to bursty traffic and mis-estimation of workload parameters
Power-Aware Speed Scaling in Processor Sharing Systems
Energy use of computer communication systems has quickly become a vital design consideration. One effective method for reducing energy consumption is dynamic speed scaling, which adapts the processing speed to the current load. This paper studies how to optimally scale speed to balance mean response time and mean energy consumption under processor sharing scheduling. Both bounds and asymptotics for the optimal speed scaling scheme are provided. These results show that a simple scheme that halts when the system is idle and uses a static rate while the system is busy provides nearly the same performance as the optimal dynamic speed scaling. However, the results also highlight that dynamic speed scaling provides at least one key benefit - significantly improved robustness to bursty traffic and mis-estimation of workload parameters
Delay Performance and Mixing Times in Random-Access Networks
We explore the achievable delay performance in wireless random-access
networks. While relatively simple and inherently distributed in nature,
suitably designed queue-based random-access schemes provide the striking
capability to match the optimal throughput performance of centralized
scheduling mechanisms in a wide range of scenarios. The specific type of
activation rules for which throughput optimality has been established, may
however yield excessive queues and delays.
Motivated by that issue, we examine whether the poor delay performance is
inherent to the basic operation of these schemes, or caused by the specific
kind of activation rules. We derive delay lower bounds for queue-based
activation rules, which offer fundamental insight in the cause of the excessive
delays. For fixed activation rates we obtain lower bounds indicating that
delays and mixing times can grow dramatically with the load in certain
topologies as well
- …