61 research outputs found
Analysis of join-the-shortest-queue routing for web server farms
Join the Shortest Queue (JSQ) is a popular routing policy for server farms. However, until now all analysis of JSQ has been limited to First-Come-First-Serve (FCFS) server farms, whereas it is known that web server farms are better modeled as Processor Sharing (PS) server farms. We provide the first approximate analysis of JSQ in the PS server farm model for general job-size distributions, obtaining the distribution of queue length at each queue. To do this, we approximate the queue length of each queue in the server farm by a one-dimensional Markov chain, in a novel fashion. We also discover some interesting insensitivity properties of PS server farms with JSQ routing, and discuss the near-optimality of JSQ
On the Benefit of Information Centric Networks for Traffic Engineering
Current Internet performs traffic engineering (TE) by estimating traffic
matrices on a regular schedule, and allocating flows based upon weights
computed from these matrices. This means the allocation is based upon a guess
of the traffic in the network based on its history. Information-Centric
Networks on the other hand provide a finer-grained description of the traffic:
a content between a client and a server is uniquely identified by its name, and
the network can therefore learn the size of different content items, and
perform traffic engineering and resource allocation accordingly. We claim that
Information-Centric Networks can therefore provide a better handle to perform
traffic engineering, resulting in significant performance gain.
We present a mechanism to perform such resource allocation. We see that our
traffic engineering method only requires knowledge of the flow size (which, in
ICN, can be learned from previous data transfers) and outperforms a min-MLU
allocation in terms of response time. We also see that our method identifies
the traffic allocation patterns similar to that of min-MLU without having
access to the traffic matrix ahead of time. We show a very significant gain in
response time where min MLU is almost 50% slower than our ICN-based TE method
Approximate performance analysis of generalized join the shortest queue routing
In this paper we propose a highly accurate approximate performance analysis
of a heterogeneous server system with a processor sharing service discipline
and a general job-size distribution under a generalized join the shortest queue
(GJSQ) routing protocol. The GJSQ routing protocol is a natural extension of
the well-known join the shortest queue routing policy that takes into account
the non-identical service rates in addition to the number of jobs at each
server. The performance metrics that are of interest here are the equilibrium
distribution and the mean and standard deviation of the number of jobs at each
server. We show that the latter metrics are near-insensitive to the job-size
distribution using simulation experiments. By applying a single queue
approximation we model each server as a single server queue with a
state-dependent arrival process, independent of other servers in the system,
and derive the distribution of the number of jobs at the server. These
state-dependent arrival rates are intended to capture the inherent correlation
between servers in the original system and behave in a rather atypical way.Comment: 16 pages, 5 figures -- version 2 incorporates minor textual change
QuLa: service selection and forwarding table population in service-centric networking using real-life topologies
The amount of services located in the network has drastically increased over the last decade which is why more and more datacenters are located at the network edge, closer to the users. In the current Internet it is up to the client to select a destination using a resolution service (Domain Name System, Content Delivery Networks ...). In the last few years, research on Information-Centric Networking (ICN) suggests to put this selection responsibility at the network components; routers find the closest copy of a content object using the content name as input.
We extend the principle of ICN to services; service routers forward requests to service instances located in datacenters spread across the network edge. To solve this problem, we first present a service selection algorithm based on both server and network metrics. Next, we describe a method to reduce the state required in service routers while minimizing the performance loss caused by this data reduction. Simulation results based on real-life networks show that we are able to find a near-optimal load distribution with only minimal state required in the service routers
Universality of Load Balancing Schemes on Diffusion Scale
We consider a system of parallel queues with identical exponential
service rates and a single dispatcher where tasks arrive as a Poisson process.
When a task arrives, the dispatcher always assigns it to an idle server, if
there is any, and to a server with the shortest queue among randomly
selected servers otherwise . This load balancing scheme
subsumes the so-called Join-the-Idle Queue (JIQ) policy and the
celebrated Join-the-Shortest Queue (JSQ) policy as two crucial
special cases. We develop a stochastic coupling construction to obtain the
diffusion limit of the queue process in the Halfin-Whitt heavy-traffic regime,
and establish that it does not depend on the value of , implying that
assigning tasks to idle servers is sufficient for diffusion level optimality
- …