2,818 research outputs found
Network Utility Maximization under Maximum Delay Constraints and Throughput Requirements
We consider the problem of maximizing aggregate user utilities over a
multi-hop network, subject to link capacity constraints, maximum end-to-end
delay constraints, and user throughput requirements. A user's utility is a
concave function of the achieved throughput or the experienced maximum delay.
The problem is important for supporting real-time multimedia traffic, and is
uniquely challenging due to the need of simultaneously considering maximum
delay constraints and throughput requirements. We first show that it is
NP-complete either (i) to construct a feasible solution strictly meeting all
constraints, or (ii) to obtain an optimal solution after we relax maximum delay
constraints or throughput requirements up to constant ratios. We then develop a
polynomial-time approximation algorithm named PASS. The design of PASS
leverages a novel understanding between non-convex maximum-delay-aware problems
and their convex average-delay-aware counterparts, which can be of independent
interest and suggest a new avenue for solving maximum-delay-aware network
optimization problems. Under realistic conditions, PASS achieves constant or
problem-dependent approximation ratios, at the cost of violating maximum delay
constraints or throughput requirements by up to constant or problem-dependent
ratios. PASS is practically useful since the conditions for PASS are satisfied
in many popular application scenarios. We empirically evaluate PASS using
extensive simulations of supporting video-conferencing traffic across Amazon
EC2 datacenters. Compared to existing algorithms and a conceivable baseline,
PASS obtains up to improvement of utilities, by meeting the throughput
requirements but relaxing the maximum delay constraints that are acceptable for
practical video conferencing applications
Datacenter Traffic Control: Understanding Techniques and Trade-offs
Datacenters provide cost-effective and flexible access to scalable compute
and storage resources necessary for today's cloud computing needs. A typical
datacenter is made up of thousands of servers connected with a large network
and usually managed by one operator. To provide quality access to the variety
of applications and services hosted on datacenters and maximize performance, it
deems necessary to use datacenter networks effectively and efficiently.
Datacenter traffic is often a mix of several classes with different priorities
and requirements. This includes user-generated interactive traffic, traffic
with deadlines, and long-running traffic. To this end, custom transport
protocols and traffic management techniques have been developed to improve
datacenter network performance.
In this tutorial paper, we review the general architecture of datacenter
networks, various topologies proposed for them, their traffic properties,
general traffic control challenges in datacenters and general traffic control
objectives. The purpose of this paper is to bring out the important
characteristics of traffic control in datacenters and not to survey all
existing solutions (as it is virtually impossible due to massive body of
existing research). We hope to provide readers with a wide range of options and
factors while considering a variety of traffic control mechanisms. We discuss
various characteristics of datacenter traffic control including management
schemes, transmission control, traffic shaping, prioritization, load balancing,
multipathing, and traffic scheduling. Next, we point to several open challenges
as well as new and interesting networking paradigms. At the end of this paper,
we briefly review inter-datacenter networks that connect geographically
dispersed datacenters which have been receiving increasing attention recently
and pose interesting and novel research problems.Comment: Accepted for Publication in IEEE Communications Surveys and Tutorial
- …