SplitPlace: AI augmented splitting and placement of large-scale neural networks in mobile edge environments
In recent years, deep learning models have become ubiquitous in industry and academia alike. Deep neural networks can solve some of the most complex pattern-recognition problems today, but come at the price of massive compute and memory requirements. This makes deploying such large-scale neural networks challenging in resource-constrained mobile edge computing platforms, specifically in mission-critical domains like surveillance and healthcare. A promising solution is to split resource-hungry neural networks into lightweight, disjoint, smaller components for pipelined distributed processing. At present, there are two main approaches to do this: semantic and layer-wise splitting. The former partitions a neural network into parallel disjoint models that each produce a part of the result, whereas the latter partitions it into sequential models that produce intermediate results. However, there is no intelligent algorithm that decides which splitting strategy to use and places such modular splits on edge nodes for optimal performance. To address this, this work proposes a novel AI-driven online policy, SplitPlace, that uses Multi-Armed-Bandits to intelligently decide between layer and semantic splitting strategies based on the input task's service deadline demands. SplitPlace places such neural network split fragments on mobile edge devices using decision-aware reinforcement learning for efficient and scalable computing. Moreover, SplitPlace fine-tunes its placement engine to adapt to volatile environments. Our experiments on physical mobile-edge environments with real-world workloads show that SplitPlace can significantly improve the state-of-the-art in terms of average response time, deadline violation rate, inference accuracy, and total reward by up to 46, 69, 3 and 12 percent, respectively.
Lying Your Way to Better Traffic Engineering
To optimize the flow of traffic in IP networks, operators do traffic
engineering (TE), i.e., tune routing-protocol parameters in response to traffic
demands. TE in IP networks typically involves configuring static link weights
and splitting traffic between the resulting shortest-paths via the
Equal-Cost-MultiPath (ECMP) mechanism. Unfortunately, ECMP is a notoriously
cumbersome and indirect means for optimizing traffic flow, often leading to
poor network performance. Also, obtaining accurate knowledge of traffic demands
as the input to TE is elusive, and traffic conditions can be highly variable,
further complicating TE. We leverage recently proposed schemes for increasing
ECMP's expressiveness via carefully disseminated bogus information ("lies") to
design COYOTE, a readily deployable TE scheme for robust and efficient network
utilization. COYOTE leverages new algorithmic ideas to configure (static)
traffic splitting ratios that are optimized with respect to all (even
adversarially chosen) traffic scenarios within the operator's "uncertainty
bounds". Our experimental analyses show that COYOTE significantly outperforms
today's prevalent TE schemes in a manner that is robust to traffic uncertainty
and variation. We discuss experiments with a prototype implementation of
COYOTE.
An Improved Upper Bound for the Ring Loading Problem
The Ring Loading Problem emerged in the 1990s to model an important special
case of telecommunication networks (SONET rings) which gained attention from
practitioners and theorists alike. Given an undirected cycle on $n$ nodes
together with non-negative demands between any pair of nodes, the Ring Loading
Problem asks for an unsplittable routing of the demands such that the maximum
cumulated demand on any edge is minimized. Let $L$ be the value of such a
solution. In the relaxed version of the problem, each demand can be split into
two parts where the first part is routed clockwise while the second part is
routed counter-clockwise. Denote with $L^*$ the maximum load of a minimum split
routing solution. In a landmark paper, Schrijver, Seymour and Winkler [SSW98]
showed that $L \leq L^* + \frac{3}{2}D$, where $D$ is the maximum demand value. They
also found (implicitly) an instance of the Ring Loading Problem with
$L = L^* + \frac{101}{100}D$. Recently, Skutella [Sku16] improved these bounds by
showing that $L \leq L^* + \frac{19}{14}D$, and there exists an instance with
$L = L^* + \frac{11}{10}D$.
We contribute to this line of research by showing that $L \leq L^* + \frac{13}{10}D$. We
also take a first step towards lower and upper bounds for small instances.
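
To make the problem statement above concrete, the following is a minimal brute-force sketch of the unsplittable Ring Loading objective. It is not the method of the paper: it simply tries both directions for every demand, so it is exponential and only usable on tiny instances. The function names, the edge numbering (edge i joins node i and node (i + 1) mod n), and the (source, target, amount) demand format are assumptions made for illustration.

```python
from itertools import product


def edge_loads(n, demands, directions):
    """Return the load on each edge of an n-node cycle.

    Edge i joins node i and node (i + 1) % n (an assumed numbering).
    demands is a list of (source, target, amount) triples and
    directions[j] is True when demand j is routed clockwise.
    """
    loads = [0.0] * n
    for (s, t, amount), clockwise in zip(demands, directions):
        if not clockwise:
            # Going counter-clockwise from s to t uses exactly the edges
            # of the clockwise path from t to s.
            s, t = t, s
        i = s
        while i != t:
            loads[i] += amount
            i = (i + 1) % n
    return loads


def best_unsplittable_load(n, demands):
    """Minimum over all direction choices of the maximum edge load."""
    return min(
        max(edge_loads(n, demands, dirs))
        for dirs in product((True, False), repeat=len(demands))
    )


# Two unit demands between opposite nodes of a 4-cycle: whichever
# directions are chosen, some edge carries both demands.
print(best_unsplittable_load(4, [(0, 2, 1), (1, 3, 1)]))  # 2.0
```

In this small instance, splitting each demand evenly between the two directions would put a load of 1 on every edge, so the unsplittable optimum exceeds the split optimum by exactly the maximum demand value.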
Capacitated Vehicle Routing with Non-Uniform Speeds
The capacitated vehicle routing problem (CVRP) involves distributing
(identical) items from a depot to a set of demand locations, using a single
capacitated vehicle. We study a generalization of this problem to the setting
of multiple vehicles having non-uniform speeds (that we call Heterogenous
CVRP), and present a constant-factor approximation algorithm.
The technical heart of our result lies in achieving a constant approximation
to the following TSP variant (called Heterogenous TSP). Given a metric denoting
distances between vertices, a depot r containing k vehicles with possibly
different speeds, the goal is to find a tour for each vehicle (starting and
ending at r), so that every vertex is covered in some tour and the maximum
completion time is minimized. This problem is precisely Heterogenous CVRP when
vehicles are uncapacitated.
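
The objective described above can be written down in a few lines. The sketch below only evaluates a given solution (the maximum completion time over all vehicles); it does not implement the approximation algorithm. It assumes, purely for illustration, that the metric is Euclidean distance between points in the plane and that each tour is a list of vertex indices beginning with the depot; the function names are hypothetical.

```python
import math


def tour_length(points, tour):
    """Length of the closed tour visiting the vertices in order.

    points maps a vertex index to (x, y) coordinates; the Euclidean
    metric is an assumption made for this example.
    """
    total = 0.0
    # Pair each vertex with its successor, wrapping back to the start.
    for a, b in zip(tour, tour[1:] + tour[:1]):
        total += math.dist(points[a], points[b])
    return total


def max_completion_time(points, tours, speeds):
    """Maximum over vehicles of (tour length) / (vehicle speed)."""
    return max(tour_length(points, t) / s for t, s in zip(tours, speeds))


# Depot r is vertex 0; the faster vehicle takes the farther vertex.
points = [(0, 0), (3, 0), (0, 4)]
tours = [[0, 1], [0, 2]]   # one tour per vehicle, each starting at the depot
speeds = [1.0, 2.0]
print(max_completion_time(points, tours, speeds))  # 6.0
```

Here the slow vehicle needs 6 time units for its tour of length 6, while the fast vehicle needs 4 time units for a tour of length 8, so the maximum completion time is 6.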
The presence of non-uniform speeds introduces difficulties for employing
standard tour-splitting techniques. In order to get a better understanding of
this technique in our context, we appeal to ideas from the 2-approximation for
scheduling on parallel machines by Lenstra et al. This motivates the
introduction of a new approximate MST construction called Level-Prim, which is
related to Light Approximate Shortest-path Trees. The last component of our
algorithm involves partitioning the Level-Prim tree and matching the resulting
parts to vehicles. This decomposition is more subtle than usual since now we
need to enforce correlation between the size of the parts and their distances
to the depot.