Sparse Hopsets in Congested Clique
We give the first Congested Clique algorithm that computes a sparse hopset
with polylogarithmic hopbound in polylogarithmic time. Given a graph $G=(V,E)$,
a $(\beta,\epsilon)$-hopset $H$ with "hopbound" $\beta$ is a set of edges
added to $G$ such that for any pair of nodes $u$ and $v$ in $G$ there is a path
with at most $\beta$ hops in $G \cup H$ with length within $(1+\epsilon)$ of
the shortest path between $u$ and $v$ in $G$.
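The definition above can be checked by brute force on small inputs. The following is a minimal sketch, assuming the standard notation of a graph G, a hopset H, a hopbound beta and a stretch of 1 + eps; the function names and the edge-list representation are illustrative assumptions and are not taken from the paper. It uses a hop-limited Bellman-Ford pass, not the Congested Clique algorithm the abstract describes.

```python
import math

def hop_limited_distances(n, edges, source, max_hops):
    """Shortest distances from `source` using at most `max_hops` edges.

    `edges` is a list of undirected weighted edges (u, v, w) on vertices 0..n-1.
    """
    dist = [math.inf] * n
    dist[source] = 0.0
    for _ in range(max_hops):
        # Relax against the previous round only, so each round adds one hop.
        new = dist[:]
        for u, v, w in edges:
            if dist[u] + w < new[v]:
                new[v] = dist[u] + w
            if dist[v] + w < new[u]:
                new[u] = dist[v] + w
        dist = new
    return dist

def is_hopset(n, edges, hopset, beta, eps):
    """Check whether `hopset` is a (beta, eps)-hopset for the graph (n, edges)."""
    for s in range(n):
        # n - 1 hops suffice for exact distances in the original graph.
        exact = hop_limited_distances(n, edges, s, n - 1)
        approx = hop_limited_distances(n, edges + hopset, s, beta)
        for t in range(n):
            if approx[t] > (1 + eps) * exact[t] + 1e-9:
                return False
    return True

# Hypothetical example: a unit-weight path 0-1-2-3 with two shortcut edges.
PATH = [(0, 1, 1), (1, 2, 1), (2, 3, 1)]
SHORTCUTS = [(0, 2, 2), (1, 3, 2)]
```

On this toy path, the two shortcuts make every pair reachable within two hops at exact distance, while the path alone needs three hops between its endpoints.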
Our hopsets are significantly sparser than the recent construction of
Censor-Hillel et al. [6], that constructs a hopset of size
, but with a smaller polylogarithmic hopbound. On the other
hand, the previously known constructions of sparse hopsets with polylogarithmic
hopbound in the Congested Clique model, proposed by Elkin and Neiman
[10],[11],[12], all require polynomial rounds.
One tool that we use is an efficient algorithm that constructs an
-limited neighborhood cover, that may be of independent interest.
Finally, as a side result, we also give a hopset construction in a variant of
the low-memory Massively Parallel Computation model, with improved running time
over existing algorithms.
Massively Parallel Approximate Distance Sketches
Data structures that allow efficient distance estimation (distance oracles, distance sketches, etc.) have been studied extensively, particularly in centralized models and classical distributed models such as CONGEST. We initiate their study in newer (and arguably more realistic) models of distributed computation: the Congested Clique model and the Massively Parallel Computation (MPC) model. We provide efficient constructions in both of these models, but our core results are for MPC. In MPC we give two main results: an algorithm that constructs stretch/space optimal distance sketches but takes a (small) polynomial number of rounds, and an algorithm that constructs distance sketches with worse stretch but takes only polylogarithmic rounds.
Along the way, we show that other useful combinatorial structures can also be computed in MPC. In particular, one key component we use to construct distance sketches is an MPC construction of the hopsets of [Elkin and Neiman, 2016]. This result has additional applications, such as the first polylogarithmic-time algorithm for constant-approximate single-source shortest paths for weighted graphs in the low-memory MPC setting.
Distributed, Parallel and Dynamic Distance Structures
Many fundamental computational tasks can be modeled by distances on a graph. This has inspired studying various structures that preserve approximate distances, but trade off this approximation factor with size, running time, or the number of hops on the approximate shortest paths.
Our focus is on three important objects involving preservation of graph distances: hopsets, in which our goal is to ensure that small-hop paths also provide approximate shortest paths; distance oracles, in which we build a small data structure that supports efficient distance queries; and spanners, in which we find a sparse subgraph that approximately preserves all distances.
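To make the notion of a spanner concrete, here is a minimal sketch of the classical greedy spanner construction. This is a well-known textbook technique swapped in for illustration, not the method of this work; the stretch parameter `t`, the function names, and the edge-list representation are assumptions.

```python
import heapq

def bounded_distance(adj, source, target, limit):
    """Dijkstra from `source`; returns dist(source, target), or inf if it exceeds `limit`."""
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            return d
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w in adj[u]:
            nd = d + w
            if nd <= limit and nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return float("inf")

def greedy_spanner(n, edges, t):
    """Return a subset of `edges` in which every distance is stretched by at most a factor t."""
    adj = {v: [] for v in range(n)}
    kept = []
    # Consider edges from lightest to heaviest.
    for u, v, w in sorted(edges, key=lambda e: e[2]):
        # Keep the edge only if the current spanner cannot already
        # connect u and v within the allowed stretch.
        if bounded_distance(adj, u, v, t * w) > t * w:
            adj[u].append((v, w))
            adj[v].append((u, w))
            kept.append((u, v, w))
    return kept

# Hypothetical example: a unit-weight triangle.
TRIANGLE = [(0, 1, 1), (1, 2, 1), (0, 2, 1)]
```

With stretch 2 the third edge of the triangle is redundant, because the two-edge detour already has length 2; with stretch 1 all three edges must be kept.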
We study efficient constructions and applications of these structures in various models of computation that capture different aspects of computational systems. Specifically, we propose new algorithms for constructing hopsets and distance oracles in two modern distributed models: the Massively Parallel Computation (MPC) and the Congested Clique model. These models have received significant attention recently due to their close connection to present-day big data platforms.
In a different direction, we consider a centralized dynamic model in which the input changes over time. We propose new dynamic algorithms for constructing hopsets and distance oracles that lead to state-of-the-art approximate single-source, multi-source and all-pairs shortest path algorithms with respect to update-time.
Finally, we study the problem of finding optimal spanners in a different distributed model, the LOCAL model. Unlike our other results, for this problem our goal is to find the best solution for a specific input graph rather than giving a general guarantee that holds for all inputs.
One contribution of this work is to emphasize the significance of the tools and the techniques used for these distance problems rather than heavily focusing on a specific model.
In other words, we show that our techniques are broad enough to extend to different models.
Exploiting Hopsets: Improved Distance Oracles for Graphs of Constant Highway Dimension and Beyond
For fixed h >= 2, we consider the task of adding to a graph G a set of weighted shortcut edges on the same vertex set, such that the length of a shortest h-hop path between any pair of vertices in the augmented graph is exactly the same as the original distance between these vertices in G. A set of shortcut edges with this property is called an exact h-hopset and may be applied in processing distance queries on graph G. In particular, a 2-hopset directly corresponds to a distributed distance oracle known as a hub labeling. In this work, we explore centralized distance oracles based on 3-hopsets and demonstrate their advantages in several practical scenarios. In particular, for graphs of constant highway dimension, and more generally for graphs of constant skeleton dimension, we show that 3-hopsets require exponentially fewer shortcuts per node than any previously described distance oracle, and also offer a speedup in query time when compared to simple oracles based on a direct application of 2-hopsets. Finally, we consider the problem of computing a minimum-size h-hopset (for any h >= 2) for a given graph G, showing a polylogarithmic-factor approximation for the case of unique shortest path graphs. When h=3, for a given bound on the space used by the distance oracle, we provide a hopset construction achieving a polylogarithmic approximation for both space and query time compared to the optimal 3-hopset oracle for that space bound.
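The correspondence between a 2-hopset and a hub labeling can be illustrated with a small query routine. This is a minimal sketch under assumed names and data layout (a dictionary of per-vertex labels mapping hub to distance); it shows only the standard hub-labeling query, not the 3-hopset oracles studied in the paper.

```python
import math

def hub_query(labels, u, v):
    """Distance estimate from hub labels: minimum of d(u, hub) + d(hub, v) over common hubs.

    `labels[x]` maps each hub of x to the distance from x to that hub, which is
    the weight of the shortcut edge (x, hub) in the corresponding 2-hopset.
    """
    lu, lv = labels[u], labels[v]
    if len(lu) > len(lv):
        lu, lv = lv, lu  # iterate over the smaller label
    best = math.inf
    for hub, d in lu.items():
        if hub in lv:
            best = min(best, d + lv[hub])
    return best

# Hypothetical labels for a unit-weight path 0-1-2, with vertex 1 as the shared hub.
EXAMPLE_LABELS = {
    0: {0: 0, 1: 1},
    1: {1: 0},
    2: {2: 0, 1: 1},
}
```

Each answer corresponds to a path of at most two shortcut edges through a common hub, which is why an exact 2-hopset yields exact distances under this query.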
Almost Shortest Paths with Near-Additive Error in Weighted Graphs
Let $G = (V, E)$ be a weighted undirected graph with $n$ vertices and $m$
edges, and fix a set of sources $S \subseteq V$. We study the problem of
computing almost shortest paths (ASP) for all pairs in $S \times V$ in
both classical centralized and parallel (PRAM) models of computation. Consider
the regime of multiplicative approximation of $1+\epsilon$, for an arbitrarily
small constant $\epsilon > 0$. In this regime existing centralized algorithms
require time, where is the
matrix multiplication exponent. Existing PRAM algorithms with polylogarithmic
depth (aka time) require work .
Our centralized algorithm has running time , and its PRAM
counterpart has polylogarithmic depth and work , for an
arbitrarily small constant . For a pair , it
provides a path of length that satisfies , where is the weight of the
heaviest edge on some shortest path. Hence our additive term depends
linearly on a local maximum edge weight, as opposed to the global maximum
edge weight in previous works. Finally, our .
We also extend a centralized algorithm of Dor et al. [DHZ00]. For a
parameter , this algorithm provides for unweighted
graphs a purely additive approximation of for all pairs
shortest paths (APASP) in time . Within the same
running time, our algorithm for weighted graphs provides a purely
additive error of , for every vertex pair , with defined as above.
On the way to these results we devise a suite of novel constructions of
spanners, emulators and hopsets.
On Efficient Distributed Construction of Near Optimal Routing Schemes
Given a distributed network represented by a weighted undirected graph
on vertices, and a parameter , we devise a distributed
algorithm that computes a routing scheme in
rounds, where is the hop-diameter of the network. The running time matches
the lower bound of rounds (which holds for any
scheme with polynomial stretch), up to lower order terms. The routing tables
are of size , the labels are of size , and
every packet is routed on a path suffering stretch at most . Our
construction nearly matches the state-of-the-art for routing schemes built in a
centralized sequential manner. The previous best algorithms for building
routing tables in a distributed small messages model were by [LP13] (STOC
2013) and [LP15] (PODC 2015). The former has similar properties but
suffers from substantially larger routing tables of size ,
while the latter has sub-optimal running time of