108 research outputs found
Congested Clique Algorithms for Graph Spanners
Graph spanners are sparse subgraphs that faithfully preserve the distances in the original graph up to small stretch. Spanner have been studied extensively as they have a wide range of applications ranging from distance oracles, labeling schemes and routing to solving linear systems and spectral sparsification. A k-spanner maintains pairwise distances up to multiplicative factor of k. It is a folklore that for every n-vertex graph G, one can construct a (2k-1) spanner with O(n^{1+1/k}) edges. In a distributed setting, such spanners can be constructed in the standard CONGEST model using O(k^2) rounds, when randomization is allowed.
In this work, we consider spanner constructions in the congested clique model, and show:
- a randomized construction of a (2k-1)-spanner with O~(n^{1+1/k}) edges in O(log k) rounds. The previous best algorithm runs in O(k) rounds;
- a deterministic construction of a (2k-1)-spanner with O~(n^{1+1/k}) edges in O(log k +(log log n)^3) rounds. The previous best algorithm runs in O(k log n) rounds. This improvement is achieved by a new derandomization theorem for hitting sets which might be of independent interest;
- a deterministic construction of a O(k)-spanner with O(k * n^{1+1/k}) edges in O(log k) rounds
Massively Parallel Algorithms for Distance Approximation and Spanners
Over the past decade, there has been increasing interest in
distributed/parallel algorithms for processing large-scale graphs. By now, we
have quite fast algorithms -- usually sublogarithmic-time and often
-time, or even faster -- for a number of fundamental graph
problems in the massively parallel computation (MPC) model. This model is a
widely-adopted theoretical abstraction of MapReduce style settings, where a
number of machines communicate in an all-to-all manner to process large-scale
data. Contributing to this line of work on MPC graph algorithms, we present
round MPC algorithms for computing
-spanners in the strongly sublinear regime of local memory. To
the best of our knowledge, these are the first sublogarithmic-time MPC
algorithms for spanner construction. As primary applications of our spanners,
we get two important implications, as follows:
-For the MPC setting, we get an -round algorithm for
approximation of all pairs shortest paths (APSP) in the
near-linear regime of local memory. To the best of our knowledge, this is the
first sublogarithmic-time MPC algorithm for distance approximations.
-Our result above also extends to the Congested Clique model of distributed
computing, with the same round complexity and approximation guarantee. This
gives the first sub-logarithmic algorithm for approximating APSP in weighted
graphs in the Congested Clique model
Near-Optimal Approximate Shortest Paths and Transshipment in Distributed and Streaming Models
We present a method for solving the transshipment problem - also known as
uncapacitated minimum cost flow - up to a multiplicative error of in undirected graphs with non-negative edge weights using a
tailored gradient descent algorithm. Using to hide
polylogarithmic factors in (the number of nodes in the graph), our gradient
descent algorithm takes iterations, and in each
iteration it solves an instance of the transshipment problem up to a
multiplicative error of . In particular, this allows
us to perform a single iteration by computing a solution on a sparse spanner of
logarithmic stretch. Using a randomized rounding scheme, we can further extend
the method to finding approximate solutions for the single-source shortest
paths (SSSP) problem. As a consequence, we improve upon prior work by obtaining
the following results: (1) Broadcast CONGEST model: -approximate SSSP using rounds, where is the (hop) diameter of the network.
(2) Broadcast congested clique model: -approximate
transshipment and SSSP using rounds. (3)
Multipass streaming model: -approximate transshipment and
SSSP using space and passes. The
previously fastest SSSP algorithms for these models leverage sparse hop sets.
We bypass the hop set construction; computing a spanner is sufficient with our
method. The above bounds assume non-negative edge weights that are polynomially
bounded in ; for general non-negative weights, running times scale with the
logarithm of the maximum ratio between non-zero weights.Comment: Accepted to SIAM Journal on Computing. Preliminary version in DISC
2017. Abstract shortened to fit arXiv's limitation to 1920 character
Sparse Hopsets in Congested Clique
We give the first Congested Clique algorithm that computes a sparse hopset
with polylogarithmic hopbound in polylogarithmic time. Given a graph ,
a -hopset with "hopbound" , is a set of edges
added to such that for any pair of nodes and in there is a path
with at most hops in with length within of
the shortest path between and in .
Our hopsets are significantly sparser than the recent construction of
Censor-Hillel et al. [6], that constructs a hopset of size
, but with a smaller polylogarithmic hopbound. On the other
hand, the previously known constructions of sparse hopsets with polylogarithmic
hopbound in the Congested Clique model, proposed by Elkin and Neiman
[10],[11],[12], all require polynomial rounds.
One tool that we use is an efficient algorithm that constructs an
-limited neighborhood cover, that may be of independent interest.
Finally, as a side result, we also give a hopset construction in a variant of
the low-memory Massively Parallel Computation model, with improved running time
over existing algorithms
Massively Parallel Approximate Distance Sketches
Data structures that allow efficient distance estimation (distance oracles, distance sketches, etc.) have been extensively studied, and are particularly well studied in centralized models and classical distributed models such as CONGEST. We initiate their study in newer (and arguably more realistic) models of distributed computation: the Congested Clique model and the Massively Parallel Computation (MPC) model. We provide efficient constructions in both of these models, but our core results are for MPC. In MPC we give two main results: an algorithm that constructs stretch/space optimal distance sketches but takes a (small) polynomial number of rounds, and an algorithm that constructs distance sketches with worse stretch but that only takes polylogarithmic rounds.
Along the way, we show that other useful combinatorial structures can also be computed in MPC. In particular, one key component we use to construct distance sketches are an MPC construction of the hopsets of [Elkin and Neiman, 2016]. This result has additional applications such as the first polylogarithmic time algorithm for constant approximate single-source shortest paths for weighted graphs in the low memory MPC setting
Derandomizing Local Distributed Algorithms under Bandwidth Restrictions
This paper addresses the cornerstone family of local problems in distributed computing, and investigates the curious gap between randomized and deterministic solutions under bandwidth restrictions.
Our main contribution is in providing tools for derandomizing solutions to local problems, when the n nodes can only send O(log n)-bit messages in each round of communication. We combine bounded independence, which we show to be sufficient for some algorithms, with the method of conditional expectations and with additional machinery, to obtain the following results.
First, we show that in the Congested Clique model, which allows all-to-all communication, there is a deterministic maximal independent set (MIS) algorithm that runs in O(log^2 Delta) rounds, where Delta is the maximum degree. When Delta=O(n^(1/3)), the bound improves to O(log Delta).
Adapting the above to the CONGEST model gives an O(D log^2 n)-round deterministic MIS algorithm, where D is the diameter of the graph. Apart from a previous unproven claim of a O(D log^3 n)-round algorithm, the only known deterministic solutions for the CONGEST model are a coloring-based O(Delta + log^* n)-round algorithm, where Delta is the maximal degree in the graph, and a 2^O(sqrt(log n log log n))-round algorithm, which is super-polylogarithmic in n.
In addition, we deterministically construct a (2k-1)-spanner with O(kn^(1+1/k) log n) edges in O(k log n) rounds in the Congested Clique model. For comparison, in the more stringent CONGEST model, where the communication graph is identical to the input graph, the best deterministic algorithm for constructing a (2k-1)-spanner with O(kn^(1+1/k)) edges runs in O(n^(1-1/k)) rounds
Brief Announcement: Massively Parallel Approximate Distance Sketches
Data structures that allow efficient distance estimation have been extensively studied both in centralized models and classical distributed models. We initiate their study in newer (and arguably more realistic) models of distributed computation: the Congested Clique model and the Massively Parallel Computation (MPC) model. In MPC we give two main results: an algorithm that constructs stretch/space optimal distance sketches but takes a (small) polynomial number of rounds, and an algorithm that constructs distance sketches with worse stretch but that only takes polylogarithmic rounds. Along the way, we show that other useful combinatorial structures can also be computed in MPC. In particular, one key component we use is an MPC construction of the hopsets of Elkin and Neiman (2016). This result has additional applications such as the first polylogarithmic time algorithm for constant approximate single-source shortest paths for weighted graphs in the low memory MPC setting
Being Fast Means Being Chatty: The Local Information Cost of Graph Spanners
We introduce a new measure for quantifying the amount of information that the
nodes in a network need to learn to jointly solve a graph problem. We show that
the local information cost () presents a natural lower bound on
the communication complexity of distributed algorithms. For the synchronous
CONGEST-KT1 model, where each node has initial knowledge of its neighbors' IDs,
we prove that bits are
required for solving a graph problem with a -round algorithm that
errs with probability at most . Our result is the first lower bound
that yields a general trade-off between communication and time for graph
problems in the CONGEST-KT1 model.
We demonstrate how to apply the local information cost by deriving a lower
bound on the communication complexity of computing a -spanner that
consists of at most edges, where . Our main result is that any -time
algorithm must send at least bits in the
CONGEST model under the KT1 assumption. Previously, only a trivial lower bound
of bits was known for this problem.
A consequence of our lower bound is that achieving both time- and
communication-optimality is impossible when designing a distributed spanner
algorithm. In light of the work of King, Kutten, and Thorup (PODC 2015), this
shows that computing a minimum spanning tree can be done significantly faster
than finding a spanner when considering algorithms with
communication complexity. Our result also implies time complexity lower bounds
for constructing a spanner in the node-congested clique of Augustine et al.
(2019) and in the push-pull gossip model with limited bandwidth
- …