Search CORE

1,351 research outputs found

Log Diameter Rounds Algorithms for 2-Vertex and 2-Edge Connectivity

Author: Andoni Alexandr
Stein Clifford
Zhong Peilin
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019)
Publication date: 01/01/2019
Field of study

Many modern parallel systems, such as MapReduce, Hadoop and Spark, can be modeled well by the MPC model. The MPC model captures well coarse-grained computation on large data - data is distributed to processors, each of which has a sublinear (in the input data) amount of memory and we alternate between rounds of computation and rounds of communication, where each machine can communicate an amount of data as large as the size of its memory. This model is stronger than the classical PRAM model, and it is an intriguing question to design algorithms whose running time is smaller than in the PRAM model. In this paper, we study two fundamental problems, 2-edge connectivity and 2-vertex connectivity (biconnectivity). PRAM algorithms which run in O(log n) time have been known for many years. We give algorithms using roughly log diameter rounds in the MPC model. Our main results are, for an n-vertex, m-edge graph of diameter D and bi-diameter D\u27, 1) a O(log D log log_{m/n} n) parallel time 2-edge connectivity algorithm, 2) a O(log D log^2 log_{m/n}n+log D\u27log log_{m/n}n) parallel time biconnectivity algorithm, where the bi-diameter D\u27 is the largest cycle length over all the vertex pairs in the same biconnected component. Our results are fully scalable, meaning that the memory per processor can be O(n^{delta}) for arbitrary constant delta>0, and the total memory used is linear in the problem size. Our 2-edge connectivity algorithm achieves the same parallel time as the connectivity algorithm of [Andoni et al., 2018]. We also show an Omega(log D\u27) conditional lower bound for the biconnectivity problem

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Recommended from our members

New Primitives for Tackling Graph Problems and Their Applications in Parallel Computing

Author: Zhong Peilin
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2021
Field of study

We study fundamental graph problems under parallel computing models. In particular, we consider two parallel computing models: Parallel Random Access Machine (PRAM) and Massively Parallel Computation (MPC). The PRAM model is a classic model of parallel computation. The efficiency of a PRAM algorithm is measured by its parallel time and the number of processors needed to achieve the parallel time. The MPC model is an abstraction of modern massive parallel computing systems such as MapReduce, Hadoop and Spark. The MPC model captures well coarse-grained computation on large data --- data is distributed to processors, each of which has a sublinear (in the input data) amount of local memory and we alternate between rounds of computation and rounds of communication, where each machine can communicate an amount of data as large as the size of its memory. We usually desire fully scalable MPC algorithms, i.e., algorithms that can work for any local memory size. The efficiency of a fully scalable MPC algorithm is measured by its parallel time and the total space usage (the local memory size times the number of machines). Consider an -vertex -edge undirected graph (either weighted or unweighted) with diameter (the largest diameter of its connected components). Let =+ denote the size of . We present a series of efficient (randomized) parallel graph algorithms with theoretical guarantees. Several results are listed as follows: 1) Fully scalable MPC algorithms for graph connectivity and spanning forest using () total space and (log loglog_{/} ) parallel time. 2) Fully scalable MPC algorithms for 2-edge and 2-vertex connectivity using () total space where 2-edge connectivity algorithm needs (log loglog_{/} ) parallel time, and 2-vertex connectivity algorithm needs (log ⸱log²log_{/} n+\log D'⸱loglog_{/} ) parallel time. Here ' denotes the bi-diameter of . 3) PRAM algorithms for graph connectivity and spanning forest using () processors and (log loglog_{/} ) parallel time. 4) PRAM algorithms for (1 + )-approximate shortest path and (1 + )-approximate uncapacitated minimum cost flow using () processors and poly(log ) parallel time. These algorithms are built on a series of new graph algorithmic primitives which may be of independent interests

Columbia University Academic Commons

Small Cuts and Connectivity Certificates: A Fault Tolerant Approach

Author: Parter Merav
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 33rd International Symposium on Distributed Computing (DISC 2019)
Publication date: 01/01/2019
Field of study

We revisit classical connectivity problems in the {CONGEST} model of distributed computing. By using techniques from fault tolerant network design, we show improved constructions, some of which are even "local" (i.e., with O~(1) rounds) for problems that are closely related to hard global problems (i.e., with a lower bound of Omega(Diam+sqrt{n}) rounds). Distributed Minimum Cut: Nanongkai and Su presented a randomized algorithm for computing a (1+epsilon)-approximation of the minimum cut using O~(D +sqrt{n}) rounds where D is the diameter of the graph. For a sufficiently large minimum cut lambda=Omega(sqrt{n}), this is tight due to Das Sarma et al. [FOCS \u2711], Ghaffari and Kuhn [DISC \u2713]. - Small Cuts: A special setting that remains open is where the graph connectivity lambda is small (i.e., constant). The only lower bound for this case is Omega(D), with a matching bound known only for lambda <= 2 due to Pritchard and Thurimella [TALG \u2711]. Recently, Daga, Henzinger, Nanongkai and Saranurak [STOC \u2719] raised the open problem of computing the minimum cut in poly(D) rounds for any lambda=O(1). In this paper, we resolve this problem by presenting a surprisingly simple algorithm, that takes a completely different approach than the existing algorithms. Our algorithm has also the benefit that it computes all minimum cuts in the graph, and naturally extends to vertex cuts as well. At the heart of the algorithm is a graph sampling approach usually used in the context of fault tolerant (FT) design. - Deterministic Algorithms: While the existing distributed minimum cut algorithms are randomized, our algorithm can be made deterministic within the same round complexity. To obtain this, we introduce a novel definition of universal sets along with their efficient computation. This allows us to derandomize the FT graph sampling technique, which might be of independent interest. - Computation of all Edge Connectivities: We also consider the more general task of computing the edge connectivity of all the edges in the graph. In the output format, it is required that the endpoints u,v of every edge (u,v) learn the cardinality of the u-v cut in the graph. We provide the first sublinear algorithm for this problem for the case of constant connectivity values. Specifically, by using the recent notion of low-congestion cycle cover, combined with the sampling technique, we compute all edge connectivities in poly(D) * 2^{O(sqrt{log n log log n})} rounds. Sparse Certificates: For an n-vertex graph G and an integer lambda, a lambda-sparse certificate H is a subgraph H subseteq G with O(lambda n) edges which is lambda-connected iff G is lambda-connected. For D-diameter graphs, constructions of sparse certificates for lambda in {2,3} have been provided by Thurimella [J. Alg. \u2797] and Dori [PODC \u2718] respectively using O~(D) number of rounds. The problem of devising such certificates with o(D+sqrt{n}) rounds was left open by Dori [PODC \u2718] for any lambda >= 4. Using connections to fault tolerant spanners, we considerably improve the round complexity for any lambda in [1,n] and epsilon in (0,1), by showing a construction of (1-epsilon)lambda-sparse certificates with O(lambda n) edges using only O(1/epsilon^2 * log^{2+o(1)} n) rounds

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Distributed Edge Connectivity in Sublinear Time

Author: Abboud A.
Ahmadi M.
Becker R.
Bernstein A.
Censor-Hillel K.
Chang Y.
Chu T.
Forster S.
Hoang L.
Huang C.
Kannan R.
Kawarabayashi K.
Kelner J. A.
Kuhn F.
Nanongkai D.
Nanongkai D.
Orecchia L.
Orecchia L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/04/2019
Field of study

We present the first sublinear-time algorithm for a distributed message-passing network sto compute its edge connectivity

\lambda

exactly in the CONGEST model, as long as there are no parallel edges. Our algorithm takes

\tilde O(n^{1-1/353}D^{1/353}+n^{1-1/706})

time to compute

\lambda

and a cut of cardinality

\lambda

with high probability, where

n

and

D

are the number of nodes and the diameter of the network, respectively, and

\tilde O

hides polylogarithmic factors. This running time is sublinear in

n

(i.e.

\tilde O(n^{1-\epsilon})

) whenever

D

is. Previous sublinear-time distributed algorithms can solve this problem either (i) exactly only when

\lambda=O(n^{1/8-\epsilon})

[Thurimella PODC'95; Pritchard, Thurimella, ACM Trans. Algorithms'11; Nanongkai, Su, DISC'14] or (ii) approximately [Ghaffari, Kuhn, DISC'13; Nanongkai, Su, DISC'14]. To achieve this we develop and combine several new techniques. First, we design the first distributed algorithm that can compute a

k

-edge connectivity certificate for any

k=O(n^{1-\epsilon})

in time

\tilde O(\sqrt{nk}+D)

. Second, we show that by combining the recent distributed expander decomposition technique of [Chang, Pettie, Zhang, SODA'19] with techniques from the sequential deterministic edge connectivity algorithm of [Kawarabayashi, Thorup, STOC'15], we can decompose the network into a sublinear number of clusters with small average diameter and without any mincut separating a cluster (except the `trivial' ones). Finally, by extending the tree packing technique from [Karger STOC'96], we can find the minimum cut in time proportional to the number of components. As a byproduct of this technique, we obtain an

\tilde O(n)

-time algorithm for computing exact minimum cut for weighted graphs.Comment: Accepted at 51st ACM Symposium on Theory of Computing (STOC 2019

arXiv.org e-Print Archive

Crossref

Distributed Connectivity Decomposition

Author: Censor-Hillel Keren
Ghaffari Mohsen
Kuhn Fabian
Publication venue
Publication date: 21/11/2013
Field of study

We present time-efficient distributed algorithms for decomposing graphs with large edge or vertex connectivity into multiple spanning or dominating trees, respectively. As their primary applications, these decompositions allow us to achieve information flow with size close to the connectivity by parallelizing it along the trees. More specifically, our distributed decomposition algorithms are as follows: (I) A decomposition of each undirected graph with vertex-connectivity

k

into (fractionally) vertex-disjoint weighted dominating trees with total weight

\Omega(\frac{k}{\log n})

, in

\widetilde{O}(D+\sqrt{n})

rounds. (II) A decomposition of each undirected graph with edge-connectivity

\lambda

into (fractionally) edge-disjoint weighted spanning trees with total weight

\lceil\frac{\lambda-1}{2}\rceil(1-\varepsilon)

, in

\widetilde{O}(D+\sqrt{n\lambda})

rounds. We also show round complexity lower bounds of

\tilde{\Omega}(D+\sqrt{\frac{n}{k}})

and

\tilde{\Omega}(D+\sqrt{\frac{n}{\lambda}})

for the above two decompositions, using techniques of [Das Sarma et al., STOC'11]. Moreover, our vertex-connectivity decomposition extends to centralized algorithms and improves the time complexity of [Censor-Hillel et al., SODA'14] from

O(n^3)

to near-optimal

\tilde{O}(m)

. As corollaries, we also get distributed oblivious routing broadcast with

O(1)

-competitive edge-congestion and

O(\log n)

-competitive vertex-congestion. Furthermore, the vertex connectivity decomposition leads to near-time-optimal

O(\log n)

-approximation of vertex connectivity: centralized

\widetilde{O}(m)

and distributed

\tilde{O}(D+\sqrt{n})

. The former moves toward the 1974 conjecture of Aho, Hopcroft, and Ullman postulating an

O(m)

centralized exact algorithm while the latter is the first distributed vertex connectivity approximation

arXiv.org e-Print Archive

CiteSeerX

Theoretically Efficient Parallel Graph Algorithms Can Be Fast and Scalable

Author: Blelloch G. E.
Blelloch G. E.
Cormen T. H.
Da Zheng D. M.
Dasari N. S.
Gonzalez J. E.
Greenlaw R.
Karp R. M.
Low Y.
Maon Y.
Ramachandran V.
Shiloach Y.
Zhou W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/07/2019
Field of study

There has been significant recent interest in parallel graph processing due to the need to quickly analyze the large graphs available today. Many graph codes have been designed for distributed memory or external memory. However, today even the largest publicly-available real-world graph (the Hyperlink Web graph with over 3.5 billion vertices and 128 billion edges) can fit in the memory of a single commodity multicore server. Nevertheless, most experimental work in the literature report results on much smaller graphs, and the ones for the Hyperlink graph use distributed or external memory. Therefore, it is natural to ask whether we can efficiently solve a broad class of graph problems on this graph in memory. This paper shows that theoretically-efficient parallel graph algorithms can scale to the largest publicly-available graphs using a single machine with a terabyte of RAM, processing them in minutes. We give implementations of theoretically-efficient parallel algorithms for 20 important graph problems. We also present the optimizations and techniques that we used in our implementations, which were crucial in enabling us to process these large graphs quickly. We show that the running times of our implementations outperform existing state-of-the-art implementations on the largest real-world graphs. For many of the problems that we consider, this is the first time they have been solved on graphs at this scale. We have made the implementations developed in this work publicly-available as the Graph-Based Benchmark Suite (GBBS).Comment: This is the full version of the paper appearing in the ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 201

arXiv.org e-Print Archive

Crossref

DSpace@MIT

Distributed Minimum Cut Approximation

Author: A. Das Sarma
A.A. Razborov
B. Kalyanasundaram
D.R. Karger
H. Nagamochi
J.C. Picard
L.R. Ford
M. Elkin
M.V. Lomonosov
P. Elias
R. Thurimella
Publication venue
Publication date: 01/01/2013
Field of study

We study the problem of computing approximate minimum edge cuts by distributed algorithms. We use a standard synchronous message passing model where in each round,

O(\log n)

bits can be transmitted over each edge (a.k.a. the CONGEST model). We present a distributed algorithm that, for any weighted graph and any

\epsilon \in (0, 1)

, with high probability finds a cut of size at most

O(\epsilon^{-1}\lambda)

O(D) + \tilde{O}(n^{1/2 + \epsilon})

rounds, where

\lambda

is the size of the minimum cut. This algorithm is based on a simple approach for analyzing random edge sampling, which we call the random layering technique. In addition, we also present another distributed algorithm, which is based on a centralized algorithm due to Matula [SODA '93], that with high probability computes a cut of size at most

(2+\epsilon)\lambda

\tilde{O}((D+\sqrt{n})/\epsilon^5)

rounds for any

\epsilon>0

. The time complexities of both of these algorithms almost match the

\tilde{\Omega}(D + \sqrt{n})

lower bound of Das Sarma et al. [STOC '11], thus leading to an answer to an open question raised by Elkin [SIGACT-News '04] and Das Sarma et al. [STOC '11]. Furthermore, we also strengthen the lower bound of Das Sarma et al. by extending it to unweighted graphs. We show that the same lower bound also holds for unweighted multigraphs (or equivalently for weighted graphs in which