Theoretically Efficient Parallel Graph Algorithms Can Be Fast and Scalable

Author: Blelloch G. E.
Cormen T. H.
Da Zheng D. M.
Dasari N. S.
Gonzalez J. E.
Greenlaw R.
Karp R. M.
Low Y.
Maon Y.
Ramachandran V.
Shiloach Y.
Zhou W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/07/2019
Field of study

There has been significant recent interest in parallel graph processing due to the need to quickly analyze the large graphs available today. Many graph codes have been designed for distributed memory or external memory. However, today even the largest publicly-available real-world graph (the Hyperlink Web graph with over 3.5 billion vertices and 128 billion edges) can fit in the memory of a single commodity multicore server. Nevertheless, most experimental work in the literature reports results on much smaller graphs, and the results for the Hyperlink graph use distributed or external memory. Therefore, it is natural to ask whether we can efficiently solve a broad class of graph problems on this graph in memory. This paper shows that theoretically-efficient parallel graph algorithms can scale to the largest publicly-available graphs using a single machine with a terabyte of RAM, processing them in minutes. We give implementations of theoretically-efficient parallel algorithms for 20 important graph problems. We also present the optimizations and techniques that we used in our implementations, which were crucial in enabling us to process these large graphs quickly. We show that the running times of our implementations outperform existing state-of-the-art implementations on the largest real-world graphs. For many of the problems that we consider, this is the first time they have been solved on graphs at this scale. We have made the implementations developed in this work publicly-available as the Graph-Based Benchmark Suite (GBBS). Comment: This is the full version of the paper appearing in the ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 201

arXiv.org e-Print Archive

Fast Local Computation Algorithms

Author: Rubinfeld Ronitt
Tamir Gil
Vardi Shai
Xie Ning
Publication date: 01/01/2011

For input $x$, let $F(x)$ denote the set of outputs that are the "legal" answers for a computational problem $F$. Suppose $x$ and members of $F(x)$ are so large that there is not time to read them in their entirety. We propose a model of {\em local computation algorithms} which, for a given input $x$, support queries by a user to values of specified locations $y_i$ in a legal output $y \in F(x)$. When more than one legal output $y$ exists for a given $x$, the local computation algorithm should output in a way that is consistent with at least one such $y$. Local computation algorithms are intended to distill the common features of several concepts that have appeared in various algorithmic subfields, including local distributed computation, local algorithms, locally decodable codes, and local reconstruction. We develop a technique, based on known constructions of small sample spaces of $k$-wise independent random variables and Beck's analysis in his algorithmic approach to the Lovász Local Lemma, which under certain conditions can be applied to construct local computation algorithms that run in {\em polylogarithmic} time and space. We apply this technique to maximal independent set computations, scheduling radio network broadcasts, hypergraph coloring and satisfying $k$-SAT formulas. Comment: A preliminary version of this paper appeared in ICS 2011, pp. 223-23
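
As a rough illustration of the query model described in this abstract, the Python sketch below answers the single query "is vertex v in the maximal independent set?" while exploring only the part of the graph it needs. This is not the paper's construction, which relies on $k$-wise independent sample spaces and Beck's analysis; it swaps in a simpler random-rank greedy rule. The names `rank`, `in_mis`, and the `neighbors` oracle are hypothetical and chosen only for this example.

```python
import hashlib


def rank(v, seed):
    """Pseudo-random rank of vertex v, fixed by the shared seed.

    The vertex id is appended as a tie-breaker so ranks are distinct.
    """
    digest = hashlib.sha256(f"{seed}:{v}".encode()).digest()
    return (int.from_bytes(digest[:8], "big"), v)


def in_mis(v, neighbors, seed, memo=None):
    """Answer one query: is v in the greedy MIS induced by the ranks?

    v is in the set iff none of its lower-rank neighbors is in the set.
    `neighbors` is an oracle (assumed for this sketch) returning v's neighbors.
    """
    if memo is None:
        memo = {}  # per-query cache only; nothing is kept between queries
    if v in memo:
        return memo[v]
    rv = rank(v, seed)
    result = True
    for u in sorted(neighbors(v), key=lambda x: rank(x, seed)):
        # Only lower-rank neighbors can exclude v, so the recursion terminates.
        if rank(u, seed) < rv and in_mis(u, neighbors, seed, memo):
            result = False
            break
    memo[v] = result
    return result
```

Because every query derives the same ranks from the same seed, independent queries are consistent with one single maximal independent set, which is the consistency requirement the abstract states.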

arXiv.org e-Print Archive

CiteSeerX

Best of Two Local Models: Local Centralized and Local Distributed Algorithms

Author: Even Guy
Medina Moti
Ron Dana
Publication date: 11/11/2014

We consider two models of computation: centralized local algorithms and local distributed algorithms. Algorithms in one model are adapted to the other model to obtain improved algorithms. Distributed vertex coloring is employed to design improved centralized local algorithms for: maximal independent set, maximal matching, and an approximation scheme for maximum (weighted) matching over bounded degree graphs. The improvement is threefold: the algorithms are deterministic, stateless, and the number of probes grows polynomially in $\log^* n$, where $n$ is the number of vertices of the input graph. The recursive centralized local improvement technique by Nguyen and Onak~\cite{onak2008} is employed to obtain an improved distributed approximation scheme for maximum (weighted) matching. The improvement is twofold: we reduce the number of rounds from $O(\log n)$ to $O(\log^* n)$ for a wide range of instances, and our algorithms are deterministic rather than randomized.
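
To give a sense of how slowly the iterated logarithm $\log^* n$ in these bounds grows, here is a minimal Python sketch. It assumes base 2 and the common convention that $\log^* n$ counts how many times the logarithm must be applied before the value drops to 1 or below; the function name `log_star` is hypothetical.

```python
import math


def log_star(n):
    """Iterated logarithm, base 2 (assumed convention for this sketch).

    Counts how many times log2 is applied before the value is <= 1.
    """
    count = 0
    x = n
    while x > 1:
        x = math.log2(x)
        count += 1
    return count
```

Under this convention `log_star(65536)` is 4 and `log_star(2**65536)` is 5, so for any realistic graph size the value is a small constant.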

arXiv.org e-Print Archive

CiteSeerX

Distributed Maximum Matching in Bounded Degree Graphs

Author: Even Guy
Medina Moti
Ron Dana
Publication date: 11/11/2014

We present deterministic distributed algorithms for computing approximate maximum cardinality matchings and approximate maximum weight matchings. Our algorithm for the unweighted case computes a matching whose size is at least $(1-\eps)$ times the optimal in $\Delta^{O(1/\eps)} + O\left(\frac{1}{\eps^2}\right) \cdot \log^*(n)$ rounds, where $n$ is the number of vertices in the graph and $\Delta$ is the maximum degree. Our algorithm for the edge-weighted case computes a matching whose weight is at least $(1-\eps)$ times the optimal in $\log(\min\{1/\wmin,n/\eps\})^{O(1/\eps)}\cdot(\Delta^{O(1/\eps)}+\log^*(n))$ rounds for edge-weights in $[\wmin,1]$. The best previous algorithms for both the unweighted case and the weighted case are by Lotker, Patt-Shamir, and Pettie~(SPAA 2008). For the unweighted case they give a randomized $(1-\eps)$-approximation algorithm that runs in $O((\log(n))/\eps^3)$ rounds. For the weighted case they give a randomized $(1/2-\eps)$-approximation algorithm that runs in $O(\log(\eps^{-1}) \cdot \log(n))$ rounds. Hence, our results improve on the previous ones when the parameters $\Delta$, $\eps$ and $\wmin$ are constants (where we reduce the number of rounds from $O(\log(n))$ to $O(\log^*(n))$), and more generally when $\Delta$, $1/\eps$ and $1/\wmin$ are sufficiently slowly increasing functions of $n$. Moreover, our algorithms are deterministic rather than randomized. Comment: arXiv admin note: substantial text overlap with arXiv:1402.379

arXiv.org e-Print Archive

CiteSeerX

Fast Distributed Approximation for Max-Cut

Author: C Lenzen
F Barahona
F Hadlock
F Kuhn
G Xue
J Håstad
K Chang
KW Chin
L Trevisan
L Wang
M Elkin
M Ghaffari
M Grötschel
M Åstrand
MR Garey
MX Goemans
N Buchbinder
N Linial
S Khot
S Matuura
S Sahni
S Saurabh
U Feige
Y Xu
Publication date: 26/07/2017

Finding a maximum cut is a fundamental task in many computational settings. Surprisingly, it has been insufficiently studied in the classic distributed settings, where vertices communicate by synchronously sending messages to their neighbors according to the underlying graph, known as the $\mathcal{LOCAL}$ and $\mathcal{CONGEST}$ models. We amend this by obtaining almost optimal algorithms for Max-Cut on a wide class of graphs in these models. In particular, for any $\epsilon > 0$, we develop randomized approximation algorithms achieving a ratio of $(1-\epsilon)$ to the optimum for Max-Cut on bipartite graphs in the $\mathcal{CONGEST}$ model, and on general graphs in the $\mathcal{LOCAL}$ model. We further present efficient deterministic algorithms, including a $1/3$-approximation for Max-Dicut in our models, thus improving the best known (randomized) ratio of $1/4$. Our algorithms make non-trivial use of the greedy approach of Buchbinder et al. (SIAM Journal on Computing, 2015) for maximizing an unconstrained (non-monotone) submodular function, which may be of independent interest.
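
For readers unfamiliar with the greedy approach mentioned at the end of this abstract, the following Python sketch shows the sequential deterministic "double greedy" rule for unconstrained submodular maximization, applied to a directed cut function. It is only the centralized building block, not the distributed algorithm of the paper; the helper names `dicut_value` and `double_greedy` and the arc-list representation are assumptions made for this example.

```python
def dicut_value(arcs, S):
    """Total weight of arcs (u, v, w) leaving S, i.e. u in S and v not in S."""
    return sum(w for (u, v, w) in arcs if u in S and v not in S)


def double_greedy(ground, f):
    """Deterministic double greedy for a non-negative submodular function f.

    X grows from the empty set and Y shrinks from the full ground set;
    each element is either added to X or removed from Y, whichever gains more.
    """
    X = set()
    Y = set(ground)
    for e in ground:
        gain_add = f(X | {e}) - f(X)      # benefit of committing e to the solution
        gain_remove = f(Y - {e}) - f(Y)   # benefit of discarding e for good
        if gain_add >= gain_remove:
            X.add(e)
        else:
            Y.remove(e)
    return X  # at this point X and Y coincide
```

A call such as `double_greedy(vertices, lambda S: dicut_value(arcs, S))` returns one side of a directed cut; the deterministic variant of this rule is known to achieve a 1/3 approximation for non-negative submodular functions, which matches the ratio quoted for Max-Dicut above.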

arXiv.org e-Print Archive

Distributed Approximation of Maximum Independent Set and Maximum Matching

Author: Bar-Yehuda Reuven
Bodlaender Marijke HL
Czygrinow Andrzej
Edmonds Jack
Halldórsson Magnús M
Kuhn Fabian
Publication date: 01/08/2017

We present a simple distributed $\Delta$-approximation algorithm for maximum weight independent set (MaxIS) in the $\mathsf{CONGEST}$ model which completes in $O(\texttt{MIS}(G)\cdot \log W)$ rounds, where $\Delta$ is the maximum degree, $\texttt{MIS}(G)$ is the number of rounds needed to compute a maximal independent set (MIS) on $G$, and $W$ is the maximum weight of a node. Whether our algorithm is randomized or deterministic depends on the \texttt{MIS} algorithm used as a black-box. Plugging in the best known algorithm for MIS gives a randomized solution in $O(\log n \log W)$ rounds, where $n$ is the number of nodes. We also present a deterministic $O(\Delta +\log^* n)$-round algorithm based on coloring. We then show how to use our MaxIS approximation algorithms to compute a $2$-approximation for maximum weight matching without incurring any additional round penalty in the $\mathsf{CONGEST}$ model. We use a known reduction for simulating algorithms on the line graph while incurring congestion, but we show our algorithm is part of a broad family of \emph{local aggregation algorithms} for which we describe a mechanism that allows the simulation to run in the $\mathsf{CONGEST}$ model without an additional overhead. Next, we show that for maximum weight matching, relaxing the approximation factor to $(2+\varepsilon)$ allows us to devise a distributed algorithm requiring $O(\frac{\log \Delta}{\log\log\Delta})$ rounds for any constant $\varepsilon>0$. For the unweighted case, we can even obtain a $(1+\varepsilon)$-approximation in this number of rounds. These algorithms are the first to achieve the provably optimal round complexity with respect to dependency on $\Delta$.


arXiv.org e-Print Archive

Optimal Dynamic Distributed MIS

Author: Censor-Hillel Keren
Haramaty Elad
Karnin Zohar
Publication date: 16/07/2015

Finding a maximal independent set (MIS) in a graph is a cornerstone task in distributed computing. The local nature of an MIS allows for fast solutions in a static distributed setting, which are logarithmic in the number of nodes or in their degrees. The result trivially applies for the dynamic distributed model, in which edges or nodes may be inserted or deleted. In this paper, we take a different approach which exploits locality to the extreme, and show how to update an MIS in a dynamic distributed setting, either \emph{synchronous} or \emph{asynchronous}, with only \emph{a single adjustment} and in a single round, in expectation. These strong guarantees hold for the \emph{complete fully dynamic} setting: Insertions and deletions, of edges as well as nodes, gracefully and abruptly. This strongly separates the static and dynamic distributed models, as super-constant lower bounds exist for computing an MIS in the former. Our results are obtained by a novel analysis of the surprisingly simple solution of carefully simulating the greedy \emph{sequential} MIS algorithm with a random ordering of the nodes. As such, our algorithm has a direct application as a $3$-approximation algorithm for correlation clustering. This adds to the important toolbox of distributed graph decompositions, which are widely used as crucial building blocks in distributed computing. Finally, our algorithm enjoys a useful \emph{history-independence} property, meaning the output is independent of the history of topology changes that constructed that graph. This means the output cannot be chosen, or even biased, by the adversary in case its goal is to prevent us from optimizing some objective function. Comment: 19 pages including appendix and reference
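
The greedy sequential MIS algorithm with a random node ordering, which this abstract says the dynamic algorithm simulates, can be sketched in a few lines of Python. This is only the static sequential baseline, not the paper's dynamic distributed update procedure; the adjacency-dictionary representation and the names `greedy_mis` and `random_order_mis` are assumptions for illustration.

```python
import random


def greedy_mis(adj, order):
    """Scan nodes in `order`; keep a node iff no neighbor was kept before it."""
    in_mis = set()
    for v in order:
        if not any(u in in_mis for u in adj[v]):
            in_mis.add(v)
    return in_mis


def random_order_mis(adj, seed=0):
    """Greedy MIS under a uniformly random ordering (seeded for repeatability)."""
    rng = random.Random(seed)
    order = list(adj)
    rng.shuffle(order)
    return greedy_mis(adj, order), order
```

For the path `adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}` and the fixed order `[0, 1, 2, 3]`, `greedy_mis` returns `{0, 2}`. The output is a function of the graph and the ordering alone, which is the intuition behind the history-independence property mentioned above.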

arXiv.org e-Print Archive

Parallel Maximum Clique Algorithms with Applications to Network Analysis and Storage

Author: Assefaw H. Gebremedhin
David F. Gleich
Md. Mostofa Ali Patwary
Ryan A. Rossi
Publication date: 25/12/2013

We propose a fast, parallel maximum clique algorithm for large sparse graphs that is designed to exploit characteristics of social and information networks. The method exhibits a roughly linear runtime scaling over real-world networks ranging from 1000 to 100 million nodes. In a test on a social network with 1.8 billion edges, the algorithm finds the largest clique in about 20 minutes. Our method employs a branch and bound strategy with novel and aggressive pruning techniques. For instance, we use the core number of a vertex in combination with a good heuristic clique finder to efficiently remove the vast majority of the search space. In addition, we parallelize the exploration of the search tree. During the search, processes immediately communicate changes to upper and lower bounds on the size of the maximum clique, which occasionally results in a super-linear speedup because vertices with large search spaces can be pruned by other processes. We apply the algorithm to two problems: to compute temporal strong components and to compress graphs. Comment: 11 page
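
The core-number pruning idea in this abstract rests on a simple fact: a vertex with core number $c$ can only belong to cliques of size at most $c+1$. The Python sketch below combines a straightforward (quadratic-time) core-number computation with a greedy heuristic clique to discard vertices that cannot improve on the clique already found. It is a minimal sequential illustration under assumed names (`core_numbers`, `greedy_clique`, `prune`), not the authors' parallel branch-and-bound implementation.

```python
def core_numbers(adj):
    """Core number of each vertex by repeatedly peeling a minimum-degree vertex."""
    deg = {v: len(adj[v]) for v in adj}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        v = min(remaining, key=lambda x: deg[x])
        k = max(k, deg[v])          # core values never decrease during peeling
        core[v] = k
        remaining.remove(v)
        for u in adj[v]:
            if u in remaining:
                deg[u] -= 1
    return core


def greedy_clique(adj, core):
    """Heuristic clique: start at a highest-core vertex, extend greedily."""
    start = max(adj, key=lambda v: core[v])
    clique = [start]
    candidates = set(adj[start])
    while candidates:
        v = max(candidates, key=lambda x: core[x])
        clique.append(v)
        candidates &= set(adj[v])   # keep only common neighbors of the clique
    return clique


def prune(adj, core, lower_bound):
    """Vertices that could still lie in a clique larger than lower_bound."""
    return {v for v in adj if core[v] + 1 > lower_bound}
```

If `prune(adj, core, len(clique))` comes back empty, the heuristic clique is already a maximum clique and no branch-and-bound search is needed.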

arXiv.org e-Print Archive

CiteSeerX

The Bounded Edge Coloring Problem and Offline Crossbar Scheduling

Author: Turner Jonathan
Publication date: 01/12/2015

This paper introduces a variant of the classical edge coloring problem in graphs that can be applied to an offline scheduling problem for crossbar switches. We show that the problem is NP-complete, develop three lower bounds on the optimal solution value, and evaluate the performance of several approximation algorithms, both analytically and experimentally. We show how to approximate an optimal solution with a worst-case performance ratio of $3/2$, and our experimental results demonstrate that the best algorithms produce results that very closely track a lower bound.

arXiv.org e-Print Archive

Washington University St. Louis: Open Scholarship