155 research outputs found

Towards Work-Efficient Parallel Parameterized Algorithms

Author: J Cheetham
J Chen
L Cai
M Cesati
M Elberfeld
R Downey
R Niedermeier
RM Karp
Y Han
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/02/2019

Parallel parameterized complexity theory studies how fixed-parameter tractable (fpt) problems can be solved in parallel. Previous theoretical work focused on parallel algorithms that are very fast in principle, but did not take into account that when we only have a small number of processors (between 2 and, say, 1024), it is more important that the parallel algorithms are work-efficient. In the present paper we investigate how work-efficient fpt algorithms can be designed. We review standard methods from fpt theory, like kernelization, search trees, and interleaving, and prove trade-offs for them between work efficiency and runtime improvements. This results in a toolbox for developing work-efficient parallel fpt algorithms.

Comment: Prior full version of the paper that will appear in Proceedings of the 13th International Conference and Workshops on Algorithms and Computation (WALCOM 2019), February 27 - March 02, 2019, Guwahati, India. The final authenticated version is available online at https://doi.org/10.1007/978-3-030-10564-8_2
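As an illustration of two of the standard fpt methods named in the abstract (kernelization and search trees), here is a minimal sequential sketch for Vertex Cover. The choice of Vertex Cover, the Buss-style kernel, and the function names `buss_kernel`, `search_tree`, and `vertex_cover` are assumptions made for this example; they are not taken from the paper.

```python
def buss_kernel(edges, k):
    """Buss-style kernelization (illustrative): a vertex of degree > k
    must belong to every vertex cover of size <= k."""
    edges = {frozenset(e) for e in edges}  # assumes a simple graph, no self-loops
    forced = set()
    changed = True
    while changed and k >= 0:
        changed = False
        degree = {}
        for e in edges:
            for v in e:
                degree[v] = degree.get(v, 0) + 1
        for v, d in degree.items():
            if d > k:
                forced.add(v)
                edges = {e for e in edges if v not in e}
                k -= 1
                changed = True
                break
    # With all degrees <= k, k vertices can cover at most k * k edges.
    if k < 0 or len(edges) > k * k:
        return None
    return edges, k, forced


def search_tree(edges, k):
    """Bounded search tree: one endpoint of any edge must be in the cover."""
    if not edges:
        return set()
    if k == 0:
        return None
    u, v = tuple(next(iter(edges)))
    for pick in (u, v):  # the two branches are independent of each other
        rest = {e for e in edges if pick not in e}
        sub = search_tree(rest, k - 1)
        if sub is not None:
            return sub | {pick}
    return None


def vertex_cover(edges, k):
    """Return a vertex cover of size <= k, or None if none exists."""
    kernel = buss_kernel(edges, k)
    if kernel is None:
        return None
    reduced, k_left, forced = kernel
    sub = search_tree(reduced, k_left)
    return None if sub is None else forced | sub
```

The two recursive branches in `search_tree` do not depend on each other, so they are a natural place to split work across processors; how to do that without wasting total work is the kind of trade-off the abstract refers to.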

arXiv.org e-Print Archive

Crossref

Realizing degree sequences in parallel

Author: Arikati S.
Maheshwari A.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1994

A sequence $d$ of integers is a degree sequence if there exists a (simple) graph $G$ such that the components of $d$ are equal to the degrees of the vertices of $G$. The graph $G$ is said to be a realization of $d$. We provide an efficient parallel algorithm to realize $d$. Before our result, it was not known if the problem of realizing $d$ is in NC.
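To make the notion of a realization concrete, the following is a minimal sketch of the classical sequential Havel-Hakimi procedure. It is not the parallel algorithm of the paper; the function name `havel_hakimi` and the edge-list output format are assumptions made for this example.

```python
def havel_hakimi(d):
    """Return an edge list of a simple graph whose vertex i has degree d[i],
    or None if d is not a degree sequence (classical sequential method)."""
    if any(x < 0 for x in d):
        return None
    remaining = [[deg, v] for v, deg in enumerate(d)]  # [residual degree, vertex]
    edges = []
    while True:
        remaining.sort(reverse=True)  # largest residual degree first
        if not remaining or remaining[0][0] == 0:
            return edges  # every residual degree is zero: done
        deg, v = remaining.pop(0)
        if deg > len(remaining):
            return None  # not enough other vertices to connect to
        # Connect v to the deg vertices of largest residual degree.
        for item in remaining[:deg]:
            if item[0] == 0:
                return None
            item[0] -= 1
            edges.append((v, item[1]))
```

For example, `havel_hakimi([3, 3, 2, 2, 2])` returns six edges on vertices 0 to 4, while `havel_hakimi([3, 3, 1, 1])` returns `None` because no simple graph has those degrees.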

Carleton University's Institutional Repository

MPG.PuRe

Fast distributed almost stable marriages

Author: Ostrovsky Rafail
Rosenbaum Will
Publication date: 02/04/2015

In their seminal work on the Stable Marriage Problem, Gale and Shapley describe an algorithm which finds a stable matching in $O(n^2)$ communication rounds. Their algorithm has a natural interpretation as a distributed algorithm where each player is represented by a single processor. In this distributed model, Floreen, Kaski, Polishchuk, and Suomela recently showed that for bounded preference lists, terminating the Gale-Shapley algorithm after a constant number of rounds results in an almost stable matching. In this paper, we describe a new deterministic distributed algorithm which finds an almost stable matching in $O(\log^5 n)$ communication rounds for arbitrary preferences. We also present a faster randomized variant which requires $O(\log^2 n)$ rounds. This run-time can be improved to $O(1)$ rounds for "almost regular" (and in particular complete) preferences. To our knowledge, these are the first sub-polynomial round distributed algorithms for any variant of the stable marriage problem with unbounded preferences.

Comment: Various improvements in version 2: algorithms for general (not just "almost regular") preferences; deterministic variant of the algorithm; streamlined proof of approximation guarantee
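The round-based view of the classical Gale-Shapley algorithm mentioned in the abstract can be sketched as follows. This is the textbook algorithm with simultaneous proposals, not the new algorithm of the paper. The function name `gale_shapley_rounds`, the `max_rounds` truncation parameter, and the assumption of complete preference lists over `n` men and `n` women (indexed `0..n-1`) are choices made for this example.

```python
def gale_shapley_rounds(men_prefs, women_prefs, max_rounds=None):
    """Synchronous Gale-Shapley: in each round every free man proposes to the
    next woman on his list, and every woman keeps her best suitor so far.
    Returns (fiance, rounds), where fiance[w] is the man held by woman w
    (None if she is unmatched, which can only happen when truncated)."""
    n = len(men_prefs)
    rank = [{m: r for r, m in enumerate(prefs)} for prefs in women_prefs]
    next_choice = [0] * n  # index of the next woman each man will propose to
    fiance = [None] * n
    free = set(range(n))
    rounds = 0
    while free and (max_rounds is None or rounds < max_rounds):
        rounds += 1
        proposals = {}
        for m in free:
            w = men_prefs[m][next_choice[m]]
            next_choice[m] += 1
            proposals.setdefault(w, []).append(m)
        for w, suitors in proposals.items():
            if fiance[w] is not None:
                suitors.append(fiance[w])  # current partner competes too
            best = min(suitors, key=lambda m: rank[w][m])
            for m in suitors:
                if m != best:
                    free.add(m)  # rejected men become (or stay) free
            free.discard(best)
            fiance[w] = best
    return fiance, rounds
```

Passing a small `max_rounds` mimics the truncation idea attributed to Floreen et al. in the abstract: the returned matching may then be partial and is not guaranteed to be stable.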

arXiv.org e-Print Archive

CiteSeerX

Distributed Maximum Matching in Bounded Degree Graphs

Author: Even Guy
Medina Moti
Ron Dana
Publication date: 11/11/2014

We present deterministic distributed algorithms for computing approximate maximum cardinality matchings and approximate maximum weight matchings. Our algorithm for the unweighted case computes a matching whose size is at least $(1-\epsilon)$ times the optimal in $\Delta^{O(1/\epsilon)} + O\left(\frac{1}{\epsilon^2}\right)\cdot\log^*(n)$ rounds, where $n$ is the number of vertices in the graph and $\Delta$ is the maximum degree. Our algorithm for the edge-weighted case computes a matching whose weight is at least $(1-\epsilon)$ times the optimal in $\log(\min\{1/w_{\min}, n/\epsilon\})^{O(1/\epsilon)}\cdot(\Delta^{O(1/\epsilon)}+\log^*(n))$ rounds for edge-weights in $[w_{\min},1]$. The best previous algorithms for both the unweighted case and the weighted case are by Lotker, Patt-Shamir, and Pettie (SPAA 2008). For the unweighted case they give a randomized $(1-\epsilon)$-approximation algorithm that runs in $O(\log(n)/\epsilon^3)$ rounds. For the weighted case they give a randomized $(1/2-\epsilon)$-approximation algorithm that runs in $O(\log(\epsilon^{-1})\cdot\log(n))$ rounds. Hence, our results improve on the previous ones when the parameters $\Delta$, $\epsilon$ and $w_{\min}$ are constants (where we reduce the number of rounds from $O(\log(n))$ to $O(\log^*(n))$), and more generally when $\Delta$, $1/\epsilon$ and $1/w_{\min}$ are sufficiently slowly increasing functions of $n$. Moreover, our algorithms are deterministic rather than randomized.

Comment: arXiv admin note: substantial text overlap with arXiv:1402.379
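To give a feel for the gap between $O(\log(n))$ and $O(\log^*(n))$ quoted in the abstract, the iterated logarithm can be computed as in this small sketch. The function name `log_star` and the choice of base 2 are assumptions made for this example.

```python
import math


def log_star(n):
    """Iterated logarithm (base 2): how many times log2 must be applied
    to n before the value drops to at most 1."""
    count = 0
    x = n
    while x > 1:
        x = math.log2(x)
        count += 1
    return count
```

For instance, `log_star(65536)` is 4 whereas `math.log2(65536)` is 16, so for constant $\Delta$ and $\epsilon$ a $\log^*(n)$ term is effectively a small constant for any practical input size.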

arXiv.org e-Print Archive

CiteSeerX

Crossref

Extending the Nested Parallel Model to the Nested Dataflow Model with Provably Efficient Schedulers

Author: Dinh David
Simhadri Harsha Vardhan
Tang Yuan
Publication date: 14/02/2016

The nested parallel (a.k.a. fork-join) model is widely used for writing parallel programs. However, the two composition constructs, i.e. "$\parallel$" (parallel) and "$;$" (serial), are insufficient in expressing "partial dependencies" or "partial parallelism" in a program. We propose a new dataflow composition construct "$\leadsto$" to express partial dependencies in algorithms in a processor- and cache-oblivious way, thus extending the Nested Parallel (NP) model to the Nested Dataflow (ND) model. We redesign several divide-and-conquer algorithms ranging from dense linear algebra to dynamic-programming in the ND model and prove that they all have optimal span while retaining optimal cache complexity. We propose the design of runtime schedulers that map ND programs to multicore processors with multiple levels of possibly shared caches (i.e., Parallel Memory Hierarchies) and provide theoretical guarantees on their ability to preserve locality and load balance. For this, we adapt space-bounded (SB) schedulers for the ND model. We show that our algorithms have increased "parallelizability" in the ND model, and that SB schedulers can use the extra parallelizability to achieve asymptotically optimal bounds on cache misses and running time on a greater number of processors than in the NP model. The running time for the algorithms in this paper is $O\left(\frac{\sum_{i=0}^{h-1} Q^{*}(\mathsf{t};\sigma\cdot M_i)\cdot C_i}{p}\right)$, where $Q^{*}$ is the cache complexity of task $\mathsf{t}$, $C_i$ is the cost of a cache miss at the level-$i$ cache, which is of size $M_i$, $\sigma\in(0,1)$ is a constant, and $p$ is the number of processors in an $h$-level cache hierarchy.

arXiv.org e-Print Archive

Crossref

Distributed Symmetry Breaking in Hypergraphs

Author: A. Ephremides
A.D. Sarma
C. Avin
D. Peleg
D.G. Harris
D.P. Dubhashi
F. Dai
H. Balakrishnan
I. Chlamtac
J.A. Garay
M. Ghaffari
N. Linial
R. Thurimella
S. Kutten
T. Luczak
Y. Métivier
Publication date: 01/01/2014

Fundamental local symmetry breaking problems such as Maximal Independent Set (MIS) and coloring have been recognized as important by the community, and studied extensively in (standard) graphs. In particular, fast (i.e., logarithmic run time) randomized algorithms are well-established for MIS and $(\Delta+1)$-coloring in both the LOCAL and CONGEST distributed computing models. On the other hand, comparatively much less is known about the complexity of distributed symmetry breaking in hypergraphs. In particular, a key question is whether a fast (randomized) algorithm for MIS exists for hypergraphs. In this paper, we study the distributed complexity of symmetry breaking in hypergraphs by presenting distributed randomized algorithms for a variety of fundamental problems under a natural distributed computing model for hypergraphs. We first show that MIS in hypergraphs (of arbitrary dimension) can be solved in $O(\log^2 n)$ rounds (where $n$ is the number of nodes of the hypergraph) in the LOCAL model. We then present a key result of this paper: an $O(\Delta^{\epsilon}\,\text{polylog}(n))$-round hypergraph MIS algorithm in the CONGEST model, where $\Delta$ is the maximum node degree of the hypergraph and $\epsilon > 0$ is any arbitrarily small constant. To demonstrate the usefulness of hypergraph MIS, we present applications of our hypergraph algorithm to solving problems in (standard) graphs. In particular, the hypergraph MIS yields fast distributed algorithms for the balanced minimal dominating set problem (left open in Harris et al. [ICALP 2013]) and the minimal connected dominating set problem. We also present distributed algorithms for coloring, maximal matching, and maximal clique in hypergraphs.

Comment: Changes from the previous version: More references added
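For orientation, the kind of fast randomized MIS algorithm that the abstract calls well-established for standard graphs can be simulated centrally as below. This is a Luby-style sketch on ordinary graphs, not the hypergraph algorithm of the paper. The function name `luby_mis`, the adjacency-dictionary input (assumed symmetric), and the `seed` parameter are assumptions made for this example.

```python
import random


def luby_mis(adj, seed=0):
    """Round-by-round simulation of a Luby-style randomized MIS algorithm.
    adj maps each vertex to a list of its neighbours (symmetric).
    Returns (mis, rounds)."""
    rng = random.Random(seed)
    active = set(adj)
    mis = set()
    rounds = 0
    while active:
        rounds += 1
        # Every active vertex draws a random priority.
        priority = {v: rng.random() for v in active}
        # A vertex joins the MIS if it beats all of its active neighbours.
        winners = {
            v for v in active
            if all(priority[v] < priority[u] for u in adj[v] if u in active)
        }
        mis |= winners
        # Winners and their active neighbours leave the computation.
        removed = set(winners)
        for v in winners:
            removed.update(u for u in adj[v] if u in active)
        active -= removed
    return mis, rounds
```

Each vertex only compares priorities with its direct neighbours in a round, which is why this style of algorithm fits message-passing models such as LOCAL and CONGEST.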

arXiv.org e-Print Archive

Crossref