Search CORE

2,086 research outputs found

Concurrent Disjoint Set Union

Author: Jayanti Siddhartha V.
Tarjan Robert E.
Publication venue
Publication date: 02/03/2020
Field of study

We develop and analyze concurrent algorithms for the disjoint set union (union-find) problem in the shared memory, asynchronous multiprocessor model of computation, with CAS (compare and swap) or DCAS (double compare and swap) as the synchronization primitive. We give a deterministic bounded wait-free algorithm that uses DCAS and has a total work bound of

O(m \cdot (\log(np/m + 1) + \alpha(n, m/(np)))

for a problem with

n

elements and

m

operations solved by

p

processes, where

\alpha

is a functional inverse of Ackermann's function. We give two randomized algorithms that use only CAS and have the same work bound in expectation. The analysis of the second randomized algorithm is valid even if the scheduler is adversarial. Our DCAS and randomized algorithms take

O(\log n)

steps per operation, worst-case for the DCAS algorithm, high-probability for the randomized algorithms. Our work and step bounds grow only logarithmically with

p

, making our algorithms truly scalable. We prove that for a class of symmetric algorithms that includes ours, no better step or work bound is possible.Comment: 40 pages, combines ideas in two previous PODC paper

arXiv.org e-Print Archive

DSpace@MIT

On Brambles, Grid-Like Minors, and Parameterized Intractability of Monadic Second-Order Logic

Author: Kreutzer Stephan
Tazari Siamak
Publication venue
Publication date: 01/01/2009
Field of study

Brambles were introduced as the dual notion to treewidth, one of the most central concepts of the graph minor theory of Robertson and Seymour. Recently, Grohe and Marx showed that there are graphs G, in which every bramble of order larger than the square root of the treewidth is of exponential size in |G|. On the positive side, they show the existence of polynomial-sized brambles of the order of the square root of the treewidth, up to log factors. We provide the first polynomial time algorithm to construct a bramble in general graphs and achieve this bound, up to log-factors. We use this algorithm to construct grid-like minors, a replacement structure for grid-minors recently introduced by Reed and Wood, in polynomial time. Using the grid-like minors, we introduce the notion of a perfect bramble and an algorithm to find one in polynomial time. Perfect brambles are brambles with a particularly simple structure and they also provide us with a subgraph that has bounded degree and still large treewidth; we use them to obtain a meta-theorem on deciding certain parameterized subgraph-closed problems on general graphs in time singly exponential in the parameter. The second part of our work deals with providing a lower bound to Courcelle's famous theorem, stating that every graph property that can be expressed by a sentence in monadic second-order logic (MSO), can be decided by a linear time algorithm on classes of graphs of bounded treewidth. Using our results from the first part of our work we establish a strong lower bound for tractability of MSO on classes of colored graphs

arXiv.org e-Print Archive

CiteSeerX

Crossref

Oxford University Research Archive

Tree Contraction, Connected Components, Minimum Spanning Trees: a GPU Path to Vertex Fitting

Author: Hobson PR
Lopes RHC
Reid ID
Publication venue: Verlag Deutsches Elektronen-Synchrotron
Publication date: 01/01/2014
Field of study

Standard parallel computing operations are considered in the context of algorithms for solving 3D graph problems which have applications, e.g., in vertex finding in HEP. Exploiting GPUs for tree-accumulation and graph algorithms is challenging: GPUs offer extreme computational power and high memory-access bandwidth, combined with a model of fine-grained parallelism perhaps not suiting the irregular distribution of linked representations of graph data structures. Achieving data-race free computations may demand serialization through atomic transactions, inevitably producing poor parallel performance. A Minimum Spanning Tree algorithm for GPUs is presented, its implementation discussed, and its efficiency evaluated on GPU and multicore architectures

DESY Publication Database

DESY

Brunel University Research Archive

Distributed Connectivity Decomposition

Author: Censor-Hillel Keren
Ghaffari Mohsen
Kuhn Fabian
Publication venue
Publication date: 21/11/2013
Field of study

We present time-efficient distributed algorithms for decomposing graphs with large edge or vertex connectivity into multiple spanning or dominating trees, respectively. As their primary applications, these decompositions allow us to achieve information flow with size close to the connectivity by parallelizing it along the trees. More specifically, our distributed decomposition algorithms are as follows: (I) A decomposition of each undirected graph with vertex-connectivity

k

into (fractionally) vertex-disjoint weighted dominating trees with total weight

\Omega(\frac{k}{\log n})

, in

\widetilde{O}(D+\sqrt{n})

rounds. (II) A decomposition of each undirected graph with edge-connectivity

\lambda

into (fractionally) edge-disjoint weighted spanning trees with total weight

\lceil\frac{\lambda-1}{2}\rceil(1-\varepsilon)

, in

\widetilde{O}(D+\sqrt{n\lambda})

rounds. We also show round complexity lower bounds of

\tilde{\Omega}(D+\sqrt{\frac{n}{k}})

and

\tilde{\Omega}(D+\sqrt{\frac{n}{\lambda}})

for the above two decompositions, using techniques of [Das Sarma et al., STOC'11]. Moreover, our vertex-connectivity decomposition extends to centralized algorithms and improves the time complexity of [Censor-Hillel et al., SODA'14] from

O(n^3)

to near-optimal

\tilde{O}(m)

. As corollaries, we also get distributed oblivious routing broadcast with

O(1)

-competitive edge-congestion and

O(\log n)

-competitive vertex-congestion. Furthermore, the vertex connectivity decomposition leads to near-time-optimal

O(\log n)

-approximation of vertex connectivity: centralized

\widetilde{O}(m)

and distributed

\tilde{O}(D+\sqrt{n})

. The former moves toward the 1974 conjecture of Aho, Hopcroft, and Ullman postulating an

O(m)

centralized exact algorithm while the latter is the first distributed vertex connectivity approximation

arXiv.org e-Print Archive

CiteSeerX

Fast evaluation of union-intersection expressions

Author: Bille Philip
Pagh Anna
Pagh Rasmus
Publication venue
Publication date: 01/01/2007
Field of study

We show how to represent sets in a linear space data structure such that expressions involving unions and intersections of sets can be computed in a worst-case efficient way. This problem has applications in e.g. information retrieval and database systems. We mainly consider the RAM model of computation, and sets of machine words, but also state our results in the I/O model. On a RAM with word size

w

, a special case of our result is that the intersection of

m

(preprocessed) sets, containing

n

elements in total, can be computed in expected time

O(n (\log w)^2 / w + km)

, where

k

is the number of elements in the intersection. If the first of the two terms dominates, this is a factor

w^{1-o(1)}

faster than the standard solution of merging sorted lists. We show a cell probe lower bound of time

\Omega(n/(w m \log m)+ (1-\tfrac{\log k}{w}) k)

, meaning that our upper bound is nearly optimal for small

m

. Our algorithm uses a novel combination of approximate set representations and word-level parallelism

arXiv.org e-Print Archive

CiteSeerX

The IT University of Copenhagen's Repository

LIPIcs

Author: Alistarh Dan-Adrian
Fedorov Alexander
Koval Nikita
Publication venue: Schloss Dagstuhl - Leibniz-Zentrum für Informatik
Publication date: 14/11/2019
Field of study

Union-Find (or Disjoint-Set Union) is one of the fundamental problems in computer science; it has been well-studied from both theoretical and practical perspectives in the sequential case. Recently, there has been mounting interest in analyzing this problem in the concurrent scenario, and several asymptotically-efficient algorithms have been proposed. Yet, to date, there is very little known about the practical performance of concurrent Union-Find. This work addresses this gap. We evaluate and analyze the performance of several concurrent Union-Find algorithms and optimization strategies across a wide range of platforms (Intel, AMD, and ARM) and workloads (social, random, and road networks, as well as integrations into more complex algorithms). We first observe that, due to the limited computational cost, the number of induced cache misses is the critical determining factor for the performance of existing algorithms. We introduce new techniques to reduce this cost by storing node priorities implicitly and by using plain reads and writes in a way that does not affect the correctness of the algorithms. Finally, we show that Union-Find implementations are an interesting application for Transactional Memory (TM): one of the fastest algorithm variants we discovered is a sequential one that uses coarse-grained locking with the lock elision optimization to reduce synchronization cost and increase scalability

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

IST Austria: PubRep (Institute of Science and Technology)

Liveness of Randomised Parameterised Systems under Arbitrary Schedulers (Technical Report)

Author: Lin Anthony W.
Ruemmer Philipp
Publication venue
Publication date: 01/01/2016
Field of study

We consider the problem of verifying liveness for systems with a finite, but unbounded, number of processes, commonly known as parameterised systems. Typical examples of such systems include distributed protocols (e.g. for the dining philosopher problem). Unlike the case of verifying safety, proving liveness is still considered extremely challenging, especially in the presence of randomness in the system. In this paper we consider liveness under arbitrary (including unfair) schedulers, which is often considered a desirable property in the literature of self-stabilising systems. We introduce an automatic method of proving liveness for randomised parameterised systems under arbitrary schedulers. Viewing liveness as a two-player reachability game (between Scheduler and Process), our method is a CEGAR approach that synthesises a progress relation for Process that can be symbolically represented as a finite-state automaton. The method is incremental and exploits both Angluin-style L*-learning and SAT-solvers. Our experiments show that our algorithm is able to prove liveness automatically for well-known randomised distributed protocols, including Lehmann-Rabin Randomised Dining Philosopher Protocol and randomised self-stabilising protocols (such as the Israeli-Jalfon Protocol). To the best of our knowledge, this is the first fully-automatic method that can prove liveness for randomised protocols.Comment: Full version of CAV'16 pape

arXiv.org e-Print Archive

Crossref

Publikationer från Uppsala Universitet

Oxford University Research Archive

Digitala Vetenskapliga Arkivet - Academic Archive On-line