Search CORE

16,010 research outputs found

A distributed algorithm to find k-dominating sets

Author: Bar-Ilan
Barbosa
Garay
Garey
Kutten
Lucia D Penso
Peleg
Valmir C Barbosa
Wittmann
Publication venue: 'Elsevier BV'
Publication date: 22/09/2003

We consider a connected undirected graph $G(n,m)$ with $n$ nodes and $m$ edges. A $k$-dominating set $D$ in $G$ is a set of nodes having the property that every node in $G$ is at most $k$ edges away from at least one node in $D$. Finding a $k$-dominating set of minimum size is NP-hard. We give a new synchronous distributed algorithm to find a $k$-dominating set in $G$ of size no greater than $\lfloor n/(k+1)\rfloor$. Our algorithm requires $O(k\log^*n)$ time and $O(m\log k+n\log k\log^*n)$ messages to run. It has the same time complexity as the best currently known algorithm, but improves on that algorithm's message complexity and is, in addition, conceptually simpler.

Comment: To appear in Discrete Applied Mathematics

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Crossref

A Scalable Asynchronous Distributed Algorithm for Topic Modeling

Author: Asuncion A.
Asuncion A.
Cormen T. H.
Gonzalez J. E.
Snyder P.
Yan F.
Publication date: 16/12/2014

Learning meaningful topic models from massive document collections that contain millions of documents and billions of tokens is challenging for two reasons. First, one needs to deal with a large number of topics (typically on the order of thousands). Second, one needs a scalable and efficient way of distributing the computation across multiple machines. In this paper we present a novel algorithm, F+Nomad LDA, which simultaneously tackles both of these problems. To handle a large number of topics we use an appropriately modified Fenwick tree. This data structure allows us to sample from a multinomial distribution over $T$ items in $O(\log T)$ time. Moreover, when topic counts change, the data structure can be updated in $O(\log T)$ time. To distribute the computation across multiple processors we present a novel asynchronous framework inspired by the Nomad algorithm of Yun et al. (2013). We show that F+Nomad LDA significantly outperforms the state of the art on massive problems that involve millions of documents, billions of words, and thousands of topics.
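The two $O(\log T)$ operations mentioned in the abstract can be illustrated with a standard Fenwick (binary indexed) tree. This is a sketch under assumptions: it uses the textbook structure rather than the modified tree the paper describes, and the class name `FenwickSampler` and its methods are hypothetical names chosen for this example.

```python
import random


class FenwickSampler:
    """Sample index i with probability proportional to a non-negative weight."""

    def __init__(self, weights):
        self.n = len(weights)          # assumes at least one item
        self.tree = [0] * (self.n + 1)  # 1-based Fenwick array
        for i, w in enumerate(weights):
            self.update(i, w)
        # Largest power of two not exceeding n, used by the descent in find().
        self.top = 1
        while self.top * 2 <= self.n:
            self.top *= 2

    def update(self, i, delta):
        """Add delta to the weight of item i in O(log T)."""
        i += 1
        while i <= self.n:
            self.tree[i] += delta
            i += i & (-i)

    def total(self):
        """Sum of all weights in O(log T)."""
        s, i = 0, self.n
        while i > 0:
            s += self.tree[i]
            i -= i & (-i)
        return s

    def find(self, u):
        """Smallest 0-based index whose inclusive prefix sum exceeds u."""
        pos, step = 0, self.top
        while step > 0:
            nxt = pos + step
            if nxt <= self.n and self.tree[nxt] <= u:
                pos = nxt
                u -= self.tree[nxt]
            step //= 2
        return min(pos, self.n - 1)  # clamp guards against float round-off

    def sample(self, rng=random):
        """Draw one index in O(log T)."""
        return self.find(rng.random() * self.total())


# Usage: four topics with counts 1, 2, 3, 4, then one count is increased.
sampler = FenwickSampler([1, 2, 3, 4])
sampler.update(0, 9)  # topic 0 now has count 10
print(sampler.total(), sampler.sample(random.Random(0)))
```

Sampling descends the implicit tree once, and a count change touches one node per level, which is why both operations cost $O(\log T)$ instead of the $O(T)$ needed to rebuild a cumulative table.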

arXiv.org e-Print Archive

CiteSeerX

Crossref

An Improved Distributed Algorithm for Maximal Independent Set

Author: Ghaffari Mohsen
Publication date: 12/07/2015

The Maximal Independent Set (MIS) problem is one of the basics in the study of locality in distributed graph algorithms. This paper presents an extremely simple randomized algorithm providing a near-optimal local complexity for this problem, which incidentally, when combined with some recent techniques, also leads to a near-optimal global complexity. Classical algorithms of Luby [STOC'85] and Alon, Babai and Itai [JALG'86] provide the global complexity guarantee that, with high probability, all nodes terminate after $O(\log n)$ rounds. In contrast, our initial focus is on the local complexity, and our main contribution is to provide a very simple algorithm guaranteeing that each particular node $v$ terminates after $O(\log \mathsf{deg}(v)+\log 1/\epsilon)$ rounds, with probability at least $1-\epsilon$. The guarantee holds even if the randomness outside the $2$-hop neighborhood of $v$ is determined adversarially. This degree-dependency is optimal, due to a lower bound of Kuhn, Moscibroda, and Wattenhofer [PODC'04]. Interestingly, this local complexity smoothly transitions to a global complexity: by adding techniques of Barenboim, Elkin, Pettie, and Schneider [FOCS'12, arXiv: 1202.1983v3], we get a randomized MIS algorithm with a high probability global complexity of $O(\log \Delta) + 2^{O(\sqrt{\log \log n})}$, where $\Delta$ denotes the maximum degree. This improves over the $O(\log^2 \Delta) + 2^{O(\sqrt{\log \log n})}$ result of Barenboim et al., and gets close to the $\Omega(\min\{\log \Delta, \sqrt{\log n}\})$ lower bound of Kuhn et al. Corollaries include improved algorithms for MIS in graphs of upper-bounded arboricity, or lower-bounded girth, for Ruling Sets, for MIS in the Local Computation Algorithms (LCA) model, and a faster distributed algorithm for the Lovász Local Lemma.
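For context on the round-based setting, the sketch below simulates the classical random-priority scheme in the style of Luby's algorithm, which the abstract cites as the $O(\log n)$ baseline. It is not the algorithm of this paper, whose details the abstract does not give. The function name `luby_mis`, the seeded generator, and the adjacency-dictionary representation are assumptions made for illustration.

```python
import random


def luby_mis(adj, seed=0):
    """Simulate synchronous rounds of a Luby-style randomized MIS algorithm.

    adj : dict mapping each node to an iterable of its neighbours (undirected)
    Returns (mis, rounds): the independent set found and the rounds used.
    """
    rng = random.Random(seed)
    active = set(adj)  # nodes that have not yet decided
    mis = set()
    rounds = 0
    while active:
        rounds += 1
        # Each active node draws a fresh random priority this round.
        priority = {v: rng.random() for v in active}
        # A node joins the MIS if it strictly beats all active neighbours.
        winners = {
            v for v in active
            if all(priority[v] < priority[u] for u in adj[v] if u in active)
        }
        mis |= winners
        # Winners and their active neighbours leave the computation.
        removed = set(winners)
        for v in winners:
            removed.update(u for u in adj[v] if u in active)
        active -= removed
    return mis, rounds


# Usage: a cycle on 6 nodes.
cycle = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
mis, rounds = luby_mis(cycle)
print(sorted(mis), rounds)
```

Two adjacent nodes can never both win a round, because each would need a strictly smaller priority than the other, so the output is always independent; the loop only ends once every node is in the set or adjacent to it, so the output is also maximal.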

arXiv.org e-Print Archive

Crossref

GRiDA: A green distributed algorithm for backbone networks

Author: Aruna Prem Bianzino
Chiaraviglio Luca
Mellia Marco
Publication venue: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 445 HOES LANE, PISCATAWAY, NJ 08855 USA
Publication date: 01/01/2011

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio della ricerca- Università di Roma La Sapienza

PORTO Publications Open Repository TOrino

A Distributed Algorithm for Directed Minimum-Weight Spanning Tree

Author: Fischer Orr
Oshman Rotem
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 33rd International Symposium on Distributed Computing (DISC 2019)
Publication date: 01/01/2019

Dagstuhl Research Online Publication Server

Computational Limits of A Distributed Algorithm For Smoothing Spline

Author: Cheng Guang
Shang Zuofeng
Publication date: 01/01/2017

In this paper, we explore the statistical versus computational trade-off to address a basic question in the application of a distributed algorithm: what is the minimal computational cost of obtaining statistical optimality? In the smoothing spline setup, we observe a phase transition phenomenon for the number of deployed machines, which serves as a simple proxy for computing cost. Specifically, a sharp upper bound for the number of machines is established: when the number is below this bound, statistical optimality (in terms of nonparametric estimation or testing) is achievable; otherwise, statistical optimality becomes impossible. These sharp bounds partly capture intrinsic computational limits of the distributed algorithm considered in this paper, and turn out to be fully determined by the smoothness of the regression function. As a side remark, we argue that sample splitting may be viewed as an alternative form of regularization, playing a role similar to that of the smoothing parameter.

Comment: To appear in Journal of Machine Learning Research

arXiv.org e-Print Archive

IUPUIScholarWorks