Search CORE

1,588 research outputs found

Geometry Helps to Compare Persistence Diagrams

Author: Kerber Michael
Morozov Dmitriy
Nigmetov Arnur
Publication venue
Publication date: 16/11/2015
Field of study

Exploiting geometric structure to improve the asymptotic complexity of discrete assignment problems is a well-studied subject. In contrast, the practical advantages of using geometry for such problems have not been explored. We implement geometric variants of the Hopcroft--Karp algorithm for bottleneck matching (based on previous work by Efrat el al.) and of the auction algorithm by Bertsekas for Wasserstein distance computation. Both implementations use k-d trees to replace a linear scan with a geometric proximity query. Our interest in this problem stems from the desire to compute distances between persistence diagrams, a problem that comes up frequently in topological data analysis. We show that our geometric matching algorithms lead to a substantial performance gain, both in running time and in memory consumption, over their purely combinatorial counterparts. Moreover, our implementation significantly outperforms the only other implementation available for comparing persistence diagrams.Comment: 20 pages, 10 figures; extended version of paper published in ALENEX 201

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

MPG.PuRe

Dynamic Graph Stream Algorithms in $o(n)$ Space

Author: Huang Zengfeng
Peng Pan
Publication venue
Publication date: 01/01/2016
Field of study

In this paper we study graph problems in dynamic streaming model, where the input is defined by a sequence of edge insertions and deletions. As many natural problems require

\Omega(n)

space, where

n

is the number of vertices, existing works mainly focused on designing

\tilde{O}(n)

space algorithms. Although sublinear in the number of edges for dense graphs, it could still be too large for many applications (e.g.

n

is huge or the graph is sparse). In this work, we give single-pass algorithms beating this space barrier for two classes of problems. We present

o(n)

space algorithms for estimating the number of connected components with additive error

\varepsilon n

and

(1+\varepsilon)

-approximating the weight of minimum spanning tree, for any small constant

\varepsilon>0

. The latter improves previous

\tilde{O}(n)

space algorithm given by Ahn et al. (SODA 2012) for connected graphs with bounded edge weights. We initiate the study of approximate graph property testing in the dynamic streaming model, where we want to distinguish graphs satisfying the property from graphs that are

\varepsilon

-far from having the property. We consider the problem of testing

k

-edge connectivity,

k

-vertex connectivity, cycle-freeness and bipartiteness (of planar graphs), for which, we provide algorithms using roughly

\tilde{O}(n^{1-\varepsilon})

space, which is

o(n)

for any constant

\varepsilon

. To complement our algorithms, we present

\Omega(n^{1-O(\varepsilon)})

space lower bounds for these problems, which show that such a dependence on

\varepsilon

is necessary.Comment: ICALP 201

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

UNSWorks

A deterministic near-linear time approximation scheme for geometric transportation

Author: Fox Kyle
Lu Jiashuai
Publication venue
Publication date: 07/11/2022
Field of study

Given a set of points

P = (P^+ \sqcup P^-) \subset \mathbb{R}^d

for some constant

d

and a supply function

\mu:P\to \mathbb{R}

such that

\mu(p) > 0~\forall p \in P^+

\mu(p) < 0~\forall p \in P^-

, and

\sum_{p\in P}{\mu(p)} = 0

, the geometric transportation problem asks one to find a transportation map

\tau: P^+\times P^-\to \mathbb{R}_{\ge 0}

such that

\sum_{q\in P^-}{\tau(p, q)} = \mu(p)~\forall p \in P^+

\sum_{p\in P^+}{\tau(p, q)} = -\mu(q)~ \forall q \in P^-

, and the weighted sum of Euclidean distances for the pairs

\sum_{(p,q)\in P^+\times P^-}\tau(p, q)\cdot ||q-p||_2

is minimized. We present the first deterministic algorithm that computes, in near-linear time, a transportation map whose cost is within a

(1 + \varepsilon)

factor of optimal. More precisely, our algorithm runs in

O(n\varepsilon^{-(d+2)}\log^5{n}\log{\log{n}})

time for any constant

\varepsilon > 0

. While a randomized

n\varepsilon^{-O(d)}\log^{O(d)}{n}

time algorithm was discovered in the last few years, all previously known deterministic

(1 + \varepsilon)

-approximation algorithms run in

\Omega(n^{3/2})

time. A similar situation existed for geometric bipartite matching, the special case of geometric transportation where all supplies are unit, until a deterministic

n\varepsilon^{-O(d)}\log^{O(d)}{n}

time

(1 + \varepsilon)

-approximation algorithm was presented at STOC 2022. Surprisingly, our result is not only a generalization of the bipartite matching one to arbitrary instances of geometric transportation, but it also reduces the running time for all previously known

(1 + \varepsilon)

-approximation algorithms, randomized or deterministic, even for geometric bipartite matching, by removing the dependence on the dimension

d

from the exponent in the running time's polylog.Comment: 23 page

arXiv.org e-Print Archive

Linear Programming in the Semi-streaming Model with Application to the Maximum Matching Problem

Author: A. McGregor
D.E.D. Vinkemeier
J. Edmonds
J. Feigenbaum
J. Feigenbaum
J.E. Hopcroft
K.J. Ahn
L.K. Fleischer
R. Preis
S. Eggert
S. Pettie
Y. Freund
Z. Füredi
Z. Füredi
Publication venue
Publication date: 01/01/2011
Field of study

In this paper, we study linear programming based approaches to the maximum matching problem in the semi-streaming model. The semi-streaming model has gained attention as a model for processing massive graphs as the importance of such graphs has increased. This is a model where edges are streamed-in in an adversarial order and we are allowed a space proportional to the number of vertices in a graph. In recent years, there has been several new results in this semi-streaming model. However broad techniques such as linear programming have not been adapted to this model. We present several techniques to adapt and optimize linear programming based approaches in the semi-streaming model with an application to the maximum matching problem. As a consequence, we improve (almost) all previous results on this problem, and also prove new results on interesting variants

arXiv.org e-Print Archive

CiteSeerX

Crossref

ScholarlyCommons@Penn

Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration

Author: Altschuler Jason
Rigollet Philippe
Weed Jonathan
Publication venue
Publication date: 07/02/2018
Field of study

Computing optimal transport distances such as the earth mover's distance is a fundamental problem in machine learning, statistics, and computer vision. Despite the recent introduction of several algorithms with good empirical performance, it is unknown whether general optimal transport distances can be approximated in near-linear time. This paper demonstrates that this ambitious goal is in fact achieved by Cuturi's Sinkhorn Distances. This result relies on a new analysis of Sinkhorn iteration, which also directly suggests a new greedy coordinate descent algorithm, Greenkhorn, with the same theoretical guarantees. Numerical simulations illustrate that Greenkhorn significantly outperforms the classical Sinkhorn algorithm in practice

arXiv.org e-Print Archive

DSpace@MIT

Target Assignment in Robotic Networks: Distance Optimality Guarantees and Hierarchical Strategies

Author: Chung Soon-Jo
Voulgaris Petros G.
Yu Jingjin
Publication venue
Publication date: 31/07/2014
Field of study

We study the problem of multi-robot target assignment to minimize the total distance traveled by the robots until they all reach an equal number of static targets. In the first half of the paper, we present a necessary and sufficient condition under which true distance optimality can be achieved for robots with limited communication and target-sensing ranges. Moreover, we provide an explicit, non-asymptotic formula for computing the number of robots needed to achieve distance optimality in terms of the robots' communication and target-sensing ranges with arbitrary guaranteed probabilities. The same bounds are also shown to be asymptotically tight. In the second half of the paper, we present suboptimal strategies for use when the number of robots cannot be chosen freely. Assuming first that all targets are known to all robots, we employ a hierarchical communication model in which robots communicate only with other robots in the same partitioned region. This hierarchical communication model leads to constant approximations of true distance-optimal solutions under mild assumptions. We then revisit the limited communication and sensing models. By combining simple rendezvous-based strategies with a hierarchical communication model, we obtain decentralized hierarchical strategies that achieve constant approximation ratios with respect to true distance optimality. Results of simulation show that the approximation ratio is as low as 1.4

arXiv.org e-Print Archive

CiteSeerX

Caltech Authors