580 research outputs found
Comparing Mean Field and Euclidean Matching Problems
Combinatorial optimization is a fertile testing ground for statistical
physics methods developed in the context of disordered systems, allowing one to
confront theoretical mean field predictions with actual properties of finite
dimensional systems. Our focus here is on minimum matching problems, because
they are computationally tractable while both frustrated and disordered. We
first study a mean field model taking the link lengths between points to be
independent random variables. For this model we find perfect agreement with the
results of a replica calculation. Then we study the case where the points to be
matched are placed at random in a d-dimensional Euclidean space. Using the mean
field model as an approximation to the Euclidean case, we show numerically that
the mean field predictions are very accurate even at low dimension, and that
the error due to the approximation is O(1/d^2). Furthermore, it is possible to
improve upon this approximation by including the effects of Euclidean
correlations among k link lengths. Using k=3 (3-link correlations such as the
triangle inequality), the resulting errors in the energy density are already
less than 0.5% at d>=2. However, we argue that the Euclidean model's 1/d series
expansion is beyond all orders in k of the expansion in k-link correlations.Comment: 11 pages, 1 figur
Approximating the Regular Graphic TSP in near linear time
We present a randomized approximation algorithm for computing traveling
salesperson tours in undirected regular graphs. Given an -vertex,
-regular graph, the algorithm computes a tour of length at most
, with high probability, in time. This improves upon a recent result by Vishnoi (\cite{Vishnoi12}, FOCS
2012) for the same problem, in terms of both approximation factor, and running
time. The key ingredient of our algorithm is a technique that uses
edge-coloring algorithms to sample a cycle cover with cycles with
high probability, in near linear time.
Additionally, we also give a deterministic
factor approximation algorithm
running in time .Comment: 12 page
Asymptotically Optimal Algorithms for Pickup and Delivery Problems with Application to Large-Scale Transportation Systems
The Stacker Crane Problem is NP-Hard and the best known approximation
algorithm only provides a 9/5 approximation ratio. The objective of this paper
is threefold. First, by embedding the problem within a stochastic framework, we
present a novel algorithm for the SCP that: (i) is asymptotically optimal,
i.e., it produces, almost surely, a solution approaching the optimal one as the
number of pickups/deliveries goes to infinity; and (ii) has computational
complexity O(n^{2+\eps}), where is the number of pickup/delivery pairs
and \eps is an arbitrarily small positive constant. Second, we asymptotically
characterize the length of the optimal SCP tour. Finally, we study a dynamic
version of the SCP, whereby pickup and delivery requests arrive according to a
Poisson process, and which serves as a model for large-scale demand-responsive
transport (DRT) systems. For such a dynamic counterpart of the SCP, we derive a
necessary and sufficient condition for the existence of stable vehicle routing
policies, which depends only on the workspace geometry, the stochastic
distributions of pickup and delivery points, the arrival rate of requests, and
the number of vehicles. Our results leverage a novel connection between the
Euclidean Bipartite Matching Problem and the theory of random permutations,
and, for the dynamic setting, exhibit novel features that are absent in
traditional spatially-distributed queueing systems.Comment: 27 pages, plus Appendix, 7 figures, extended version of paper being
submitted to IEEE Transactions of Automatic Contro
The random link approximation for the Euclidean traveling salesman problem
The traveling salesman problem (TSP) consists of finding the length of the
shortest closed tour visiting N ``cities''. We consider the Euclidean TSP where
the cities are distributed randomly and independently in a d-dimensional unit
hypercube. Working with periodic boundary conditions and inspired by a
remarkable universality in the kth nearest neighbor distribution, we find for
the average optimum tour length = beta_E(d) N^{1-1/d} [1+O(1/N)] with
beta_E(2) = 0.7120 +- 0.0002 and beta_E(3) = 0.6979 +- 0.0002. We then derive
analytical predictions for these quantities using the random link
approximation, where the lengths between cities are taken as independent random
variables. From the ``cavity'' equations developed by Krauth, Mezard and
Parisi, we calculate the associated random link values beta_RL(d). For d=1,2,3,
numerical results show that the random link approximation is a good one, with a
discrepancy of less than 2.1% between beta_E(d) and beta_RL(d). For large d, we
argue that the approximation is exact up to O(1/d^2) and give a conjecture for
beta_E(d), in terms of a power series in 1/d, specifying both leading and
subleading coefficients.Comment: 29 pages, 6 figures; formatting and typos correcte
Conditional Hardness of Earth Mover Distance
The Earth Mover Distance (EMD) between two sets of points A, B subseteq R^d with |A| = |B| is the minimum total Euclidean distance of any perfect matching between A and B. One of its generalizations is asymmetric EMD, which is the minimum total Euclidean distance of any matching of size |A| between sets of points A,B subseteq R^d with |A| <= |B|. The problems of computing EMD and asymmetric EMD are well-studied and have many applications in computer science, some of which also ask for the EMD-optimal matching itself. Unfortunately, all known algorithms require at least quadratic time to compute EMD exactly. Approximation algorithms with nearly linear time complexity in n are known (even for finding approximately optimal matchings), but suffer from exponential dependence on the dimension.
In this paper we show that significant improvements in exact and approximate algorithms for EMD would contradict conjectures in fine-grained complexity. In particular, we prove the following results:
- Under the Orthogonal Vectors Conjecture, there is some c>0 such that EMD in Omega(c^{log^* n}) dimensions cannot be computed in truly subquadratic time.
- Under the Hitting Set Conjecture, for every delta>0, no truly subquadratic time algorithm can find a (1 + 1/n^delta)-approximate EMD matching in omega(log n) dimensions.
- Under the Hitting Set Conjecture, for every eta = 1/omega(log n), no truly subquadratic time algorithm can find a (1 + eta)-approximate asymmetric EMD matching in omega(log n) dimensions
Conditional Hardness of Earth Mover Distance
The Earth Mover Distance (EMD) between two sets of points A, B subseteq R^d with |A| = |B| is the minimum total Euclidean distance of any perfect matching between A and B. One of its generalizations is asymmetric EMD, which is the minimum total Euclidean distance of any matching of size |A| between sets of points A,B subseteq R^d with |A| <= |B|. The problems of computing EMD and asymmetric EMD are well-studied and have many applications in computer science, some of which also ask for the EMD-optimal matching itself. Unfortunately, all known algorithms require at least quadratic time to compute EMD exactly. Approximation algorithms with nearly linear time complexity in n are known (even for finding approximately optimal matchings), but suffer from exponential dependence on the dimension.
In this paper we show that significant improvements in exact and approximate algorithms for EMD would contradict conjectures in fine-grained complexity. In particular, we prove the following results:
- Under the Orthogonal Vectors Conjecture, there is some c>0 such that EMD in Omega(c^{log^* n}) dimensions cannot be computed in truly subquadratic time.
- Under the Hitting Set Conjecture, for every delta>0, no truly subquadratic time algorithm can find a (1 + 1/n^delta)-approximate EMD matching in omega(log n) dimensions.
- Under the Hitting Set Conjecture, for every eta = 1/omega(log n), no truly subquadratic time algorithm can find a (1 + eta)-approximate asymmetric EMD matching in omega(log n) dimensions
Geometry Helps to Compare Persistence Diagrams
Exploiting geometric structure to improve the asymptotic complexity of
discrete assignment problems is a well-studied subject. In contrast, the
practical advantages of using geometry for such problems have not been
explored. We implement geometric variants of the Hopcroft--Karp algorithm for
bottleneck matching (based on previous work by Efrat el al.) and of the auction
algorithm by Bertsekas for Wasserstein distance computation. Both
implementations use k-d trees to replace a linear scan with a geometric
proximity query. Our interest in this problem stems from the desire to compute
distances between persistence diagrams, a problem that comes up frequently in
topological data analysis. We show that our geometric matching algorithms lead
to a substantial performance gain, both in running time and in memory
consumption, over their purely combinatorial counterparts. Moreover, our
implementation significantly outperforms the only other implementation
available for comparing persistence diagrams.Comment: 20 pages, 10 figures; extended version of paper published in ALENEX
201
Parallel Algorithms for Geometric Graph Problems
We give algorithms for geometric graph problems in the modern parallel models
inspired by MapReduce. For example, for the Minimum Spanning Tree (MST) problem
over a set of points in the two-dimensional space, our algorithm computes a
-approximate MST. Our algorithms work in a constant number of
rounds of communication, while using total space and communication proportional
to the size of the data (linear space and near linear time algorithms). In
contrast, for general graphs, achieving the same result for MST (or even
connectivity) remains a challenging open problem, despite drawing significant
attention in recent years.
We develop a general algorithmic framework that, besides MST, also applies to
Earth-Mover Distance (EMD) and the transportation cost problem. Our algorithmic
framework has implications beyond the MapReduce model. For example it yields a
new algorithm for computing EMD cost in the plane in near-linear time,
. We note that while recently Sharathkumar and Agarwal
developed a near-linear time algorithm for -approximating EMD,
our algorithm is fundamentally different, and, for example, also solves the
transportation (cost) problem, raised as an open question in their work.
Furthermore, our algorithm immediately gives a -approximation
algorithm with space in the streaming-with-sorting model with
passes. As such, it is tempting to conjecture that the
parallel models may also constitute a concrete playground in the quest for
efficient algorithms for EMD (and other similar problems) in the vanilla
streaming model, a well-known open problem
- …