20,095 research outputs found
Probabilistic Polynomials and Hamming Nearest Neighbors
We show how to compute any symmetric Boolean function on variables over
any field (as well as the integers) with a probabilistic polynomial of degree
and error at most . The degree
dependence on and is optimal, matching a lower bound of Razborov
(1987) and Smolensky (1987) for the MAJORITY function. The proof is
constructive: a low-degree polynomial can be efficiently sampled from the
distribution.
This polynomial construction is combined with other algebraic ideas to give
the first subquadratic time algorithm for computing a (worst-case) batch of
Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let
. Suppose we are given a database
of vectors in and a collection of query vectors
in the same dimension. For all , we wish to compute a
with minimum Hamming distance from . We solve this problem in randomized time. Hence, the problem is in "truly subquadratic"
time for dimensions, and in subquadratic time for . We apply the algorithm to computing pairs with maximum
inner product, closest pair in for vectors with bounded integer
entries, and pairs with maximum Jaccard coefficients.Comment: 16 pages. To appear in 56th Annual IEEE Symposium on Foundations of
Computer Science (FOCS 2015
Closest pair optimization on modern hardware
Master's Project (M.S.) University of Alaska Fairbanks, 2019In this project we examine the performance of several algorithms for finding the closest pair of points
out of a given set of points in a plane. We look at four algorithms, including brute force, recursive,
non-recursive, and a random expected linear time for numbers of points ranging from one hundred to
one billion. In our examination, we find that on average the non-recursive is the fastest, except for
limited cases of 100 points for the brute force, and 32 bit spaces for the random expected linear
Dominance Product and High-Dimensional Closest Pair under
Given a set of points in , the Closest Pair problem is
to find a pair of distinct points in at minimum distance. When is
constant, there are efficient algorithms that solve this problem, and fast
approximate solutions for general . However, obtaining an exact solution in
very high dimensions seems to be much less understood. We consider the
high-dimensional Closest Pair problem, where for some , and the underlying metric is .
We improve and simplify previous results for Closest Pair, showing
that it can be solved by a deterministic strongly-polynomial algorithm that
runs in time, and by a randomized algorithm that runs in
expected time, where is the time bound for computing the
{\em dominance product} for points in . That is a matrix ,
such that ; this is the
number of coordinates at which dominates . For integer coordinates
from some interval , we obtain an algorithm that runs in
time, where
is the exponent of multiplying an matrix by an
matrix.
We also give slightly better bounds for , by using more recent
rectangular matrix multiplication bounds. Computing the dominance product
itself is an important task, since it is applied in many algorithms as a major
black-box ingredient, such as algorithms for APBP (all pairs bottleneck paths),
and variants of APSP (all pairs shortest paths)
Distributed PCP Theorems for Hardness of Approximation in P
We present a new distributed model of probabilistically checkable proofs
(PCP). A satisfying assignment to a CNF formula is
shared between two parties, where Alice knows , Bob knows
, and both parties know . The goal is to have
Alice and Bob jointly write a PCP that satisfies , while
exchanging little or no information. Unfortunately, this model as-is does not
allow for nontrivial query complexity. Instead, we focus on a non-deterministic
variant, where the players are helped by Merlin, a third party who knows all of
.
Using our framework, we obtain, for the first time, PCP-like reductions from
the Strong Exponential Time Hypothesis (SETH) to approximation problems in P.
In particular, under SETH we show that there are no truly-subquadratic
approximation algorithms for Bichromatic Maximum Inner Product over
{0,1}-vectors, Bichromatic LCS Closest Pair over permutations, Approximate
Regular Expression Matching, and Diameter in Product Metric. All our
inapproximability factors are nearly-tight. In particular, for the first two
problems we obtain nearly-polynomial factors of ; only
-factor lower bounds (under SETH) were known before
Geographic Gossip: Efficient Averaging for Sensor Networks
Gossip algorithms for distributed computation are attractive due to their
simplicity, distributed nature, and robustness in noisy and uncertain
environments. However, using standard gossip algorithms can lead to a
significant waste in energy by repeatedly recirculating redundant information.
For realistic sensor network model topologies like grids and random geometric
graphs, the inefficiency of gossip schemes is related to the slow mixing times
of random walks on the communication graph. We propose and analyze an
alternative gossiping scheme that exploits geographic information. By utilizing
geographic routing combined with a simple resampling method, we demonstrate
substantial gains over previously proposed gossip protocols. For regular graphs
such as the ring or grid, our algorithm improves standard gossip by factors of
and respectively. For the more challenging case of random
geometric graphs, our algorithm computes the true average to accuracy
using radio
transmissions, which yields a factor improvement over
standard gossip algorithms. We illustrate these theoretical results with
experimental comparisons between our algorithm and standard methods as applied
to various classes of random fields.Comment: To appear, IEEE Transactions on Signal Processin
- …