Probabilistic Polynomials and Hamming Nearest Neighbors

Authors: Josh Alman, Ryan Williams
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 17/07/2015

We show how to compute any symmetric Boolean function on $n$ variables over any field (as well as the integers) with a probabilistic polynomial of degree $O(\sqrt{n \log(1/\epsilon)})$ and error at most $\epsilon$. The degree dependence on $n$ and $\epsilon$ is optimal, matching a lower bound of Razborov (1987) and Smolensky (1987) for the MAJORITY function. The proof is constructive: a low-degree polynomial can be efficiently sampled from the distribution. This polynomial construction is combined with other algebraic ideas to give the first subquadratic time algorithm for computing a (worst-case) batch of Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let $c(n) : \mathbb{N} \rightarrow \mathbb{N}$. Suppose we are given a database $D$ of $n$ vectors in $\{0,1\}^{c(n) \log n}$ and a collection of $n$ query vectors $Q$ in the same dimension. For all $u \in Q$, we wish to compute a $v \in D$ with minimum Hamming distance from $u$. We solve this problem in $n^{2-1/O(c(n) \log^2 c(n))}$ randomized time. Hence, the problem is in "truly subquadratic" time for $O(\log n)$ dimensions, and in subquadratic time for $d = o((\log^2 n)/(\log \log n)^2)$. We apply the algorithm to computing pairs with maximum inner product, closest pair in $\ell_1$ for vectors with bounded integer entries, and pairs with maximum Jaccard coefficients.

Comment: 16 pages. To appear in 56th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2015).
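
For orientation, the quadratic-time baseline for the batch Hamming nearest-neighbor problem described in this abstract might look like the sketch below. It is not the authors' algorithm (which relies on probabilistic polynomials); it only makes the problem statement concrete. The names `hamming` and `batch_nearest` and the toy data are assumptions introduced for illustration.

```python
from typing import List, Sequence


def hamming(u: Sequence[int], v: Sequence[int]) -> int:
    """Number of coordinates at which two equal-length 0/1 vectors differ."""
    return sum(a != b for a, b in zip(u, v))


def batch_nearest(D: List[Sequence[int]], Q: List[Sequence[int]]) -> List[int]:
    """For each query u in Q, return the index of a vector in D at minimum
    Hamming distance from u. Brute force: |Q| * |D| distance computations."""
    return [min(range(len(D)), key=lambda i: hamming(u, D[i])) for u in Q]


# Toy instance (hypothetical data).
D = [(0, 0, 0, 0), (1, 1, 0, 0), (1, 1, 1, 1)]
Q = [(1, 0, 0, 0), (1, 1, 1, 0)]
nearest = batch_nearest(D, Q)  # ties are broken toward the smallest index
```

With $n$ database vectors and $n$ queries in $d$ dimensions this takes on the order of $n^2 d$ steps, which is the quadratic behavior the abstract's bound improves on for small enough $c(n)$.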

arXiv.org e-Print Archive

Crossref

Closest pair optimization on modern hardware

Author: Jason Bright
Publication date: 01/05/2019

Master's Project (M.S.), University of Alaska Fairbanks, 2019.

In this project we examine the performance of several algorithms for finding the closest pair of points in a given set of points in the plane. We look at four algorithms: brute force, recursive, non-recursive, and a randomized expected-linear-time algorithm, for numbers of points ranging from one hundred to one billion. We find that on average the non-recursive algorithm is the fastest, except in limited cases: sets of 100 points, where brute force is faster, and 32-bit spaces, where the randomized expected-linear-time algorithm is faster.
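
To make two of the four algorithm families concrete, here is a minimal sketch of a brute-force and a recursive (divide-and-conquer) closest-pair routine. It is an illustration under assumptions, not the project's code: the function names and sample points are hypothetical, and this recursive variant re-sorts the middle strip at every level, so it runs in roughly $O(n \log^2 n)$ time rather than the optimal $O(n \log n)$.

```python
import math
from itertools import combinations


def closest_brute(points):
    """Smallest pairwise Euclidean distance, checking all pairs (needs >= 2 points)."""
    return min(math.dist(p, q) for p, q in combinations(points, 2))


def closest_recursive(points):
    """Divide-and-conquer closest pair distance (needs >= 2 points)."""
    pts = sorted(points)  # sort by x, then y

    def solve(lo, hi):
        # Handles the slice pts[lo:hi]; small slices fall back to brute force.
        if hi - lo <= 3:
            return closest_brute(pts[lo:hi])
        mid = (lo + hi) // 2
        mid_x = pts[mid][0]
        best = min(solve(lo, mid), solve(mid, hi))
        # Only points within `best` of the dividing line can form a closer pair.
        strip = sorted(
            (p for p in pts[lo:hi] if abs(p[0] - mid_x) < best),
            key=lambda p: p[1],
        )
        for i in range(len(strip)):
            for j in range(i + 1, len(strip)):
                if strip[j][1] - strip[i][1] >= best:
                    break  # later points are even farther away in y
                best = min(best, math.dist(strip[i], strip[j]))
        return best

    return solve(0, len(pts))


# Hypothetical sample: the closest pair is (5, 5) and (5, 6).
sample = [(0, 0), (5, 5), (1, 1), (9, 0), (5, 6)]
result = closest_recursive(sample)
```

Checking the recursive routine against the brute-force one on random inputs is a cheap way to validate it before timing either.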

ScholarWorks@UA

Dominance Product and High-Dimensional Closest Pair under $L_\infty$

Authors: Omer Gold, Micha Sharir
Publication date: 01/01/2017

Given a set $S$ of $n$ points in $\mathbb{R}^d$, the Closest Pair problem is to find a pair of distinct points in $S$ at minimum distance. When $d$ is constant, there are efficient algorithms that solve this problem, and fast approximate solutions for general $d$. However, obtaining an exact solution in very high dimensions seems to be much less understood. We consider the high-dimensional $L_\infty$ Closest Pair problem, where $d=n^r$ for some $r > 0$, and the underlying metric is $L_\infty$. We improve and simplify previous results for $L_\infty$ Closest Pair, showing that it can be solved by a deterministic strongly-polynomial algorithm that runs in $O(DP(n,d)\log n)$ time, and by a randomized algorithm that runs in $O(DP(n,d))$ expected time, where $DP(n,d)$ is the time bound for computing the dominance product for $n$ points in $\mathbb{R}^d$. The dominance product is a matrix $D$ such that $D[i,j] = \bigl| \{k \mid p_i[k] \leq p_j[k]\} \bigr|$; this is the number of coordinates at which $p_j$ dominates $p_i$. For integer coordinates from some interval $[-M, M]$, we obtain an algorithm that runs in $\tilde{O}\left(\min\{Mn^{\omega(1,r,1)},\, DP(n,d)\}\right)$ time, where $\omega(1,r,1)$ is the exponent of multiplying an $n \times n^r$ matrix by an $n^r \times n$ matrix. We also give slightly better bounds for $DP(n,d)$, by using more recent rectangular matrix multiplication bounds. Computing the dominance product itself is an important task, since it is applied in many algorithms as a major black-box ingredient, such as algorithms for APBP (all pairs bottleneck paths), and variants of APSP (all pairs shortest paths).
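
The definition of the dominance product can be made concrete with a direct, unoptimized computation. The sketch below is a naive $O(n^2 d)$ evaluation of $D[i,j]$ exactly as defined in the abstract; it is not the matrix-multiplication-based method the paper's bounds refer to. The function name and the sample points are assumptions made for illustration.

```python
def dominance_product(points):
    """Naive dominance product: D[i][j] counts the coordinates k
    with points[i][k] <= points[j][k]."""
    n = len(points)
    return [
        [sum(a <= b for a, b in zip(points[i], points[j])) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical sample of three points in dimension 3.
sample_points = [(1, 5, 2), (3, 4, 2), (0, 9, 9)]
D = dominance_product(sample_points)
# D == [[3, 2, 2],
#       [2, 3, 2],
#       [1, 1, 3]]
```

Each diagonal entry equals the dimension $d$, since every point dominates itself in all coordinates.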

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Distributed PCP Theorems for Hardness of Approximation in P

Authors: Amir Abboud, Aviad Rubinstein, Ryan Williams
Publication date: 2017

We present a new distributed model of probabilistically checkable proofs (PCP). A satisfying assignment $x \in \{0,1\}^n$ to a CNF formula $\varphi$ is shared between two parties, where Alice knows $x_1, \dots, x_{n/2}$, Bob knows $x_{n/2+1},\dots,x_n$, and both parties know $\varphi$. The goal is to have Alice and Bob jointly write a PCP that $x$ satisfies $\varphi$, while exchanging little or no information. Unfortunately, this model as-is does not allow for nontrivial query complexity. Instead, we focus on a non-deterministic variant, where the players are helped by Merlin, a third party who knows all of $x$. Using our framework, we obtain, for the first time, PCP-like reductions from the Strong Exponential Time Hypothesis (SETH) to approximation problems in P. In particular, under SETH we show that there are no truly-subquadratic approximation algorithms for Bichromatic Maximum Inner Product over $\{0,1\}$-vectors, Bichromatic LCS Closest Pair over permutations, Approximate Regular Expression Matching, and Diameter in Product Metric. All our inapproximability factors are nearly-tight. In particular, for the first two problems we obtain nearly-polynomial factors of $2^{(\log n)^{1-o(1)}}$; only $(1+o(1))$-factor lower bounds (under SETH) were known before.

arXiv.org e-Print Archive

Biblioteca Virtual del Patrimonio Bibliográfico (Virtual Library of Bibliographical Heritage)

Crossref

Geographic Gossip: Efficient Averaging for Sensor Networks

Authors: Alexandros G. Dimakis, Anand D. Sarwate, Martin J. Wainwright
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 25/09/2007

Gossip algorithms for distributed computation are attractive due to their simplicity, distributed nature, and robustness in noisy and uncertain environments. However, using standard gossip algorithms can lead to a significant waste in energy by repeatedly recirculating redundant information. For realistic sensor network model topologies like grids and random geometric graphs, the inefficiency of gossip schemes is related to the slow mixing times of random walks on the communication graph. We propose and analyze an alternative gossiping scheme that exploits geographic information. By utilizing geographic routing combined with a simple resampling method, we demonstrate substantial gains over previously proposed gossip protocols. For regular graphs such as the ring or grid, our algorithm improves standard gossip by factors of $n$ and $\sqrt{n}$ respectively. For the more challenging case of random geometric graphs, our algorithm computes the true average to accuracy $\epsilon$ using $O(\frac{n^{1.5}}{\sqrt{\log n}} \log \epsilon^{-1})$ radio transmissions, which yields a $\sqrt{\frac{n}{\log n}}$ factor improvement over standard gossip algorithms. We illustrate these theoretical results with experimental comparisons between our algorithm and standard methods as applied to various classes of random fields.

Comment: To appear, IEEE Transactions on Signal Processing.
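
As a point of reference, the standard nearest-neighbor gossip that this abstract compares against can be sketched on a ring as follows. This is a generic pairwise-averaging baseline, not the geographic gossip scheme proposed in the paper; the function name `ring_gossip`, the seed handling, and the sample values are assumptions made for illustration.

```python
import random


def ring_gossip(values, steps, seed=0):
    """Standard pairwise gossip on a ring: at each step a random node
    averages its value with a random one of its two ring neighbors."""
    rng = random.Random(seed)
    x = [float(v) for v in values]
    n = len(x)
    for _ in range(steps):
        i = rng.randrange(n)
        j = (i + rng.choice((-1, 1))) % n  # left or right neighbor
        x[i] = x[j] = (x[i] + x[j]) / 2.0  # pairwise average keeps the sum
    return x


# Hypothetical sensor readings on a ring of 8 nodes; the true average is 3.5.
initial = list(range(8))
final = ring_gossip(initial, steps=2000)
```

Each exchange preserves the sum of all values and never widens the gap between the largest and smallest value, so the nodes drift toward the true average. Because information only moves one hop per exchange, this takes many exchanges on a ring, which is the inefficiency the abstract attributes to slow mixing.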

arXiv.org e-Print Archive

CiteSeerX

Crossref

eScholarship - University of California