1 research outputs found

    Probabilistic Polynomials and Hamming Nearest Neighbors

    Full text link
    We show how to compute any symmetric Boolean function on nn variables over any field (as well as the integers) with a probabilistic polynomial of degree O(nlog⁑(1/Ο΅))O(\sqrt{n \log(1/\epsilon)}) and error at most Ο΅\epsilon. The degree dependence on nn and Ο΅\epsilon is optimal, matching a lower bound of Razborov (1987) and Smolensky (1987) for the MAJORITY function. The proof is constructive: a low-degree polynomial can be efficiently sampled from the distribution. This polynomial construction is combined with other algebraic ideas to give the first subquadratic time algorithm for computing a (worst-case) batch of Hamming distances in superlogarithmic dimensions, exactly. To illustrate, let c(n):Nβ†’Nc(n) : \mathbb{N} \rightarrow \mathbb{N}. Suppose we are given a database DD of nn vectors in {0,1}c(n)log⁑n\{0,1\}^{c(n) \log n} and a collection of nn query vectors QQ in the same dimension. For all u∈Qu \in Q, we wish to compute a v∈Dv \in D with minimum Hamming distance from uu. We solve this problem in n2βˆ’1/O(c(n)log⁑2c(n))n^{2-1/O(c(n) \log^2 c(n))} randomized time. Hence, the problem is in "truly subquadratic" time for O(log⁑n)O(\log n) dimensions, and in subquadratic time for d=o((log⁑2n)/(log⁑log⁑n)2)d = o((\log^2 n)/(\log \log n)^2). We apply the algorithm to computing pairs with maximum inner product, closest pair in β„“1\ell_1 for vectors with bounded integer entries, and pairs with maximum Jaccard coefficients.Comment: 16 pages. To appear in 56th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2015
    corecore