
    Optimal Learning via the Fourier Transform for Sums of Independent Integer Random Variables

    We study the structure and learnability of sums of independent integer random variables (SIIRVs). For $k \in \mathbb{Z}_{+}$, a $k$-SIIRV of order $n \in \mathbb{Z}_{+}$ is the probability distribution of the sum of $n$ independent random variables, each supported on $\{0, 1, \dots, k-1\}$. We denote by ${\cal S}_{n,k}$ the set of all $k$-SIIRVs of order $n$. In this paper, we tightly characterize the sample and computational complexity of learning $k$-SIIRVs. More precisely, we design a computationally efficient algorithm that uses $\widetilde{O}(k/\epsilon^2)$ samples and learns an arbitrary $k$-SIIRV within error $\epsilon$ in total variation distance. Moreover, we show that the {\em optimal} sample complexity of this learning problem is $\Theta((k/\epsilon^2)\sqrt{\log(1/\epsilon)})$. Our algorithm proceeds by learning the Fourier transform of the target $k$-SIIRV on its effective support. Its correctness relies on the {\em approximate sparsity} of the Fourier transform of $k$-SIIRVs -- a structural property that we establish, roughly stating that the Fourier transform of a $k$-SIIRV has small magnitude outside a small set. Along the way we prove several new structural results about $k$-SIIRVs. As one of our main structural contributions, we give an efficient algorithm to construct a sparse {\em proper} $\epsilon$-cover for ${\cal S}_{n,k}$ in total variation distance. We also obtain a novel geometric characterization of the space of $k$-SIIRVs. Our characterization allows us to prove a tight lower bound on the size of $\epsilon$-covers for ${\cal S}_{n,k}$, and is the key ingredient in our tight sample complexity lower bound. Our approach of exploiting the sparsity of the Fourier transform in distribution learning is general and has recently found additional applications.
    Comment: Main differences from v1: Changed title and restructured introduction. Added new sample optimal algorithm. Generalized sample lower bound for any value of
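    To make the Fourier-based approach concrete, here is a minimal Python sketch in the spirit of the algorithm described in the abstract (not the paper's exact procedure): it estimates the empirical DFT of the samples on an effective-support window, thresholds small coefficients to exploit the approximate Fourier sparsity of $k$-SIIRVs, and inverts. The window width and the threshold $\epsilon/\sqrt{M}$ below are crude stand-ins for the paper's precise bounds.

```python
import numpy as np

def learn_siirv_fourier(samples, k, eps):
    """Sketch of Fourier-based learning for a k-SIIRV. Hypothetical
    simplification: the window width and threshold are crude stand-ins
    for the paper's precise bounds."""
    samples = np.asarray(samples, dtype=int)
    mu, sigma = samples.mean(), samples.std()
    # Effective support: a window of width O(k * sigma * sqrt(log(1/eps)))
    # around the mean captures all but ~eps of the probability mass.
    half = int(np.ceil(k * (sigma + 1) * np.sqrt(np.log(1 / eps)))) + k
    lo, M = int(mu) - half, 2 * half + 1
    shifted = np.clip(samples - lo, 0, M - 1)
    # Empirical distribution on the window, then its DFT.
    emp = np.bincount(shifted, minlength=M) / len(samples)
    fhat = np.fft.fft(emp)
    # Approximate sparsity: the true transform is small outside a small
    # set, so zeroing small empirical coefficients denoises the estimate.
    fhat[np.abs(fhat) < eps / np.sqrt(M)] = 0.0
    est = np.real(np.fft.ifft(fhat))
    # Project back onto the probability simplex.
    est = np.maximum(est, 0.0)
    return lo, est / est.sum()   # distribution on {lo, ..., lo + M - 1}
```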

    A Size-Free CLT for Poisson Multinomials and its Applications

    An $(n,k)$-Poisson Multinomial Distribution (PMD) is the distribution of the sum of $n$ independent random vectors supported on the set ${\cal B}_k = \{e_1, \ldots, e_k\}$ of standard basis vectors in $\mathbb{R}^k$. We show that any $(n,k)$-PMD is ${\rm poly}\left({k \over \sigma}\right)$-close in total variation distance to the (appropriately discretized) multi-dimensional Gaussian with the same first two moments, removing the dependence on $n$ from the Central Limit Theorem of Valiant and Valiant. Interestingly, our CLT is obtained by bootstrapping the Valiant-Valiant CLT itself through the structural characterization of PMDs shown in recent work by Daskalakis, Kamath, and Tzamos. In turn, our stronger CLT can be leveraged to obtain an efficient PTAS for approximate Nash equilibria in anonymous games, significantly improving the state of the art, and matching qualitatively the running time dependence on $n$ and $1/\varepsilon$ of the best known algorithm for two-strategy anonymous games. Our new CLT also enables the construction of covers for the set of $(n,k)$-PMDs, which are proper and whose size is shown to be essentially optimal. Our cover construction combines our CLT with the Shapley-Folkman theorem and recent sparsification results for Laplacian matrices by Batson, Spielman, and Srivastava. Our cover size lower bound is based on an algebraic geometric construction. Finally, leveraging the structural properties of the Fourier spectrum of PMDs, we show that these distributions can be learned from $O_k(1/\varepsilon^2)$ samples in ${\rm poly}_k(1/\varepsilon)$ time, removing the quasi-polynomial dependence of the running time on $1/\varepsilon$ from the algorithm of Daskalakis, Kamath, and Tzamos.
    Comment: To appear in STOC 2016.
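    The CLT lends itself to a quick empirical check. The sketch below (an illustration under simplifying assumptions, not the paper's construction) draws samples from an $(n,k)$-PMD, forms a Gaussian with matching first two moments discretized by rounding each coordinate to the nearest integer (one natural discretization; the paper's may differ), and compares first-coordinate marginals as a crude proxy for total variation distance.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_pmd(prob_rows, m):
    """Draw m samples of the (n,k)-PMD whose i-th summand equals basis
    vector e_j with probability prob_rows[i, j]; prob_rows is (n, k)."""
    n, k = prob_rows.shape
    out = np.zeros((m, k), dtype=int)
    for i in range(n):
        draws = rng.choice(k, size=m, p=prob_rows[i])
        out[np.arange(m), draws] += 1
    return out

# Tiny demo: n = 50 summands over k = 3 outcomes, random mixing weights.
n, k, m = 50, 3, 20000
rows = rng.dirichlet(np.ones(k), size=n)
pmd = sample_pmd(rows, m)
mean, cov = pmd.mean(axis=0), np.cov(pmd.T)
# Discretized Gaussian with the same first two moments; the covariance is
# singular (coordinates sum to n), which multivariate_normal tolerates.
gauss = np.rint(rng.multivariate_normal(mean, cov, size=m)).astype(int)
# Compare first-coordinate marginals as a crude proxy for TV distance.
lo = min(pmd[:, 0].min(), gauss[:, 0].min())
hi = max(pmd[:, 0].max(), gauss[:, 0].max())
edges = np.arange(lo, hi + 2)
p, _ = np.histogram(pmd[:, 0], bins=edges, density=True)
q, _ = np.histogram(gauss[:, 0], bins=edges, density=True)
print("half L1 distance of first marginals:", 0.5 * np.abs(p - q).sum())
```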

    Quantum Algorithms for Learning and Testing Juntas

    In this article we develop quantum algorithms for learning and testing juntas, i.e., Boolean functions which depend only on an unknown set of k out of n input variables. Our aim is to develop efficient algorithms:
    - whose sample complexity has no dependence on n, the dimension of the domain the Boolean functions are defined over;
    - with no access to any classical or quantum membership ("black-box") queries -- instead, our algorithms use only classical examples generated uniformly at random and fixed quantum superpositions of such classical examples;
    - which require only a few quantum examples but possibly many classical random examples (which are considered quite "cheap" relative to quantum examples).
    Our quantum algorithms are based on a subroutine FS which enables sampling according to the Fourier spectrum of f; the FS subroutine was used in earlier work of Bshouty and Jackson on quantum learning. Our results are as follows:
    - We give an algorithm for testing k-juntas to accuracy $\epsilon$ that uses $O(k/\epsilon)$ quantum examples. This improves on the number of examples used by the best known classical algorithm.
    - We establish the following lower bound: any FS-based k-junta testing algorithm requires $\Omega(\sqrt{k})$ queries.
    - We give an algorithm for learning $k$-juntas to accuracy $\epsilon$ that uses $O(\epsilon^{-1} k \log k)$ quantum examples and $O(2^k \log(1/\epsilon))$ random examples. We show that this learning algorithm is close to optimal by giving a related lower bound.
    Comment: 15 pages, 1 figure. Uses synttree package. To appear in Quantum Information Processing.
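    For intuition about the FS subroutine, the following classical simulation (tractable only for small n, which is exactly the cost the quantum routine avoids) computes the Fourier spectrum of f via a Walsh-Hadamard transform and samples a set S with probability $\hat{f}(S)^2$. For a k-junta, every sampled set is contained in the relevant variables, so their union recovers the junta; the integer encoding of x and S is a choice made for this demo.

```python
import numpy as np

rng = np.random.default_rng(1)

def fourier_spectrum(f_vals):
    """Walsh-Hadamard transform of f: {0,1}^n -> {-1,+1}, given as a
    length-2^n array of +/-1 values indexed by the integer encoding of x.
    Returns hat_f(S), indexed by the integer encoding of the set S."""
    n = int(np.log2(len(f_vals)))
    hat = f_vals.astype(float)
    for i in range(n):                              # iterative fast WHT
        step = 1 << i
        for lo in range(0, len(hat), 2 * step):
            a = hat[lo:lo + step].copy()
            b = hat[lo + step:lo + 2 * step].copy()
            hat[lo:lo + step] = a + b
            hat[lo + step:lo + 2 * step] = a - b
    return hat / len(f_vals)

def fs_sample(hat, num_samples):
    """Classical stand-in for the quantum FS subroutine: sample S with
    probability hat_f(S)^2 (these squares sum to 1 for Boolean f)."""
    probs = hat ** 2
    return rng.choice(len(hat), size=num_samples, p=probs / probs.sum())

# Demo: a 3-junta on n = 8 variables, f(x) = parity of bits 0, 3, 5.
n, junta = 8, (0, 3, 5)
xs = np.arange(2 ** n)
f_vals = np.where(sum((xs >> j) & 1 for j in junta) % 2 == 0, 1, -1)
sets = fs_sample(fourier_spectrum(f_vals), 20)
found = set()
for s in sets:
    found |= {j for j in range(n) if (s >> j) & 1}
print("relevant variables found:", sorted(found))   # expect [0, 3, 5]
```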