575 research outputs found
Distributional Property Testing in a Quantum World
A fundamental problem in statistics and learning theory is to test properties of distributions. We show that quantum computers can solve such problems with significant speed-ups. We also introduce a novel access model for quantum distributions, enabling the coherent preparation of quantum samples, and propose a general framework that can naturally handle both classical and quantum distributions in a unified manner. Our framework generalizes and improves previous quantum algorithms for testing closeness between unknown distributions, testing independence between two distributions, and estimating the Shannon / von Neumann entropy of distributions. For classical distributions our algorithms significantly improve the precision dependence of some earlier results. We also show that in our framework procedures for classical distributions can be directly lifted to the more general case of quantum distributions, and thus obtain the first speed-ups for testing properties of density operators that can be accessed coherently rather than only via sampling
Likelihood-free hypothesis testing
Consider the problem of testing vs from samples. Generally, to achieve a small error
rate it is necessary and sufficient to have , where
measures the separation between and in total
variation (). Achieving this, however, requires complete knowledge
of the distributions and and can be done, for example,
using the Neyman-Pearson test. In this paper we consider a variation of the
problem, which we call likelihood-free (or simulation-based) hypothesis
testing, where access to and (which are a priori only
known to belong to a large non-parametric family ) is given through
iid samples from each. We demostrate existence of a fundamental trade-off
between and given by ,
where is the minimax sample complexity of testing between the
hypotheses vs . We show this for three non-parametric families :
-smooth densities over , the Gaussian sequence model over a
Sobolev ellipsoid, and the collection of distributions on a large
alphabet with pmfs bounded by for fixed . The test that we
propose (based on the -distance statistic of Ingster) simultaneously
achieves all points on the tradeoff curve for these families. In particular,
when our test requires the number of simulation samples
to be orders of magnitude smaller than what is needed for density estimation
with accuracy (under ). This demonstrates the
possibility of testing without fully estimating the distributions.Comment: 48 pages, 1 figur
- …