Search CORE

2,696 research outputs found

Agnostic Learning of Disjunctions on Symmetric Distributions

Author: Feldman Vitaly
Kothari Pravesh
Publication venue
Publication date: 25/05/2015
Field of study

We consider the problem of approximating and learning disjunctions (or equivalently, conjunctions) on symmetric distributions over

\{0,1\}^n

. Symmetric distributions are distributions whose PDF is invariant under any permutation of the variables. We give a simple proof that for every symmetric distribution

\mathcal{D}

, there exists a set of

n^{O(\log{(1/\epsilon)})}

functions

\mathcal{S}

, such that for every disjunction

c

, there is function

p

, expressible as a linear combination of functions in

\mathcal{S}

, such that

p

\epsilon

-approximates

c

\ell_1

distance on

\mathcal{D}

\mathbf{E}_{x \sim \mathcal{D}}[ |c(x)-p(x)|] \leq \epsilon

. This directly gives an agnostic learning algorithm for disjunctions on symmetric distributions that runs in time

n^{O( \log{(1/\epsilon)})}

. The best known previous bound is

n^{O(1/\epsilon^4)}

and follows from approximation of the more general class of halfspaces (Wimmer, 2010). We also show that there exists a symmetric distribution

\mathcal{D}

, such that the minimum degree of a polynomial that

1/3

-approximates the disjunction of all

n

variables is

\ell_1

distance on

\mathcal{D}

\Omega( \sqrt{n})

. Therefore the learning result above cannot be achieved via

\ell_1

-regression with a polynomial basis used in most other agnostic learning algorithms. Our technique also gives a simple proof that for any product distribution

\mathcal{D}

and every disjunction

c

, there exists a polynomial

p

of degree

O(\log{(1/\epsilon)})

such that

p

\epsilon

-approximates

c

\ell_1

distance on

\mathcal{D}

. This was first proved by Blais et al. (2008) via a more involved argument

arXiv.org e-Print Archive

CiteSeerX

Learning using Local Membership Queries

Author: Awasthi Pranjal
Feldman Vitaly
Kanade Varun
Publication venue
Publication date: 17/04/2013
Field of study

We introduce a new model of membership query (MQ) learning, where the learning algorithm is restricted to query points that are \emph{close} to random examples drawn from the underlying distribution. The learning model is intermediate between the PAC model (Valiant, 1984) and the PAC+MQ model (where the queries are allowed to be arbitrary points). Membership query algorithms are not popular among machine learning practitioners. Apart from the obvious difficulty of adaptively querying labelers, it has also been observed that querying \emph{unnatural} points leads to increased noise from human labelers (Lang and Baum, 1992). This motivates our study of learning algorithms that make queries that are close to examples generated from the data distribution. We restrict our attention to functions defined on the

n

-dimensional Boolean hypercube and say that a membership query is local if its Hamming distance from some example in the (random) training data is at most

O(\log(n))

. We show the following results in this model: (i) The class of sparse polynomials (with coefficients in R) over

\{0,1\}^n

is polynomial time learnable under a large class of \emph{locally smooth} distributions using

O(\log(n))

-local queries. This class also includes the class of

O(\log(n))

-depth decision trees. (ii) The class of polynomial-sized decision trees is polynomial time learnable under product distributions using

O(\log(n))

-local queries. (iii) The class of polynomial size DNF formulas is learnable under the uniform distribution using

O(\log(n))

-local queries in time

n^{O(\log(\log(n)))}

. (iv) In addition we prove a number of results relating the proposed model to the traditional PAC model and the PAC+MQ model

arXiv.org e-Print Archive

CiteSeerX

Tailoring a coherent control solution landscape by linear transforms of spectral phase basis

Author: Herek Jennifer
Jafarpour Aliakbar
Offerhaus Herman
Walle Peter van der
Publication venue: Optical Society of America
Publication date: 01/01/2010
Field of study

Finding an optimal phase pattern in a multidimensional solution landscape becomes easier and faster if local optima are suppressed and contour lines are tailored towards closed convex patterns. Using wideband second harmonic generation as a coherent control test case, we show that a linear combination of spectral phase basis functions can result in such improvements and also in separable phase terms, each of which can be found independently. The improved shapes are attributed to a suppressed nonlinear shear, changing the relative orientation of contour lines. The first order approximation of the process shows a simple relation between input and output phase profiles, useful for pulse shaping at ultraviolet wavelengths

University of Twente Research Information