Search CORE

421 research outputs found

Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms

Author: Khardon R.
Roth D.
Servedio R. A.
Publication venue: 'AI Access Foundation'
Publication date: 09/09/2011
Field of study

The paper studies machine learning problems where each example is described using a set of Boolean features and where hypotheses are represented by linear threshold elements. One method of increasing the expressiveness of learned hypotheses in this context is to expand the feature set to include conjunctions of basic features. This can be done explicitly or where possible by using a kernel function. Focusing on the well known Perceptron and Winnow algorithms, the paper demonstrates a tradeoff between the computational efficiency with which the algorithm can be run over the expanded feature space and the generalization ability of the corresponding learning algorithm. We first describe several kernel functions which capture either limited forms of conjunctions or all conjunctions. We show that these kernels can be used to efficiently run the Perceptron algorithm over a feature space of exponentially many conjunctions; however we also show that using such kernels, the Perceptron algorithm can provably make an exponential number of mistakes even when learning simple functions. We then consider the question of whether kernel functions can analogously be used to run the multiplicative-update Winnow algorithm over an expanded feature space of exponentially many conjunctions. Known upper bounds imply that the Winnow algorithm can learn Disjunctive Normal Form (DNF) formulae with a polynomial mistake bound in this setting. However, we prove that it is computationally hard to simulate Winnows behavior for learning DNF over such a feature set. This implies that the kernel functions which correspond to running Winnow for this problem are not efficiently computable, and that there is no general construction that can run Winnow with kernels

arXiv.org e-Print Archive

Crossref

A Nearly Optimal Lower Bound on the Approximate Degree of AC $^0$

Author: Bun Mark
Thaler Justin
Publication venue
Publication date: 16/03/2017
Field of study

The approximate degree of a Boolean function

f \colon \{-1, 1\}^n \rightarrow \{-1, 1\}

is the least degree of a real polynomial that approximates

f

pointwise to error at most

1/3

. We introduce a generic method for increasing the approximate degree of a given function, while preserving its computability by constant-depth circuits. Specifically, we show how to transform any Boolean function

f

with approximate degree

d

into a function

F

O(n \cdot \operatorname{polylog}(n))

variables with approximate degree at least

D = \Omega(n^{1/3} \cdot d^{2/3})

. In particular, if

d= n^{1-\Omega(1)}

, then

D

is polynomially larger than

d

. Moreover, if

f

is computed by a polynomial-size Boolean circuit of constant depth, then so is

F

. By recursively applying our transformation, for any constant

\delta > 0

we exhibit an AC

^0

function of approximate degree

\Omega(n^{1-\delta})

. This improves over the best previous lower bound of

\Omega(n^{2/3})

due to Aaronson and Shi (J. ACM 2004), and nearly matches the trivial upper bound of

n

that holds for any function. Our lower bounds also apply to (quasipolynomial-size) DNFs of polylogarithmic width. We describe several applications of these results. We give: * For any constant

\delta > 0

, an

\Omega(n^{1-\delta})

lower bound on the quantum communication complexity of a function in AC

^0

. * A Boolean function

f

with approximate degree at least

C(f)^{2-o(1)}

, where

C(f)

is the certificate complexity of

f

. This separation is optimal up to the

o(1)

term in the exponent. * Improved secret sharing schemes with reconstruction procedures in AC

^0

.Comment: 40 pages, 1 figur

arXiv.org e-Print Archive

Crossref

Complexity Results on Learning by Neural Nets

Author: Lin Jyh-Han
Vitter Jeffrey Scott
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/03/2011
Field of study

We consider the computational complexity of learning by neural nets. We are inter- ested in how hard it is to design appropriate neural net architectures and to train neural nets for general and specialized learning tasks. Our main result shows that the training problem for 2-cascade neural nets (which have only two non-input nodes, one of which is hidden) is NP-complete, which implies that nding an optimal net (in terms of the number of non-input units) that is consistent with a set of exam- ples is also NP-complete. This result also demonstrates a surprising gap between the computational complexities of one-node (perceptron) and two-node neural net training problems, since the perceptron training problem can be solved in polynomial time by linear programming techniques. We conjecture that training a k-cascade neural net, which is a classical threshold network training problem, is also NP-complete, for each xed k 2. We also show that the problem of nding an optimal perceptron (in terms of the number of non-zero weights) consistent with a set of training examples is NP-hard. Our neural net learning model encapsulates the idea of modular neural nets, which is a popular approach to overcoming the scaling problem in training neural nets. We investigate how much easier the training problem becomes if the class of concepts to be learned is known a priori and the net architecture is allowed to be su ciently non-optimal. Finally, we classify several neural net optimization problems within the polynomial-time hierarchy

KU ScholarWorks

Neural Relax

Author: Cover T. M.
Elisa Benedetti
Hertz J.
Jackson J. D.
Marco Budinich
Marques G. C.
Pham D. T.
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2012
Field of study

We present an algorithm for data preprocessing of an associative memory inspired to an electrostatic problem that turns out to have intimate relations with information maximization

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Trieste

CiteSeerX

Crossref

Communication Complexity Lower Bounds by Polynomials

Author: Buhrman Harry
de Wolf Ronald
Publication venue
Publication date: 01/01/1999
Field of study

The quantum version of communication complexity allows the two communicating parties to exchange qubits and/or to make use of prior entanglement (shared EPR-pairs). Some lower bound techniques are available for qubit communication complexity, but except for the inner product function, no bounds are known for the model with unlimited prior entanglement. We show that the log-rank lower bound extends to the strongest model (qubit communication + unlimited prior entanglement). By relating the rank of the communication matrix to properties of polynomials, we are able to derive some strong bounds for exact protocols. In particular, we prove both the "log-rank conjecture" and the polynomial equivalence of quantum and classical communication complexity for various classes of functions. We also derive some weaker bounds for bounded-error quantum protocols.Comment: 16 pages LaTeX, no figures. 2nd version: rewritten and some results adde

arXiv.org e-Print Archive

CiteSeerX

International Migration, Integration and Social Cohesion online publications