4,263 research outputs found

Universal bound on the cardinality of local hidden variables in networks

Author: Gisin Nicolas
Rosset Denis
Wolfe Elie
Publication venue: 'Rinton Press'
Publication date: 03/09/2017

We present an algebraic description of the sets of local correlations in arbitrary networks, when the parties have finite inputs and outputs. We consider networks generalizing the usual Bell scenarios by the presence of multiple uncorrelated sources. We prove a finite upper bound on the cardinality of the value sets of the local hidden variables. Consequently, we find that the sets of local correlations are connected, closed and semialgebraic, and bounded by tight polynomial Bell-like inequalities. Comment: 5 pages + 6 pages appendix, 2 figures, comments welcome

arXiv.org e-Print Archive

Archive ouverte UNIGE

Hierarchical Models as Marginals of Hierarchical Models

Author: Montufar Guido
Rauh Johannes
Publication date: 07/03/2016

We investigate the representation of hierarchical models in terms of marginals of other hierarchical models with smaller interactions. We focus on binary variables and marginals of pairwise interaction models whose hidden variables are conditionally independent given the visible variables. In this case the problem is equivalent to the representation of linear subspaces of polynomials by feedforward neural networks with soft-plus computational units. We show that every hidden variable can freely model multiple interactions among the visible variables, which allows us to generalize and improve previous results. In particular, we show that a restricted Boltzmann machine with fewer than $[2(\log(v)+1)/(v+1)]\,2^v - 1$ hidden binary variables can approximate every distribution of $v$ visible binary variables arbitrarily well, compared to $2^{v-1}-1$ from the best previously known result. Comment: 18 pages, 4 figures, 2 tables, WUPES'1
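The two hidden-unit counts quoted in this abstract can be compared numerically. The following is a minimal sketch, not code from the paper: the function names are hypothetical, and the base of the logarithm is an assumption (base 2 here), since the abstract does not state it.

```python
import math


def quoted_hidden_bound(v: int, log_base: float = 2.0) -> float:
    """Evaluate [2(log(v)+1)/(v+1)] * 2^v - 1 as quoted in the abstract.

    ASSUMPTION: the logarithm base is not given in the abstract;
    base 2 is used as an illustrative default.
    """
    return 2 * (math.log(v, log_base) + 1) / (v + 1) * 2**v - 1


def previous_hidden_bound(v: int) -> int:
    """Evaluate 2^(v-1) - 1, the previously known count quoted in the abstract."""
    return 2 ** (v - 1) - 1


if __name__ == "__main__":
    # Print both expressions side by side for a few numbers of visible units.
    for v in (8, 16, 32, 64):
        print(v, quoted_hidden_bound(v), previous_hidden_bound(v))
```

Under the base-2 assumption, the quoted expression falls below $2^{v-1}-1$ only once $v$ is moderately large, because the prefactor $2(\log(v)+1)/(v+1)$ has to drop under $1/2$.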

arXiv.org e-Print Archive

eScholarship - University of California

A Theory of Cheap Control in Embodied Systems

Author: B Sallans
C Paul
CE Shannon
CM Bishop
D Sol
DD Clark
DFB Haeufle
EA Rückert
G Montúfar
GE Hinton
Guido Montúfar
GW Taylor
H Hauser
J Martens
J Pearl
JE Auerbach
Josh C. Bongard
K Aström
K Zahedi
Keyan Ghazi-Zahedi
L Berthouze
L Sokoloff
M Lungarella
N Ay
N Le Roux
Nihat Ay
O Rivoire
R Pfeifer
RA Brooks
T McGeer
V Braitenberg
VN Vapnik
W Bialek
Y Bengio
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 15/11/2014

We present a framework for designing cheap control architectures for embodied agents. Our derivation is guided by the classical problem of universal approximation, whereby we explore the possibility of exploiting the agent's embodiment for a new and more efficient universal approximation of behaviors generated by sensorimotor control. This embodied universal approximation is compared with the classical non-embodied universal approximation. To exemplify our approach, we present a detailed quantitative case study for policy models defined in terms of conditional restricted Boltzmann machines. In contrast to non-embodied universal approximation, which requires an exponential number of parameters, in the embodied setting we are able to generate all possible behaviors with a drastically smaller model, thus obtaining cheap universal approximation. We test and corroborate the theory experimentally with a six-legged walking machine. The experiments show that the sufficient controller complexity predicted by our theory is tight, which means that the theory has direct practical implications. Keywords: cheap design, embodiment, sensorimotor loop, universal approximation, conditional restricted Boltzmann machine. Comment: 27 pages, 10 figures

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Universal Approximation Depth and Errors of Narrow Belief Networks with Discrete Units

Author: Montúfar Guido F.
Publication date: 01/01/2014

We generalize recent theoretical work on the minimal number of layers of narrow deep belief networks that can approximate any probability distribution on the states of their visible units arbitrarily well. We relax the setting of binary units (Sutskever and Hinton, 2008; Le Roux and Bengio, 2008, 2010; Montúfar and Ay, 2011) to units with arbitrary finite state spaces, and the vanishing approximation error to an arbitrary approximation error tolerance. For example, we show that a $q$-ary deep belief network with $L \geq 2 + \frac{q^{\lceil m-\delta \rceil}-1}{q-1}$ layers of width $n \leq m + \log_q(m) + 1$ for some $m \in \mathbb{N}$ can approximate any probability distribution on $\{0,1,\ldots,q-1\}^n$ without exceeding a Kullback-Leibler divergence of $\delta$. Our analysis covers discrete restricted Boltzmann machines and naïve Bayes models as special cases. Comment: 19 pages, 5 figures, 1 table
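For a concrete sense of the depth and width expressions quoted in this abstract, they can be evaluated for small parameters. This is a minimal sketch under stated assumptions: the function names are hypothetical, and the code only evaluates the two quoted expressions, not the paper's construction. It also assumes $\delta \leq m$ so that the exponent is non-negative.

```python
import math


def sufficient_layers(q: int, m: int, delta: float) -> int:
    """Smallest integer L with L >= 2 + (q^ceil(m - delta) - 1) / (q - 1).

    ASSUMPTION: delta <= m, so the exponent ceil(m - delta) is non-negative.
    """
    if q < 2:
        raise ValueError("q must be at least 2")
    if delta > m:
        raise ValueError("this sketch assumes delta <= m")
    k = math.ceil(m - delta)
    # (q^k - 1) / (q - 1) is the geometric sum 1 + q + ... + q^(k-1),
    # so integer division is exact here.
    return 2 + (q**k - 1) // (q - 1)


def largest_width(q: int, m: int) -> int:
    """Largest integer n with n <= m + log_q(m) + 1."""
    return math.floor(m + math.log(m, q) + 1)


if __name__ == "__main__":
    # Binary units (q = 2) with m = 3, for zero tolerance and tolerance 1.
    print(sufficient_layers(2, 3, 0), largest_width(2, 3))
    print(sufficient_layers(2, 3, 1), largest_width(2, 3))
```

In this sketch, raising the tolerance $\delta$ lowers the exponent $\lceil m-\delta \rceil$, which shrinks the number of layers the quoted expression asks for.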

arXiv.org e-Print Archive

CiteSeerX

eScholarship - University of California

When Does a Mixture of Products Contain a Product of Mixtures?

Author: Montufar Guido F.
Morton Jason
Publication date: 01/01/2014

We derive relations between theoretical properties of restricted Boltzmann machines (RBMs), popular machine learning models which form the building blocks of deep learning models, and several natural notions from discrete mathematics and convex geometry. We give implications and equivalences relating RBM-representable probability distributions, perfectly reconstructible inputs, Hamming modes, zonotopes and zonosets, point configurations in hyperplane arrangements, linear threshold codes, and multi-covering numbers of hypercubes. As a motivating application, we prove results on the relative representational power of mixtures of product distributions and products of mixtures of pairs of product distributions (RBMs) that formally justify widely held intuitions about distributed representations. In particular, we show that a mixture of products requiring an exponentially larger number of parameters is needed to represent the probability distributions which can be obtained as products of mixtures. Comment: 32 pages, 6 figures, 2 tables

arXiv.org e-Print Archive

CiteSeerX

eScholarship - University of California

A Nonparametric Ensemble Binary Classifier and its Statistical Properties

Author: Chakraborty Ashis Kumar
Chakraborty Tanujit
Murthy C. A.
Publication date: 18/09/2018

In this work, we propose an ensemble of classification trees (CT) and artificial neural networks (ANN). Several statistical properties of the proposed classifier are shown, including universal consistency and an upper bound on an important parameter. Numerical evidence is also provided using various real-life data sets to assess the performance of the model. Our proposed nonparametric ensemble classifier does not suffer from the 'curse of dimensionality' and can be used in a wide variety of feature selection cum classification problems. The proposed model performs considerably better than many other state-of-the-art models used in similar situations.

arXiv.org e-Print Archive