Search CORE

7 research outputs found

Mixture decompositions of exponential families using a decomposition of their sample spaces

Author: A. GIMIGLIANO
A.V. GERAMITA
M.V. CATALISANO
Publication venue
Publication date: 25/03/2010
Field of study

We study the problem of finding the smallest

m

such that every element of an exponential family can be written as a mixture of

m

elements of another exponential family. We propose an approach based on coverings and packings of the face lattice of the corresponding convex support polytopes and results from coding theory. We show that

m=q^{N-1}

is the smallest number for which any distribution of

N

q

-ary variables can be written as mixture of

m

independent

q

-ary variables. Furthermore, we show that any distribution of

N

binary variables is a mixture of

m = 2^{N-(k+1)}(1+ 1/(2^k-1))

elements of the

k

-interaction exponential family.Comment: 17 pages, 2 figure

arXiv.org e-Print Archive

Institute of Mathematics AS CR, v. v. i.

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Genova

Open Access Repository

Mixtures and products in two graphical models

Author: Montufar Guido
Seigal Anna
Publication venue
Publication date: 15/09/2017
Field of study

We compare two statistical models of three binary random variables. One is a mixture model and the other is a product of mixtures model called a restricted Boltzmann machine. Although the two models we study look different from their parametrizations, we show that they represent the same set of distributions on the interior of the probability simplex, and are equal up to closure. We give a semi-algebraic description of the model in terms of six binomial inequalities and obtain closed form expressions for the maximum likelihood estimates. We briefly discuss extensions to larger models.Comment: 18 pages, 7 figure

arXiv.org e-Print Archive

eScholarship - University of California

Oxford University Research Archive

MPG.PuRe

Hierarchical Models as Marginals of Hierarchical Models

Author: Montufar Guido
Rauh Johannes
Publication venue
Publication date: 07/03/2016
Field of study

We investigate the representation of hierarchical models in terms of marginals of other hierarchical models with smaller interactions. We focus on binary variables and marginals of pairwise interaction models whose hidden variables are conditionally independent given the visible variables. In this case the problem is equivalent to the representation of linear subspaces of polynomials by feedforward neural networks with soft-plus computational units. We show that every hidden variable can freely model multiple interactions among the visible variables, which allows us to generalize and improve previous results. In particular, we show that a restricted Boltzmann machine with less than

[ 2(\log(v)+1) / (v+1) ] 2^v-1

hidden binary variables can approximate every distribution of

v

visible binary variables arbitrarily well, compared to

2^{v-1}-1

from the best previously known result.Comment: 18 pages, 4 figures, 2 tables, WUPES'1

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

When Does a Mixture of Products Contain a Product of Mixtures?

Author: Montufar Guido F.
Morton Jason
Publication venue
Publication date: 01/01/2014
Field of study

We derive relations between theoretical properties of restricted Boltzmann machines (RBMs), popular machine learning models which form the building blocks of deep learning models, and several natural notions from discrete mathematics and convex geometry. We give implications and equivalences relating RBM-representable probability distributions, perfectly reconstructible inputs, Hamming modes, zonotopes and zonosets, point configurations in hyperplane arrangements, linear threshold codes, and multi-covering numbers of hypercubes. As a motivating application, we prove results on the relative representational power of mixtures of product distributions and products of mixtures of pairs of product distributions (RBMs) that formally justify widely held intuitions about distributed representations. In particular, we show that a mixture of products requiring an exponentially larger number of parameters is needed to represent the probability distributions which can be obtained as products of mixtures.Comment: 32 pages, 6 figures, 2 table

arXiv.org e-Print Archive

CiteSeerX

Crossref

eScholarship - University of California