Search CORE

7,369 research outputs found

Mean Field Theory for Sigmoid Belief Networks

Author: Jaakkola T.
Jordan M. I.
Saul L. K.
Publication venue
Publication date: 01/01/1996
Field of study

We develop a mean field theory for sigmoid belief networks based on ideas from statistical mechanics. Our mean field theory provides a tractable approximation to the true probability distribution in these networks; it also yields a lower bound on the likelihood of evidence. We demonstrate the utility of this framework on a benchmark problem in statistical pattern recognition---the classification of handwritten digits.Comment: See http://www.jair.org/ for any accompanying file

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Mean Field Methods for a Special Class of Belief Networks

Author: Bhattacharyya C.
Keerthi S. S.
Publication venue: 'AI Access Foundation'
Publication date: 01/06/2011
Field of study

The chief aim of this paper is to propose mean-field approximations for a broad class of Belief networks, of which sigmoid and noisy-or networks can be seen as special cases. The approximations are based on a powerful mean-field theory suggested by Plefka. We show that Saul, Jaakkola and Jordan' s approach is the first order approximation in Plefka's approach, via a variational derivation. The application of Plefka's theory to belief networks is not computationally tractable. To tackle this problem we propose new approximations based on Taylor series. Small scale experiments show that the proposed schemes are attractive

arXiv.org e-Print Archive

Crossref

Deep Exponential Families

Author: Blei David M.
Charlin Laurent
Ranganath Rajesh
Tang Linpeng
Publication venue
Publication date: 10/11/2014
Field of study

We describe \textit{deep exponential families} (DEFs), a class of latent variable models that are inspired by the hidden structures used in deep neural networks. DEFs capture a hierarchy of dependencies between latent variables, and are easily generalized to many settings through exponential families. We perform inference using recent "black box" variational inference techniques. We then evaluate various DEFs on text and combine multiple DEFs into a model for pairwise recommendation data. In an extensive study, we show that going beyond one layer improves predictions for DEFs. We demonstrate that DEFs find interesting exploratory structure in large data sets, and give better predictive performance than state-of-the-art models

arXiv.org e-Print Archive

CiteSeerX

Princeton University Open Access Repository