Search CORE

45,654 research outputs found

Estimating Dependency Structure as a Hidden Variable

Author: Jordan Michael I.
Meila Marina
Morris Quaid
Publication venue
Publication date: 01/01/1997
Field of study

This paper introduces a probability model, the mixture of trees that can account for sparse, dynamically changing dependence relationships. We present a family of efficient algorithms that use EM and the Minimum Spanning Tree algorithm to find the ML and MAP mixture of trees for a variety of priors, including the Dirichlet and the MDL priors. We also show that the single tree classifier acts like an implicit feature selector, thus making the classification performance insensitive to irrelevant attributes. Experimental results demonstrate the excellent performance of the new model both in density estimation and in classification

CiteSeerX

DSpace@MIT

Hidden Gibbs random fields model selection using Block Likelihood Information Criterion

Author: Alfò
Besag
Besag
Besag
Besag
Celeux
Clifford
Csiszár
Csiszár
Cucala
Everitt
Forbes
François
Friel
Friel
Friel
Friel
Gassiat
Geman
Green
Grelaud
Grimmett
Haughton
Hurn
Ji
Jordan
Kass
Kass
Keribin
Lindsay
Marin
McGrory
Moores
Okabayashi
Qian
Raftery
Reeves
Schwarz
Seymour
Stanford
Stoehr
Stoehr
Tierney
Publication venue: 'Wiley'
Publication date: 08/02/2016
Field of study

Performing model selection between Gibbs random fields is a very challenging task. Indeed, due to the Markovian dependence structure, the normalizing constant of the fields cannot be computed using standard analytical or numerical methods. Furthermore, such unobserved fields cannot be integrated out and the likelihood evaluztion is a doubly intractable problem. This forms a central issue to pick the model that best fits an observed data. We introduce a new approximate version of the Bayesian Information Criterion. We partition the lattice into continuous rectangular blocks and we approximate the probability measure of the hidden Gibbs field by the product of some Gibbs distributions over the blocks. On that basis, we estimate the likelihood and derive the Block Likelihood Information Criterion (BLIC) that answers model choice questions such as the selection of the dependency structure or the number of latent states. We study the performances of BLIC for those questions. In addition, we present a comparison with ABC algorithms to point out that the novel criterion offers a better trade-off between time efficiency and reliable results

arXiv.org e-Print Archive

Crossref

HAL AMU

HAL Descartes

A Deep Embedding Model for Co-occurrence Learning

Author: Chen Jianshu
Deng Li
Gao Jianfeng
He Xiaodong
Jin Ruoming
Shen Yelong
Publication venue
Publication date: 04/06/2015
Field of study

Co-occurrence Data is a common and important information source in many areas, such as the word co-occurrence in the sentences, friends co-occurrence in social networks and products co-occurrence in commercial transaction data, etc, which contains rich correlation and clustering information about the items. In this paper, we study co-occurrence data using a general energy-based probabilistic model, and we analyze three different categories of energy-based model, namely, the

L_1

L_2

and

L_k

models, which are able to capture different levels of dependency in the co-occurrence data. We also discuss how several typical existing models are related to these three types of energy models, including the Fully Visible Boltzmann Machine (FVBM) (

L_2

), Matrix Factorization (

L_2

), Log-BiLinear (LBL) models (

L_2

), and the Restricted Boltzmann Machine (RBM) model (

L_k

). Then, we propose a Deep Embedding Model (DEM) (an

L_k

model) from the energy model in a \emph{principled} manner. Furthermore, motivated by the observation that the partition function in the energy model is intractable and the fact that the major objective of modeling the co-occurrence data is to predict using the conditional probability, we apply the \emph{maximum pseudo-likelihood} method to learn DEM. In consequence, the developed model and its learning method naturally avoid the above difficulties and can be easily used to compute the conditional probability in prediction. Interestingly, our method is equivalent to learning a special structured deep neural network using back-propagation and a special sampling strategy, which makes it scalable on large-scale datasets. Finally, in the experiments, we show that the DEM can achieve comparable or better results than state-of-the-art methods on datasets across several application domains

arXiv.org e-Print Archive

Crossref

Estimating within-household contact networks from egocentric data

Author: Halloran M. Elizabeth
Handcock Mark S.
Longini Jr., Ira M.
Potter Gail E.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2011
Field of study

Acute respiratory diseases are transmitted over networks of social contacts. Large-scale simulation models are used to predict epidemic dynamics and evaluate the impact of various interventions, but the contact behavior in these models is based on simplistic and strong assumptions which are not informed by survey data. These assumptions are also used for estimating transmission measures such as the basic reproductive number and secondary attack rates. Development of methodology to infer contact networks from survey data could improve these models and estimation methods. We contribute to this area by developing a model of within-household social contacts and using it to analyze the Belgian POLYMOD data set, which contains detailed diaries of social contacts in a 24-hour period. We model dependency in contact behavior through a latent variable indicating which household members are at home. We estimate age-specific probabilities of being at home and age-specific probabilities of contact conditional on two members being at home. Our results differ from the standard random mixing assumption. In addition, we find that the probability that all members contact each other on a given day is fairly low: 0.49 for households with two 0--5 year olds and two 19--35 year olds, and 0.36 for households with two 12--18 year olds and two 36+ year olds. We find higher contact rates in households with 2--3 members, helping explain the higher influenza secondary attack rates found in households of this size.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS474 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

PubMed Central

eScholarship - University of California

Are you going to the party: depends, who else is coming? [Learning hidden group dynamics via conditional latent tree models]

Author: Anandkumar Animashree
Arabshahi Forough
Butts Carter T.
Fitshugh Sean M.
Huang Furong
Publication venue
Publication date: 01/11/2015
Field of study

Scalable probabilistic modeling and prediction in high dimensional multivariate time-series is a challenging problem, particularly for systems with hidden sources of dependence and/or homogeneity. Examples of such problems include dynamic social networks with co-evolving nodes and edges and dynamic student learning in online courses. Here, we address these problems through the discovery of hierarchical latent groups. We introduce a family of Conditional Latent Tree Models (CLTM), in which tree-structured latent variables incorporate the unknown groups. The latent tree itself is conditioned on observed covariates such as seasonality, historical activity, and node attributes. We propose a statistically efficient framework for learning both the hierarchical tree structure and the parameters of the CLTM. We demonstrate competitive performance in multiple real world datasets from different domains. These include a dataset on students' attempts at answering questions in a psychology MOOC, Twitter users participating in an emergency management discussion and interacting with one another, and windsurfers interacting on a beach in Southern California. In addition, our modeling framework provides valuable and interpretable information about the hidden group structures and their effect on the evolution of the time series

arXiv.org e-Print Archive

Crossref

Caltech Authors