    Hierarchical relational models for document networks

    We develop the relational topic model (RTM), a hierarchical model of both network structure and node attributes. We focus on document networks, where the attributes of each document are its words, that is, discrete observations taken from a fixed vocabulary. For each pair of documents, the RTM models their link as a binary random variable that is conditioned on their contents. The model can be used to summarize a network of documents, predict links between them, and predict words within them. We derive efficient inference and estimation algorithms based on variational methods that take advantage of sparsity and scale with the number of links. We evaluate the predictive performance of the RTM for large networks of scientific abstracts, web documents, and geographically tagged news. Comment: Published at http://dx.doi.org/10.1214/09-AOAS309 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org).
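    As a rough illustration of the kind of link model the abstract describes, the sketch below scores a pair of documents by the overlap of their topic proportions and passes it through a logistic function. The function name, the parameters (eta, nu), and the toy values are assumptions for illustration, not the paper's exact specification.

```python
import numpy as np

def link_probability(z_d, z_dp, eta, nu):
    """Probability of a link between two documents given their
    topic-proportion vectors (a sketch, not the paper's exact model).

    z_d, z_dp : topic proportions of the two documents (length K)
    eta, nu   : regression weights and intercept (hypothetical parameters)
    """
    score = eta @ (z_d * z_dp) + nu          # element-wise topic overlap
    return 1.0 / (1.0 + np.exp(-score))      # logistic link

# Toy usage with K = 3 topics
z_a = np.array([0.7, 0.2, 0.1])
z_b = np.array([0.6, 0.3, 0.1])
print(link_probability(z_a, z_b, eta=np.array([2.0, 1.0, 0.5]), nu=-1.0))
```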

    Reducing Reparameterization Gradient Variance

    Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparameterization gradients, or gradient estimates computed via the "reparameterization trick," represent a class of noisy gradients often used in Monte Carlo variational inference (MCVI). However, when these gradient estimators are too noisy, the optimization procedure can be slow or fail to converge. One way to reduce noise is to use more samples for the gradient estimate, but this can be computationally expensive. Instead, we view the noisy gradient as a random variable, and form an inexpensive approximation of the generating procedure for the gradient sample. This approximation has high correlation with the noisy gradient by construction, making it a useful control variate for variance reduction. We demonstrate our approach on non-conjugate multi-level hierarchical models and a Bayesian neural net where we observed gradient variance reductions of multiple orders of magnitude (20-2,000x).
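    The sketch below illustrates the general control-variate mechanics the abstract relies on: a noisy gradient sample is combined with a cheap, correlated quantity whose expectation is known, which shrinks the variance without changing the mean. The toy quantities g and h and the scalar coefficient are illustrative assumptions; the paper's specific approximation of the gradient-generating procedure is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

def control_variate_estimate(g, h, h_mean):
    """Combine noisy gradient samples g with a correlated, cheap
    approximation h whose expectation h_mean is known.
    The optimal scalar coefficient is Cov(g, h) / Var(h)."""
    c = np.cov(g, h)[0, 1] / np.var(h)
    return g - c * (h - h_mean), c

# Toy illustration: g is a noisy estimate of a gradient whose true value is 2.0,
# h is a cheaper quantity correlated with g, with known mean 1.0.
eps = rng.normal(size=10_000)
g = 2.0 + eps + 0.1 * rng.normal(size=eps.size)   # noisy gradient samples
h = 1.0 + eps                                      # correlated control variate
g_cv, c = control_variate_estimate(g, h, h_mean=1.0)

print("variance before:", g.var(), "after:", g_cv.var())
```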

    Rapid temporal accumulation in spider fear: Evidence from hierarchical drift diffusion modelling

    Fear can distort our sense of time, making time seem slow or even stand still. Here, I used Hierarchical Drift Diffusion Modelling (HDDM; Vandekerckhove, Tuerlinckx, & Lee, 2008, 2011; Wiecki, Sofer, & Frank, 2013) to test the idea that temporal accumulation speeds up during fear. Eighteen high-fearful and twenty-three low-fearful participants judged the duration of both feared stimuli (spiders) and non-feared stimuli (birds) in a temporal bisection task. The drift diffusion modelling results support the main hypothesis: in high- but not low-fearful individuals, evidence accumulated more rapidly toward a long-duration decision (drift rates were higher) for spiders compared to birds. This result, and further insights into how fear affects time perception, would not have been possible based on analyses of choice proportion data alone. Further results were interpreted in the context of a recent two-stage model of time perception (Balci & Simen, 2014). The results highlight the usefulness of diffusion modelling for testing process-based explanations of disordered cognition in emotional disorders.
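    To make the notion of a drift rate concrete, the sketch below simulates single trials of a basic drift-diffusion process with an Euler-Maruyama loop: evidence accumulates toward an upper ("long") or lower ("short") boundary, and a higher drift rate yields more, and faster, "long" decisions. All parameter values and the function name are illustrative assumptions, not the fitted HDDM model from the study.

```python
import numpy as np

rng = np.random.default_rng(1)

def simulate_ddm(drift, boundary=1.0, start=0.5, noise=1.0, dt=0.001, max_t=5.0):
    """Simulate one drift-diffusion trial.

    Evidence starts at `start`, accumulates with rate `drift` plus Gaussian
    noise, and terminates at the upper boundary (a "long" response) or the
    lower boundary at 0 (a "short" response). Parameters are illustrative.
    Returns (choice, decision_time).
    """
    x, t = start, 0.0
    while 0.0 < x < boundary and t < max_t:
        x += drift * dt + noise * np.sqrt(dt) * rng.normal()
        t += dt
    return ("long" if x >= boundary else "short"), t

# A higher drift rate pushes more trials to the "long" boundary.
for v in (0.5, 2.0):
    trials = [simulate_ddm(v) for _ in range(500)]
    p_long = np.mean([c == "long" for c, _ in trials])
    print(f"drift={v}: P(long)={p_long:.2f}")
```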

    Mixed membership stochastic blockmodels

    Observations consisting of measurements on relationships for pairs of objects arise in many settings, such as protein interaction and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing such data with probabilistic models can be delicate because the simple exchangeability assumptions underlying many boilerplate models no longer hold. In this paper, we describe a latent variable model of such data called the mixed membership stochastic blockmodel. This model extends blockmodels for relational data to ones which capture mixed membership latent relational structure, thus providing an object-specific low-dimensional representation. We develop a general variational inference algorithm for fast approximate posterior inference. We explore applications to social and protein interaction networks. Comment: 46 pages, 14 figures, 3 tables.
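    A minimal generative sketch of a mixed membership stochastic blockmodel is shown below: each node draws a Dirichlet membership vector, each directed pair draws sender and receiver group indicators from those vectors, and the link is Bernoulli with a probability taken from a block interaction matrix. The hyperparameters and the toy run are assumptions for illustration, and the variational inference algorithm described in the abstract is not shown.

```python
import numpy as np

rng = np.random.default_rng(2)

def sample_mmsb(n_nodes, alpha, B):
    """Draw one directed network from a mixed membership stochastic blockmodel.

    alpha : Dirichlet concentration (length K) for the membership vectors
    B     : K x K block matrix of between-group link probabilities
    Returns (pi, Y) where pi[i] is node i's membership vector and
    Y[i, j] indicates a directed link from i to j.
    """
    K = len(alpha)
    pi = rng.dirichlet(alpha, size=n_nodes)           # mixed memberships
    Y = np.zeros((n_nodes, n_nodes), dtype=int)
    for i in range(n_nodes):
        for j in range(n_nodes):
            if i == j:
                continue
            z_send = rng.choice(K, p=pi[i])           # i's role toward j
            z_recv = rng.choice(K, p=pi[j])           # j's role toward i
            Y[i, j] = rng.binomial(1, B[z_send, z_recv])
    return pi, Y

# Toy run: 2 groups, mostly within-group links
pi, Y = sample_mmsb(20, alpha=[0.5, 0.5], B=np.array([[0.8, 0.05], [0.05, 0.8]]))
print(Y.sum(), "links sampled")
```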

    Hierarchical Gaussian process mixtures for regression

    As a result of their good performance in practice and their desirable analytical properties, Gaussian process regression models are attracting increasing interest in statistics, engineering and other fields. However, two major problems arise when the model is applied to a large data-set with repeated measurements. One stems from the systematic heterogeneity among the different replications, and the other is the requirement to invert a covariance matrix that is involved in the implementation of the model. The dimension of this matrix equals the sample size of the training data-set. In this paper, a Gaussian process mixture model for regression is proposed for dealing with the above two problems, and a hybrid Markov chain Monte Carlo (MCMC) algorithm is used for its implementation. Application to a real data-set is reported.
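    The sketch below shows why the covariance-matrix inversion mentioned in the abstract becomes a bottleneck: plain zero-mean GP regression must solve an n x n linear system in the number of training points (an O(n^3) operation), which is what the proposed mixture is designed to alleviate. The kernel, noise level and data here are illustrative assumptions; the mixture model and the hybrid MCMC implementation are not reproduced.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=1.0, variance=1.0):
    """Squared-exponential covariance between two sets of 1-D inputs."""
    d = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_posterior_mean(x_train, y_train, x_test, noise=0.1):
    """Posterior mean of a zero-mean GP regression.

    The n x n system K + noise*I must be solved (or inverted), which is
    the O(n^3) step that dominates for large training sets.
    """
    K = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    K_star = rbf_kernel(x_test, x_train)
    return K_star @ np.linalg.solve(K, y_train)

# Toy usage
x = np.linspace(0, 5, 50)
y = np.sin(x) + 0.1 * np.random.default_rng(3).normal(size=x.size)
print(gp_posterior_mean(x, y, np.array([2.5])))
```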