Adaptive approximate Bayesian computation for complex models
Approximate Bayesian computation (ABC) is a family of computational techniques in Bayesian statistics. These techniques make it possible to fit a model to data without relying on the computation of the model likelihood. Instead, they require the model being fitted to be simulated a large number of times. A number of refinements to the original rejection-based ABC scheme have been proposed, including the sequential improvement of posterior distributions. This technique decreases the number of model simulations required, but it still presents several shortcomings that are particularly problematic for complex models that are costly to simulate. Here we provide a new algorithm to perform adaptive approximate Bayesian computation, which is shown to perform better on both a toy example and a complex social model.
Comment: 14 pages, 5 figures
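For orientation, here is a minimal sketch of the baseline rejection-ABC scheme that such adaptive refinements build on. The Gaussian toy model, uniform prior, mean summary statistic, and fixed tolerance are illustrative assumptions, not the paper's algorithm:

```python
# Minimal rejection ABC on a toy Gaussian-mean problem (illustrative only).
import numpy as np

rng = np.random.default_rng(0)
observed = rng.normal(loc=2.0, scale=1.0, size=50)   # synthetic "data"

def simulate(theta, size=50):
    # The model to be fitted: a Gaussian with unknown mean, unit variance.
    return rng.normal(loc=theta, scale=1.0, size=size)

def distance(x, y):
    # Compare data sets through a summary statistic (the sample mean).
    return abs(x.mean() - y.mean())

def rejection_abc(n_samples, tolerance):
    accepted = []
    while len(accepted) < n_samples:
        theta = rng.uniform(-10.0, 10.0)             # draw from the prior
        if distance(simulate(theta), observed) < tolerance:
            accepted.append(theta)                   # keep if simulation is close
    return np.array(accepted)

posterior = rejection_abc(n_samples=500, tolerance=0.1)
print(posterior.mean(), posterior.std())
```

The adaptive schemes discussed above replace the fixed prior draws and fixed tolerance with a sequence of intermediate distributions and shrinking tolerances, reusing information across rounds to cut the number of simulations.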
Non-linear regression models for Approximate Bayesian Computation
Approximate Bayesian inference on the basis of summary statistics is well-suited to complex problems for which the likelihood is either mathematically or computationally intractable. However, methods that use rejection suffer from the curse of dimensionality as the number of summary statistics increases. Here we propose a machine-learning approach to the estimation of the posterior density that introduces two innovations. The new method fits a nonlinear conditional heteroscedastic regression of the parameter on the summary statistics, and then adaptively improves estimation using importance sampling. The new algorithm is compared to state-of-the-art approximate Bayesian methods, and achieves a considerable reduction of the computational burden in two examples of inference: in statistical genetics and in a queueing model.
Comment: 4 figures; version 3, minor changes; to appear in Statistics and Computing
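To illustrate the regression-adjustment idea in its simplest form, here is a sketch of the classical linear adjustment; the paper's contribution replaces this linear fit with a nonlinear, heteroscedastic regression and adds importance sampling. The function names and dummy data are illustrative:

```python
# Linear regression adjustment of accepted ABC draws (illustrative sketch).
import numpy as np

def regression_adjust(theta, summaries, observed_summary):
    # Fit theta ~ a + b * s by least squares on accepted ABC draws,
    # then shift each draw towards the observed summary statistic:
    #   theta_adj = theta - b * (s - s_obs)
    X = np.column_stack([np.ones_like(summaries), summaries])
    coef, *_ = np.linalg.lstsq(X, theta, rcond=None)
    b = coef[1]
    return theta - b * (summaries - observed_summary)

# Usage on dummy accepted draws:
rng = np.random.default_rng(1)
theta = rng.normal(0.0, 1.0, size=1000)                 # accepted parameters
summaries = theta + rng.normal(0.0, 0.5, size=1000)     # their summaries
adjusted = regression_adjust(theta, summaries, observed_summary=0.3)
```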
Choosing summary statistics by least angle regression for approximate Bayesian computation
Bayesian statistical inference relies on the posterior distribution. Depending on the model, the posterior can be more or less difficult to derive. In recent years, there has been much interest in complex settings where the likelihood is analytically intractable. In such situations, approximate Bayesian computation (ABC) provides an attractive way of carrying out Bayesian inference. To obtain reliable posterior estimates, however, it is important to keep the approximation errors in ABC small. The choice of an appropriate set of summary statistics plays a crucial role in this effort. Here, we report the development of a new algorithm, based on least angle regression, for choosing summary statistics. In two population genetic examples, the performance of the new algorithm is better than that of a previously proposed approach that uses partial least squares.
Funding: Higher Education Commission (HEC); College Deanship of Scientific Research, King Saud University, Riyadh, Saudi Arabia, research group project RGP-VPP-280
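As a sketch of how least angle regression can rank candidate summaries, the following uses scikit-learn's Lars on simulated (parameter, statistics) pairs; the simulated data and the use of the LARS entry order as a ranking are illustrative assumptions, not the paper's exact algorithm:

```python
# Ranking candidate summary statistics with least angle regression (sketch).
import numpy as np
from sklearn.linear_model import Lars

rng = np.random.default_rng(2)
n_sims, n_stats = 2000, 20
stats = rng.normal(size=(n_sims, n_stats))               # candidate summaries
theta = stats[:, 0] + 0.5 * stats[:, 3] + rng.normal(scale=0.1, size=n_sims)

# LARS enters predictors one at a time; the entry order ranks the
# candidate statistics by their relevance to the parameter.
lars = Lars(n_nonzero_coefs=5).fit(stats, theta)
selected = lars.active_   # indices of statistics, in order of entry
print("selected summary statistics:", selected)
```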
Simulation-based model selection for dynamical systems in systems and population biology
Computer simulations have become an important tool across the biomedical
sciences and beyond. For many important problems several different models or
hypotheses exist and choosing which one best describes reality or observed data
is not straightforward. We therefore require suitable statistical tools that
allow us to choose rationally between different mechanistic models of e.g.
signal transduction or gene regulation networks. This is particularly
challenging in systems biology where only a small number of molecular species
can be assayed at any given time and all measurements are subject to
measurement uncertainty. Here we develop such a model selection framework based
on approximate Bayesian computation and employing sequential Monte Carlo
sampling. We show that our approach can be applied across a wide range of
biological scenarios, and we illustrate its use on real data describing
influenza dynamics and the JAK-STAT signalling pathway. Bayesian model
selection strikes a balance between the complexity of the simulation models and
their ability to describe observed data. The present approach enables us to apply this formal apparatus to any system that can be (efficiently) simulated, even when exact likelihoods are computationally intractable.
Comment: This article is in press in Bioinformatics, 2009. Advance Access is available on the Bioinformatics webpage.
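The core model-choice idea can be sketched in its simplest rejection form: treat the model index as an extra discrete parameter and estimate posterior model probabilities by acceptance frequencies. The paper embeds this within sequential Monte Carlo; the two toy models below are illustrative assumptions:

```python
# Rejection-ABC model choice between two toy models (illustrative sketch).
import numpy as np

rng = np.random.default_rng(3)
observed = rng.poisson(lam=4.0, size=30)

def simulate(model, size=30):
    if model == 0:
        return rng.poisson(lam=rng.uniform(0, 10), size=size)    # Poisson model
    return rng.geometric(p=rng.uniform(0.05, 0.95), size=size)   # geometric model

def distance(x, y):
    return abs(x.mean() - y.mean()) + abs(x.var() - y.var())

accepted = []
while len(accepted) < 1000:
    m = rng.integers(2)                     # model index from a uniform prior
    if distance(simulate(m), observed) < 1.0:
        accepted.append(m)

# Posterior model probabilities are acceptance frequencies per model.
print(np.bincount(accepted, minlength=2) / len(accepted))
```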
Bayesian Parameter Estimation for Latent Markov Random Fields and Social Networks
Undirected graphical models are widely used in statistics, physics and
machine vision. However, Bayesian parameter estimation for undirected models is
extremely challenging, since evaluation of the posterior typically involves the
calculation of an intractable normalising constant. This problem has received
much attention, but very little of this has focussed on the important practical
case where the data consists of noisy or incomplete observations of the
underlying hidden structure. This paper specifically addresses this problem,
comparing two alternative methodologies. In the first of these approaches
particle Markov chain Monte Carlo (Andrieu et al., 2010) is used to efficiently
explore the parameter space, combined with the exchange algorithm (Murray et
al., 2006) for avoiding the calculation of the intractable normalising constant
(a proof showing that this combination targets the correct distribution is found in a supplementary appendix online). This approach is compared with
approximate Bayesian computation (Pritchard et al., 1999). Applications to
estimating the parameters of Ising models and exponential random graphs from
noisy data are presented. Each algorithm used in the paper targets an
approximation to the true posterior due to the use of MCMC to simulate from the
latent graphical model, in lieu of being able to do this exactly in general.
The supplementary appendix also describes the nature of the resulting
approximation.
Comment: 26 pages, 2 figures; accepted in the Journal of Computational and Graphical Statistics (http://www.amstat.org/publications/jcgs.cfm)
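For concreteness, here is a sketch of the exchange algorithm (Murray et al., 2006) on a toy exponential model whose normalising constant is treated as unknown, so that exact auxiliary simulation is possible; the model, flat prior, and random-walk proposal are illustrative assumptions:

```python
# Exchange algorithm on a toy model with a "pretend-unknown" normalising
# constant (illustrative sketch).
import numpy as np

rng = np.random.default_rng(4)
y_obs = rng.exponential(scale=1.0 / 2.5, size=40)    # "observed" data, rate 2.5

def log_q(y, theta):
    # Unnormalised log-likelihood: q(y | theta) = exp(-theta * sum(y)).
    return -theta * y.sum()

def sample_model(theta, size):
    # Exact simulation from the model, as the exchange algorithm requires.
    return rng.exponential(scale=1.0 / theta, size=size)

theta, chain = 1.0, []
for _ in range(5000):
    prop = abs(theta + rng.normal(scale=0.3))        # reflected random walk
    y_aux = sample_model(prop, y_obs.size)           # auxiliary data at proposal
    # Normalising constants cancel in this ratio; that is the key trick.
    log_a = (log_q(y_obs, prop) - log_q(y_obs, theta)
             + log_q(y_aux, theta) - log_q(y_aux, prop))
    if np.log(rng.uniform()) < log_a:                # flat prior on theta > 0 assumed
        theta = prop
    chain.append(theta)

print(np.mean(chain[1000:]))                         # roughly 2.5 after burn-in
```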
Global parameter identification of stochastic reaction networks from single trajectories
We consider the problem of inferring the unknown parameters of a stochastic
biochemical network model from a single measured time-course of the
concentration of some of the involved species. Such measurements are available,
e.g., from live-cell fluorescence microscopy in image-based systems biology. In
addition, fluctuation time-courses from, e.g., fluorescence correlation
spectroscopy provide additional information about the system dynamics that can
be used to more robustly infer parameters than when considering only mean
concentrations. Estimating model parameters from a single experimental
trajectory enables single-cell measurements and quantification of cell-to-cell
variability. We propose a novel combination of an adaptive Monte Carlo sampler,
called Gaussian Adaptation, and efficient exact stochastic simulation
algorithms that allows parameter identification from single stochastic
trajectories. We benchmark the proposed method on a linear and a non-linear
reaction network at steady state and during transient phases. In addition, we
demonstrate that the present method also provides an ellipsoidal volume
estimate of the viable part of parameter space and is able to estimate the
physical volume of the compartment in which the observed reactions take place.
Comment: Article in press as a book chapter in Springer's "Advances in Systems Biology".
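The exact stochastic simulation component can be sketched with Gillespie's algorithm on a toy birth-death network; the network and rate constants are illustrative assumptions, not the reaction systems benchmarked in the paper:

```python
# Gillespie (exact stochastic simulation) for a toy birth-death process.
import numpy as np

def gillespie_birth_death(k_birth, k_death, x0=10, t_end=10.0, seed=5):
    rng = np.random.default_rng(seed)
    t, x = 0.0, x0
    times, counts = [t], [x]
    while t < t_end:
        rates = np.array([k_birth, k_death * x])     # reaction propensities
        total = rates.sum()
        if total == 0:
            break
        t += rng.exponential(1.0 / total)            # time to next reaction
        # Pick which reaction fires, proportional to its propensity.
        x += 1 if rng.uniform() < rates[0] / total else -1
        times.append(t)
        counts.append(x)
    return np.array(times), np.array(counts)

times, counts = gillespie_birth_death(k_birth=5.0, k_death=0.5)
```

The proposed method treats such a simulator as a black box, using Gaussian Adaptation to search parameter space for values whose simulated trajectories match the single observed one.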
Piecewise Approximate Bayesian Computation: fast inference for discretely observed Markov models using a factorised posterior distribution
Many modern statistical applications involve inference for complicated stochastic models for which the likelihood function is difficult or even impossible to calculate, so conventional likelihood-based inferential techniques cannot be used. In such settings, Bayesian inference can be performed using Approximate Bayesian Computation (ABC). However, in spite of many recent developments in ABC methodology, in many applications the computational cost of ABC necessitates choices of summary statistics and tolerances that can severely bias the estimate of the posterior.
We propose a new “piecewise” ABC approach suitable for discretely observed Markov models that involves writing the posterior density of the parameters as a product of factors, each a function of only a subset of the data, and then using ABC within each factor. The approach has the advantage of sidestepping the need to choose a summary statistic, and it enables a stringent tolerance to be set, making the posterior “less approximate”. We investigate two methods for estimating the posterior density based on the ABC samples for each of the factors: the first uses a Gaussian approximation for each factor, and the second uses a kernel density estimate. Both methods have their merits: the Gaussian approximation is simple, fast, and probably adequate for many applications, while the kernel density estimate has the benefit of consistently estimating the true piecewise ABC posterior as the number of ABC samples tends to infinity. We illustrate the piecewise ABC approach with four examples; in each case, the approach offers fast and accurate inference.
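A sketch of the piecewise idea with the Gaussian option, on an assumed AR(1) stand-in for a discretely observed Markov model: run ABC on each consecutive-observation factor, fit a Gaussian to each factor's accepted draws, and combine the Gaussians by adding precisions. This glosses over details the paper treats carefully, such as the correction for the repeated prior:

```python
# Piecewise ABC with per-factor Gaussian approximations (illustrative sketch).
import numpy as np

rng = np.random.default_rng(6)
y = [0.0]
for _ in range(20):                                   # simulated observations
    y.append(0.7 * y[-1] + rng.normal(scale=0.5))

def abc_factor(y_prev, y_next, n=300, tol=0.05):
    accepted = []
    while len(accepted) < n:
        theta = rng.uniform(-1, 1)                    # prior on AR coefficient
        sim = theta * y_prev + rng.normal(scale=0.5)  # one-step simulation
        if abs(sim - y_next) < tol:                   # raw-data tolerance, no summary
            accepted.append(theta)
    return np.array(accepted)

# Gaussian approximation per factor, combined by precision weighting.
prec_total, mean_total = 0.0, 0.0
for t in range(1, len(y)):
    draws = abc_factor(y[t - 1], y[t])
    prec = 1.0 / draws.var()
    prec_total += prec
    mean_total += prec * draws.mean()

print("combined posterior mean:", mean_total / prec_total)
```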
Bayesian model comparison with un-normalised likelihoods
Models for which the likelihood function can be evaluated only up to a parameter-dependent unknown normalizing constant, such as Markov random field models, are used widely in computer science, statistical physics, spatial statistics, and network analysis. However, Bayesian analysis of these models using standard Monte Carlo methods is not possible, due to the intractability of their likelihood functions. Several methods that permit exact, or close to exact, simulation from the posterior distribution have recently been developed. However, estimating the evidence and Bayes factors for these models remains challenging in general. This paper describes new random-weight importance sampling and sequential Monte Carlo methods for estimating Bayes factors that use simulation to circumvent the evaluation of the intractable likelihood, and compares them to existing methods. In some cases we observe an advantage in the use of biased weight estimates. An initial investigation into the theoretical and empirical properties of this class of methods is presented. Some support for the use of biased estimates is found, but we advocate caution in the use of such estimates.
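The random-weight idea can be sketched for the evidence of a toy Gaussian latent-variable model, where each importance weight is an unbiased Monte Carlo estimate of the likelihood rather than an exact evaluation; the model and proposal are illustrative assumptions:

```python
# Evidence estimation by importance sampling with estimated (random) weights.
import numpy as np

rng = np.random.default_rng(7)
y_obs = 1.3                                           # single observation

def likelihood_estimate(theta, n_inner=50):
    # Unbiased estimate of N(y_obs | theta, 2), obtained by averaging
    # N(y_obs | theta + z, 1) over latent draws z ~ N(0, 1).
    z = rng.normal(size=n_inner)
    return np.mean(np.exp(-0.5 * (y_obs - theta - z) ** 2) / np.sqrt(2 * np.pi))

# Importance sampling from the prior N(0, 1); the evidence estimate remains
# unbiased even though each weight is itself random.
thetas = rng.normal(size=2000)
weights = np.array([likelihood_estimate(t) for t in thetas])
print("evidence estimate:", weights.mean())
# Exact evidence: N(y_obs | 0, 3), since y = theta + z + noise, all unit-variance.
print("exact:", np.exp(-y_obs**2 / 6) / np.sqrt(6 * np.pi))
```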
Methods for detecting associations between phenotype and aggregations of rare variants
Although genome-wide association studies have uncovered variants associated with more than 150 traits, the percentage of phenotypic variation explained by these associations remains small. This has led to the search for the dark matter that explains this missing genetic component of heritability. One potential explanation for dark matter is rare variants, and several statistics have been devised to detect associations resulting from aggregations of rare variants in relatively short regions of interest, such as candidate genes. In this paper we investigate the feasibility of extending this approach in an agnostic way, in which we consider all variants within a much broader region of interest, such as an entire chromosome or even the entire exome. Our method searches for subsets of variant sites using either Markov chain Monte Carlo or genetic algorithms. The analysis was performed with knowledge of the Genetic Analysis Workshop 17 answers.
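A stochastic subset search of this flavour can be sketched as follows, with a simple burden-score objective and a Metropolis-style flip move; the scoring function, simulated data, and temperature are illustrative assumptions, not the paper's method:

```python
# Stochastic search over subsets of variant sites (illustrative sketch).
import numpy as np

rng = np.random.default_rng(8)
n_subjects, n_sites = 500, 40
genotypes = rng.binomial(1, 0.02, size=(n_subjects, n_sites))  # rare variants
phenotype = genotypes[:, :5].sum(axis=1) + rng.normal(size=n_subjects)

def score(subset):
    if not subset.any():
        return 0.0
    burden = genotypes[:, subset].sum(axis=1)   # aggregate rare-variant count
    if burden.std() == 0:
        return 0.0
    return abs(np.corrcoef(burden, phenotype)[0, 1])

subset = rng.uniform(size=n_sites) < 0.5
current = score(subset)
for _ in range(5000):
    j = rng.integers(n_sites)
    subset[j] = ~subset[j]                      # propose flipping one site
    new = score(subset)
    if new >= current or rng.uniform() < np.exp((new - current) / 0.01):
        current = new                           # accept the move
    else:
        subset[j] = ~subset[j]                  # reject: flip back
print("selected sites:", np.flatnonzero(subset))
```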
Probabilistic machine learning and artificial intelligence.
How can a machine learn from experience? Probabilistic modelling provides a framework for understanding what learning is, and has therefore emerged as one of the principal theoretical and practical approaches for designing machines that learn from data acquired through experience. The probabilistic framework, which describes how to represent and manipulate uncertainty about models and predictions, has a central role in scientific data analysis, machine learning, robotics, cognitive science and artificial intelligence. This Review provides an introduction to this framework, and discusses some of the state-of-the-art advances in the field, namely, probabilistic programming, Bayesian optimization, data compression and automatic model discovery.
The author acknowledges an EPSRC grant EP/I036575/1, the DARPA PPAML programme, a Google Focused Research Award for the Automatic Statistician and support from Microsoft Research. This is the author accepted manuscript; the final version is available from NPG at http://www.nature.com/nature/journal/v521/n7553/full/nature14541.html#abstract
