On an adaptive preconditioned Crank-Nicolson MCMC algorithm for infinite dimensional Bayesian inferences
Many scientific and engineering problems require performing Bayesian inference for unknowns of infinite dimension. In such problems, many standard Markov Chain Monte Carlo (MCMC) algorithms become arbitrarily slow under mesh refinement, a behaviour referred to as being dimension dependent. To address this, a family of dimension-independent MCMC algorithms, known as the preconditioned Crank-Nicolson (pCN) methods, was proposed to sample the infinite dimensional parameters. In this work we develop an adaptive version of the pCN algorithm, in which the covariance operator of the proposal distribution is adjusted based on the sampling history to improve simulation efficiency. We show that the proposed algorithm satisfies an important ergodicity condition under some mild assumptions. Finally, we provide numerical examples to demonstrate the performance of the proposed method.
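The adaptation described here requires estimating a covariance from the sampling history as the chain runs. A minimal sketch of such a streaming estimate is below; the class name and the jitter parameter `eps` are illustrative choices, not part of the paper's algorithm.

```python
import numpy as np

class RunningCovariance:
    """Streaming mean/covariance estimate of a sampling history
    (Welford-style update), usable to adapt a proposal covariance."""

    def __init__(self, dim):
        self.n = 0
        self.mean = np.zeros(dim)
        self.m2 = np.zeros((dim, dim))  # sum of outer-product deviations

    def update(self, x):
        """Fold one sample (e.g. the current chain state) into the estimate."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self.m2 += np.outer(delta, x - self.mean)

    def covariance(self, eps=1e-8):
        """Current sample covariance; a small jitter keeps it positive
        definite before enough samples have accumulated."""
        d = len(self.mean)
        if self.n < 2:
            return eps * np.eye(d)
        return self.m2 / (self.n - 1) + eps * np.eye(d)
```

In an adaptive scheme one would feed each (possibly rejected) state into `update` and periodically rebuild the proposal covariance from `covariance()`, subject to the diminishing-adaptation conditions that ergodicity results of this kind require.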
MCMC methods for functions: modifying old algorithms to make them faster
Many problems arising in applications result in the need to probe a probability distribution for functions. Examples include Bayesian nonparametric statistics and conditioned diffusion processes. Standard MCMC algorithms typically become arbitrarily slow under the mesh refinement dictated by the nonparametric description of the unknown function. We describe an approach to modifying a whole range of MCMC methods which ensures that their speed of convergence is robust under mesh refinement. In the applications of interest the data is often sparse and the prior specification is an essential part of the overall modelling strategy. The algorithmic approach that we describe is applicable whenever the desired probability measure has density with respect to a Gaussian process or Gaussian random field prior, and to some useful non-Gaussian priors constructed through random truncation. Applications are shown in density estimation, data assimilation in fluid mechanics, subsurface geophysics and image registration. The key design principle is to formulate the MCMC method for functions. This leads to algorithms which can be implemented via minor modification of existing algorithms, yet which show enormous speed-up on a wide range of applied problems.
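The best-known algorithm of this family is the pCN method referenced throughout this listing. A minimal finite-dimensional sketch of one pCN step is given below, assuming a target with density exp(-Φ(u)) with respect to a Gaussian prior; the function names are illustrative.

```python
import numpy as np

def pcn_step(u, phi, sample_prior, beta, rng):
    """One preconditioned Crank-Nicolson step for a target with density
    exp(-phi(u)) with respect to a Gaussian prior N(0, C).

    sample_prior(rng) must return a draw xi ~ N(0, C). The proposal
    preserves the prior exactly, so the accept ratio involves only the
    negative log-likelihood phi; this is what keeps the acceptance
    probability stable under mesh refinement."""
    xi = sample_prior(rng)
    v = np.sqrt(1.0 - beta**2) * u + beta * xi  # prior-preserving proposal
    log_alpha = phi(u) - phi(v)                 # prior terms cancel exactly
    if np.log(rng.uniform()) < log_alpha:
        return v, True
    return u, False
```

Note that, in contrast to a random walk proposal, the step size `beta` can be held fixed as the discretisation of the function is refined.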
Information-geometric Markov Chain Monte Carlo methods using Diffusions
Recent work incorporating geometric ideas in Markov chain Monte Carlo is reviewed in order to highlight these advances and their possible application in a range of domains beyond Statistics. A full exposition of Markov chains and their use in Monte Carlo simulation for Statistical inference and molecular dynamics is provided, with particular emphasis on methods based on Langevin diffusions. After this, geometric concepts in Markov chain Monte Carlo are introduced. A full derivation of the Langevin diffusion on a Riemannian manifold is given, together with a discussion of the appropriate choice of Riemannian metric for different problems. A survey of applications is provided, and some open questions are discussed.
Comment: 22 pages, 2 figures
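The simplest Langevin-based method these reviews build on is the Metropolis-adjusted Langevin algorithm (MALA): an Euler step of the Langevin diffusion is used as a proposal and corrected by a Metropolis-Hastings accept/reject step. A minimal flat-metric sketch (the Riemannian variants replace the identity preconditioner with a position-dependent metric):

```python
import numpy as np

def mala_step(x, log_pi, grad_log_pi, h, rng):
    """One Metropolis-adjusted Langevin step with step size h:
    Euler discretisation of dX = (1/2) grad log pi(X) dt + dW as the
    proposal, followed by a Metropolis-Hastings correction."""
    noise = rng.normal(size=x.shape)
    prop = x + 0.5 * h * grad_log_pi(x) + np.sqrt(h) * noise

    def log_q(a, b):
        # Log density (up to a constant) of proposing a from b.
        diff = a - b - 0.5 * h * grad_log_pi(b)
        return -np.sum(diff**2) / (2.0 * h)

    log_alpha = (log_pi(prop) + log_q(x, prop)) - (log_pi(x) + log_q(prop, x))
    if np.log(rng.uniform()) < log_alpha:
        return prop, True
    return x, False
```

The asymmetric-proposal correction `log_q` is what distinguishes MALA from the unadjusted Langevin algorithm, which is biased at any fixed step size.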
Markov Chain Monte Carlo confidence intervals
For a reversible and ergodic Markov chain with invariant distribution π, we show that a valid confidence interval for π(h) can be constructed whenever the asymptotic variance σ²(h) is finite and positive. We do not impose any additional condition on the convergence rate of the Markov chain. The confidence interval is derived using the so-called fixed-b lag-window estimator of σ²(h). We also derive a result that suggests that the proposed confidence interval procedure converges faster than classical confidence interval procedures based on the Gaussian distribution and standard central limit theorems for Markov chains.
Comment: Published at http://dx.doi.org/10.3150/15-BEJ712 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)
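A lag-window estimator of the kind the abstract refers to weights the sample autocovariances of the chain with a kernel over a bandwidth proportional to the chain length; in the fixed-b framework, that proportion b is held fixed as the chain grows. A minimal sketch with a Bartlett kernel (the kernel choice and the default bandwidth here are illustrative, not the paper's prescription):

```python
import numpy as np

def lag_window_variance(x, b=0.1):
    """Lag-window estimate of the asymptotic variance of the sample mean
    of a scalar Markov chain trace x, using a Bartlett (triangular)
    kernel with bandwidth b * len(x)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    xc = x - x.mean()
    bn = max(1, int(b * n))
    est = np.dot(xc, xc) / n                       # lag-0 autocovariance
    for k in range(1, bn):
        gamma_k = np.dot(xc[:-k], xc[k:]) / n      # lag-k autocovariance
        est += 2.0 * (1.0 - k / bn) * gamma_k      # Bartlett weights
    return est
```

An interval then takes the familiar form mean ± c · sqrt(est / n); the point of the fixed-b analysis is that the critical value c must come from a nonstandard (non-Gaussian) limit distribution rather than the usual normal quantile.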
Explicit convergence bounds for Metropolis Markov chains: isoperimetry, spectral gaps and profiles
We derive the first explicit bounds for the spectral gap of a random walk Metropolis algorithm on ℝ^d for any value of the proposal variance, which when scaled appropriately recovers the correct dependence on the dimension d for suitably regular invariant distributions. We also obtain explicit bounds on the L²-mixing time for a broad class of models. In obtaining these results, we refine the use of isoperimetric profile inequalities to obtain conductance profile bounds, which also enable the derivation of explicit bounds in a much broader class of models. We also obtain similar results for the preconditioned Crank-Nicolson Markov chain, obtaining dimension-independent bounds under suitable assumptions.
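For reference, the random walk Metropolis chain analysed above is the following elementary algorithm; the classical heuristic that the bounds make rigorous is that the proposal scale must shrink like d^{-1/2} as the dimension grows, whereas pCN avoids this shrinkage.

```python
import numpy as np

def rwm_step(x, log_pi, sigma, rng):
    """One random walk Metropolis step with an isotropic Gaussian
    proposal of standard deviation sigma. The proposal is symmetric,
    so the accept ratio is just the ratio of target densities."""
    prop = x + sigma * rng.normal(size=x.shape)
    if np.log(rng.uniform()) < log_pi(prop) - log_pi(x):
        return prop, True
    return x, False
```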
Advanced Bayesian Monte Carlo Methods for Inference and Control
Monte Carlo methods are a ubiquitous tool in modern statistics. Under the Bayesian paradigm, they are used for estimating otherwise intractable integrals that arise when integrating a function with respect to a posterior distribution. This thesis discusses several aspects of such Monte Carlo methods.
The first discussion revolves around the problem of sampling from distributions that are only almost everywhere differentiable, a class which includes all log-concave posteriors. A new sampling method based on a second-order diffusion process is proposed, new theoretical results are proved, and extensive numerical illustrations elucidate the benefits and weaknesses of the various methods applicable in these settings.
In high-dimensional settings, one can exploit the local structure of inverse problems to parallelise computations. This is explored both in fully localisable problems and in problems where the conditional independence of variables given some others holds only approximately. This thesis proposes two algorithms using parallelisation techniques, and shows their empirical performance on two localisable imaging problems.
Another problem arises when defining function space priors over high-dimensional domains. The commonly used Karhunen-Loève priors suffer from poor dimensional scaling: they require an orthogonal basis of the function space, which can often be obtained as a product of one-dimensional basis functions. This leads to the number of parameters growing exponentially in the dimension of the function's domain. The trace-class neural network prior, proposed in this thesis, scales more favourably in the dimension of the function's domain. It is a Bayesian neural network prior in which each weight and bias has an independent Gaussian prior, but with a key difference from existing Bayesian neural network priors: the variances decrease with the width of the network, such that they form a summable sequence and the infinite-width limit of the neural network is well defined. As is shown in this thesis, the resulting posterior over the unknown function is amenable to sampling using Hilbert space Markov chain Monte Carlo methods. These sampling methods are favoured because they are stable under mesh refinement, in the sense that the acceptance probability does not shrink to 0 as more parameters are introduced to better approximate the well-defined infinite-width limit. Both numerical illustrations and theoretical results show that these priors are competitive and have distinct advantages over other function space priors.
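The key structural idea, a summable sequence of weight variances, can be sketched for a single hidden layer as follows. This is a toy illustration, not the thesis's construction: the specific decay rate, the ordering of hidden units, and the tanh activation are all illustrative assumptions.

```python
import numpy as np

def sample_trace_class_network(width, dim_in, rng, decay=1.1):
    """Draw one function from a toy trace-class-style prior: independent
    Gaussian weights whose variances v_j ~ j^(-decay) form a summable
    sequence over the hidden units (decay > 1), so that widening the
    network adds increasingly small contributions."""
    v = np.arange(1, width + 1, dtype=float) ** -decay   # summable variances
    w_in = rng.normal(size=(width, dim_in)) * np.sqrt(v)[:, None]
    b = rng.normal(size=width) * np.sqrt(v)
    w_out = rng.normal(size=width) * np.sqrt(v)

    def f(x):
        """Evaluate the sampled network at a point x of shape (dim_in,)."""
        return float(w_out @ np.tanh(w_in @ x + b))

    return f
```

Because the late hidden units carry vanishing variance, a finite-width truncation approximates a well-defined limit, which is what makes mesh-refinement-stable (Hilbert space) MCMC applicable to the resulting posterior.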
These different function space priors are then used in stochastic control. To this end, a suitable likelihood for continuous value functions in a Bayesian approach to reinforcement learning is defined. This thesis proves that it can be used in conjunction with both the classical Karhunen-Loève prior and the proposed trace-class neural network prior. Numerical examples compare the resulting posteriors and illustrate the new prior's performance and dimension robustness.
Cantab Capital Institute for the Mathematics of Information
A hybrid adaptive MCMC algorithm in function spaces
The preconditioned Crank-Nicolson (pCN) method is a Markov Chain Monte Carlo (MCMC) scheme specifically designed to perform Bayesian inference in function spaces. Unlike many standard MCMC algorithms, the pCN method preserves its sampling efficiency under mesh refinement, a property referred to as being dimension independent. In this work we consider an adaptive strategy to further improve the efficiency of pCN. In particular, we develop a hybrid adaptive MCMC method: the algorithm performs an adaptive Metropolis scheme in a chosen finite dimensional subspace, and a standard pCN algorithm in the complement of that subspace. We show that the proposed algorithm satisfies certain important ergodicity conditions. Finally, with numerical examples we demonstrate that the proposed method is competitive with existing adaptive algorithms.
Comment: arXiv admin note: text overlap with arXiv:1511.0583
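A minimal finite-dimensional sketch of such a hybrid step is given below, assuming a N(0, I) prior with the adapted subspace taken as the first k coordinates; the adaptation mechanism itself (how `chol_adapt` is learned from the history) and all names are illustrative, and the real algorithm's bookkeeping is more involved.

```python
import numpy as np

def hybrid_step(u, phi, beta, k, chol_adapt, rng):
    """One hybrid step for a target exp(-phi(u)) under a N(0, I) prior:
    a symmetric adapted-Gaussian move (Cholesky factor chol_adapt) on the
    first k coordinates, and a standard pCN move on the remaining ones."""
    v = u.copy()
    v[:k] = u[:k] + chol_adapt @ rng.normal(size=k)           # adaptive MH block
    xi = rng.normal(size=len(u) - k)
    v[k:] = np.sqrt(1.0 - beta**2) * u[k:] + beta * xi        # pCN block
    # The pCN block preserves the prior, so its prior terms cancel in the
    # ratio; the symmetric block contributes its prior ratio explicitly.
    log_prior_k = lambda w: -0.5 * np.sum(w[:k] ** 2)
    log_alpha = (log_prior_k(v) - phi(v)) - (log_prior_k(u) - phi(u))
    if np.log(rng.uniform()) < log_alpha:
        return v, True
    return u, False
```

The accept ratio mixes the two regimes: a full prior-plus-likelihood ratio on the adapted finite-dimensional block, but a likelihood-only ratio on the pCN block, which is what preserves the dimension robustness of the complement move.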