56 research outputs found
Matrix norms and rapid mixing for spin systems
We give a systematic development of the application of matrix norms to rapid
mixing in spin systems. We show that rapid mixing of both random update Glauber
dynamics and systematic scan Glauber dynamics occurs if any matrix norm of the
associated dependency matrix is less than 1. We give improved analysis for the
case in which the diagonal of the dependency matrix is (as in heat
bath dynamics). We apply the matrix norm methods to random update and
systematic scan Glauber dynamics for coloring various classes of graphs. We
give a general method for estimating a norm of a symmetric nonregular matrix.
This leads to improved mixing times for any class of graphs which is hereditary
and sufficiently sparse including several classes of degree-bounded graphs such
as nonregular graphs, trees, planar graphs and graphs with given tree-width and
genus.Comment: Published in at http://dx.doi.org/10.1214/08-AAP532 the Annals of
Applied Probability (http://www.imstat.org/aap/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Spatial Mixing and Non-local Markov chains
We consider spin systems with nearest-neighbor interactions on an -vertex
-dimensional cube of the integer lattice graph . We study the
effects that exponential decay with distance of spin correlations, specifically
the strong spatial mixing condition (SSM), has on the rate of convergence to
equilibrium distribution of non-local Markov chains. We prove that SSM implies
mixing of a block dynamics whose steps can be implemented
efficiently. We then develop a methodology, consisting of several new
comparison inequalities concerning various block dynamics, that allow us to
extend this result to other non-local dynamics. As a first application of our
method we prove that, if SSM holds, then the relaxation time (i.e., the inverse
spectral gap) of general block dynamics is , where is the number of
blocks. A second application of our technology concerns the Swendsen-Wang
dynamics for the ferromagnetic Ising and Potts models. We show that SSM implies
an bound for the relaxation time. As a by-product of this implication we
observe that the relaxation time of the Swendsen-Wang dynamics in square boxes
of is throughout the subcritical regime of the -state
Potts model, for all . We also prove that for monotone spin systems
SSM implies that the mixing time of systematic scan dynamics is . Systematic scan dynamics are widely employed in practice but have
proved hard to analyze. Our proofs use a variety of techniques for the analysis
of Markov chains including coupling, functional analysis and linear algebra
Rapid Mixing of Global Markov Chains via Spectral Independence: The Unbounded Degree Case
We consider spin systems on general n-vertex graphs of unbounded degree and explore the effects of spectral independence on the rate of convergence to equilibrium of global Markov chains. Spectral independence is a novel way of quantifying the decay of correlations in spin system models, which has significantly advanced the study of Markov chains for spin systems. We prove that whenever spectral independence holds, the popular Swendsen-Wang dynamics for the q-state ferromagnetic Potts model on graphs of maximum degree ?, where ? is allowed to grow with n, converges in O((? log n)^c) steps where c > 0 is a constant independent of ? and n. We also show a similar mixing time bound for the block dynamics of general spin systems, again assuming that spectral independence holds. Finally, for monotone spin systems such as the Ising model and the hardcore model on bipartite graphs, we show that spectral independence implies that the mixing time of the systematic scan dynamics is O(?^c log n) for a constant c > 0 independent of ? and n. Systematic scan dynamics are widely popular but are notoriously difficult to analyze. This result implies optimal O(log n) mixing time bounds for any systematic scan dynamics of the ferromagnetic Ising model on general graphs up to the tree uniqueness threshold. Our main technical contribution is an improved factorization of the entropy functional: this is the common starting point for all our proofs. Specifically, we establish the so-called k-partite factorization of entropy with a constant that depends polynomially on the maximum degree of the graph
Comparison Theorems for Gibbs Measures
The Dobrushin comparison theorem is a powerful tool to bound the difference
between the marginals of high-dimensional probability distributions in terms of
their local specifications. Originally introduced to prove uniqueness and decay
of correlations of Gibbs measures, it has been widely used in statistical
mechanics as well as in the analysis of algorithms on random fields and
interacting Markov chains. However, the classical comparison theorem requires
validity of the Dobrushin uniqueness criterion, essentially restricting its
applicability in most models to a small subset of the natural parameter space.
In this paper we develop generalized Dobrushin comparison theorems in terms of
influences between blocks of sites, in the spirit of Dobrushin-Shlosman and
Weitz, that substantially extend the range of applicability of the classical
comparison theorem. Our proofs are based on the analysis of an associated
family of Markov chains. We develop in detail an application of our main
results to the analysis of sequential Monte Carlo algorithms for filtering in
high dimension.Comment: 55 page
Sampling Colourings of the Triangular Lattice
We show that the Glauber dynamics on proper 9-colourings of the triangular
lattice is rapidly mixing, which allows for efficient sampling. Consequently,
there is a fully polynomial randomised approximation scheme (FPRAS) for
counting proper 9-colourings of the triangular lattice. Proper colourings
correspond to configurations in the zero-temperature anti-ferromagnetic Potts
model. We show that the spin system consisting of proper 9-colourings of the
triangular lattice has strong spatial mixing. This implies that there is a
unique infinite-volume Gibbs distribution, which is an important property
studied in statistical physics. Our results build on previous work by Goldberg,
Martin and Paterson, who showed similar results for 10 colours on the
triangular lattice. Their work was preceded by Salas and Sokal's 11-colour
result. Both proofs rely on computational assistance, and so does our 9-colour
proof. We have used a randomised heuristic to guide us towards rigourous
results.Comment: 42 pages. Added appendix that describes implementation. Added
ancillary file
Layerwise Systematic Scan: Deep Boltzmann Machines and Beyond
For Markov chain Monte Carlo methods, one of the greatest discrepancies
between theory and system is the scan order - while most theoretical
development on the mixing time analysis deals with random updates, real-world
systems are implemented with systematic scans. We bridge this gap for models
that exhibit a bipartite structure, including, most notably, the
Restricted/Deep Boltzmann Machine. The de facto implementation for these models
scans variables in a layerwise fashion. We show that the Gibbs sampler with a
layerwise alternating scan order has its relaxation time (in terms of epochs)
no larger than that of a random-update Gibbs sampler (in terms of variable
updates). We also construct examples to show that this bound is asymptotically
tight. Through standard inequalities, our result also implies a comparison on
the mixing times.Comment: v2: typo fixes and improved presentatio
- âŠ