73 research outputs found
Orthogonal polynomial kernels and canonical correlations for Dirichlet measures
We consider a multivariate version of the so-called Lancaster problem of
characterizing canonical correlation coefficients of symmetric bivariate
distributions with identical marginals and orthogonal polynomial expansions.
The marginal distributions examined in this paper are the Dirichlet and the
Dirichlet multinomial distribution, respectively, on the continuous and the
N-discrete d-dimensional simplex. Their infinite-dimensional limit
distributions, respectively, the Poisson-Dirichlet distribution and Ewens's
sampling formula, are considered as well. We study, in particular, the
possibility of mapping canonical correlations on the d-dimensional continuous
simplex (i) to canonical correlation sequences on the d+1-dimensional simplex
and/or (ii) to canonical correlations on the discrete simplex, and vice versa.
Driven by this motivation, the first half of the paper is devoted to providing
a full characterization and probabilistic interpretation of n-orthogonal
polynomial kernels (i.e., sums of products of orthogonal polynomials of the
same degree n) with respect to the mentioned marginal distributions. We
establish several identities and some integral representations which are
multivariate extensions of important results known for the case d=2 since the
1970s. These results, along with a common interpretation of the mentioned
kernels in terms of dependent Polya urns, are shown to be key features leading
to several non-trivial solutions to Lancaster's problem, many of which can be
extended naturally to the limit as .Comment: Published in at http://dx.doi.org/10.3150/11-BEJ403 the Bernoulli
(http://isi.cbs.nl/bernoulli/) by the International Statistical
Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm
Computational inference beyond Kingman's coalescent
Full likelihood inference under Kingman's coalescent is a computationally challenging problem to which importance sampling (IS) and the product of approximate conditionals (PAC) method have been applied successfully. Both methods can be expressed in terms of families of intractable conditional sampling distributions (CSDs), and rely on principled approximations for accurate inference. Recently, more general Î- and Î- coalescents have been observed to provide better modelling ts to some genetic data sets. We derive families of approximate CSDs for nite sites Î- and Î-coalescents, and use them to obtain "approximately optimal" IS and PAC algorithms for Î coalescents, yielding substantial gains in efficiency over existing methods
Wright-Fisher diffusion bridges
{\bf Abstract} The trajectory of the frequency of an allele which begins at
at time and is known to have frequency at time can be modelled
by the bridge process of the Wright-Fisher diffusion. Bridges when are
particularly interesting because they model the trajectory of the frequency of
an allele which appears at a time, then is lost by random drift or mutation
after a time . The coalescent genealogy back in time of a population in a
neutral Wright-Fisher diffusion process is well understood. In this paper we
obtain a new interpretation of the coalescent genealogy of the population in a
bridge from a time . In a bridge with allele frequencies of 0 at
times 0 and the coalescence structure is that the population coalesces in
two directions from to and to such that there is just one
lineage of the allele under consideration at times and . The genealogy
in Wright-Fisher diffusion bridges with selection is more complex than in the
neutral model, but still with the property of the population branching and
coalescing in two directions from time . The density of the
frequency of an allele at time is expressed in a way that shows coalescence
in the two directions. A new algorithm for exact simulation of a neutral
Wright-Fisher bridge is derived. This follows from knowing the density of the
frequency in a bridge and exact simulation from the Wright-Fisher diffusion.
The genealogy of the neutral Wright-Fisher bridge is also modelled by branching
P\'olya urns, extending a representation in a Wright-Fisher diffusion. This is
a new very interesting representation that relates Wright-Fisher bridges to
classical urn models in a Bayesian setting
The effective strength of selection in random environment
We analyse a family of two-types Wright-Fisher models with selection in a
random environment and skewed offspring distribution. We provide a calculable
criterion to quantify the impact of different shapes of selection on the fate
of the weakest allele, and thus compare them. The main mathematical tool is
duality, which we prove to hold, also in presence of random environment
(quenched and in some cases annealed), between the population's allele
frequencies and genealogy, both in the case of finite population size and in
the scaling limit for large size. Duality also yields new insight on properties
of branching-coalescing processes in random environment, such as their long
term behaviour.Comment: 36 pages; v2 corrects an error in the proof of Thm 3.
Exact simulation of the Wright-Fisher diffusion
The WrightâFisher family of diffusion processes is a widely used class of evolutionary models. However, simulation is difficult because there is no known closed-form formula for its transition function. In this article, we demonstrate that it is in fact possible to simulate exactly from a broad class of WrightâFisher diffusion processes and their bridges. For those diffusions corresponding to reversible, neutral evolution, our key idea is to exploit an eigenfunction expansion of the transition function; this approach even applies to its infinite-dimensional analogue, the FlemingâViot process. We then develop an exact rejection algorithm for processes with more general drift functions, including those modelling natural selection, using ideas from retrospective simulation. Our approach also yields methods for exact simulation of the moment dual of the WrightâFisher diffusion, the ancestral process of an infinite-leaf Kingman coalescent tree. We believe our new perspective on diffusion simulation holds promise for other models admitting a transition eigenfunction expansion
Duality and fixation in Î-Wright-Fisher processes with frequency-dependent selection
A two-types, discrete-time population model with finite, constant size is constructed, allowing for a general form of frequency-dependent selection and skewed offspring distribution. Selection is defined based on the idea that individuals first choose a (random) number of \emph{potential} parents from the previous generation and then, from the selected pool, they inherit the type of the fittest parent. The probability distribution function of the number of potential parents per individual thus parametrises entirely the selection mechanism. Using sampling- and moment-duality, weak convergence is then proved both for the allele frequency process of the selectively weak type and for the population's ancestral process. The scaling limits are, respectively, a two-types -Fleming-Viot jump-diffusion process with frequency-dependent selection, and a branching-coalescing process with general branching and simultaneous multiple collisions. Duality also leads to a characterisation of the probability of extinction of the selectively weak allele, in terms of the ancestral process' ergodic properties
Bayesian non-parametric inference for Î-coalescents : posterior consistency and a parametric method
We investigate Bayesian non-parametric inference of the Î-measure of Î-coalescent processes with recurrent mutation, parametrised by probability measures on the unit interval. We give verifiable criteria on the prior for posterior consistency when observations form a time series, and prove that any non-trivial prior is inconsistent when all observations are contemporaneous. We then show that the likelihood given a data set of size nâN is constant across Î-measures whose leading nâ2 moments agree, and focus on inferring truncated sequences of moments. We provide a large class of functionals which can be extremised using finite computation given a credible region of posterior truncated moment sequences, and a pseudo-marginal MetropolisâHastings algorithm for sampling the posterior. Finally, we compare the efficiency of the exact and noisy pseudo-marginal algorithms with and without delayed acceptance acceleration using a simulation study
Duality and fixation in -Wright--Fisher processes with frequency-dependent selection
A two-types, discrete-time population model with finite, constant size is constructed, allowing for a general form of frequency-dependent selection and skewed offspring distribution. Selection is defined based on the idea that individuals first choose a (random) number of emphpotential parents from the previous generation and then, from the selected pool, they inherit the type of the fittest parent. The probability distribution function of the number of potential parents per individual thus parametrises entirely the selection mechanism. Using duality, weak convergence is then proved both for the allele frequency process of the selectively weak type and for the population's ancestral process. The scaling limits are, respectively, a two-types Î--Fleming-Viot jump-diffusion process with frequency-dependent selection, and a branching-coalescing process with general branching and simultaneous multiple collisions. Duality also leads to a characterisation of the probability of extinction of the selectively weak allele, in terms of the ancestral process' ergodic properties
Bernoulli factories and duality in Wright-Fisher and Allen-Cahn models of population genetics
Mathematical models of genetic evolution often come in pairs, connected by a
so-called duality relation. The most seminal example are the Wright-Fisher
diffusion and the Kingman coalescent, where the former describes the stochastic
evolution of neutral allele frequencies in a large population forward in time,
and the latter describes the genetic ancestry of randomly sampled individuals
from the population backward in time. As well as providing a richer description
than either model in isolation, duality often yields equations satisfied by
unknown quantities of interest. We employ the so-called Bernoulli factory, a
celebrated tool in simulation-based computing, to derive duality relations for
broad classes of genetics models. As concrete examples, we present
Wright-Fisher diffusions with general drift functions, and Allen-Cahn equations
with general, nonlinear forcing terms. The drift and forcing functions can be
interpreted as the action of frequency-dependent selection. To our knowledge,
this work is the first time a connection has been drawn between Bernoulli
factories and duality in models of population genetics.Comment: 10 page
- âŚ