73 research outputs found

    Orthogonal polynomial kernels and canonical correlations for Dirichlet measures

    Full text link
    We consider a multivariate version of the so-called Lancaster problem of characterizing canonical correlation coefficients of symmetric bivariate distributions with identical marginals and orthogonal polynomial expansions. The marginal distributions examined in this paper are the Dirichlet and the Dirichlet multinomial distribution, respectively, on the continuous and the N-discrete d-dimensional simplex. Their infinite-dimensional limit distributions, respectively, the Poisson-Dirichlet distribution and Ewens's sampling formula, are considered as well. We study, in particular, the possibility of mapping canonical correlations on the d-dimensional continuous simplex (i) to canonical correlation sequences on the d+1-dimensional simplex and/or (ii) to canonical correlations on the discrete simplex, and vice versa. Driven by this motivation, the first half of the paper is devoted to providing a full characterization and probabilistic interpretation of n-orthogonal polynomial kernels (i.e., sums of products of orthogonal polynomials of the same degree n) with respect to the mentioned marginal distributions. We establish several identities and some integral representations which are multivariate extensions of important results known for the case d=2 since the 1970s. These results, along with a common interpretation of the mentioned kernels in terms of dependent Polya urns, are shown to be key features leading to several non-trivial solutions to Lancaster's problem, many of which can be extended naturally to the limit as d→∞d\rightarrow\infty.Comment: Published in at http://dx.doi.org/10.3150/11-BEJ403 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm

    Computational inference beyond Kingman's coalescent

    Get PDF
    Full likelihood inference under Kingman's coalescent is a computationally challenging problem to which importance sampling (IS) and the product of approximate conditionals (PAC) method have been applied successfully. Both methods can be expressed in terms of families of intractable conditional sampling distributions (CSDs), and rely on principled approximations for accurate inference. Recently, more general Λ- and Ξ- coalescents have been observed to provide better modelling ts to some genetic data sets. We derive families of approximate CSDs for nite sites Λ- and Ξ-coalescents, and use them to obtain "approximately optimal" IS and PAC algorithms for Λ coalescents, yielding substantial gains in efficiency over existing methods

    Wright-Fisher diffusion bridges

    Get PDF
    {\bf Abstract} The trajectory of the frequency of an allele which begins at xx at time 00 and is known to have frequency zz at time TT can be modelled by the bridge process of the Wright-Fisher diffusion. Bridges when x=z=0x=z=0 are particularly interesting because they model the trajectory of the frequency of an allele which appears at a time, then is lost by random drift or mutation after a time TT. The coalescent genealogy back in time of a population in a neutral Wright-Fisher diffusion process is well understood. In this paper we obtain a new interpretation of the coalescent genealogy of the population in a bridge from a time t∈(0,T)t\in (0,T). In a bridge with allele frequencies of 0 at times 0 and TT the coalescence structure is that the population coalesces in two directions from tt to 00 and tt to TT such that there is just one lineage of the allele under consideration at times 00 and TT. The genealogy in Wright-Fisher diffusion bridges with selection is more complex than in the neutral model, but still with the property of the population branching and coalescing in two directions from time t∈(0,T)t\in (0,T). The density of the frequency of an allele at time tt is expressed in a way that shows coalescence in the two directions. A new algorithm for exact simulation of a neutral Wright-Fisher bridge is derived. This follows from knowing the density of the frequency in a bridge and exact simulation from the Wright-Fisher diffusion. The genealogy of the neutral Wright-Fisher bridge is also modelled by branching P\'olya urns, extending a representation in a Wright-Fisher diffusion. This is a new very interesting representation that relates Wright-Fisher bridges to classical urn models in a Bayesian setting

    The effective strength of selection in random environment

    Full text link
    We analyse a family of two-types Wright-Fisher models with selection in a random environment and skewed offspring distribution. We provide a calculable criterion to quantify the impact of different shapes of selection on the fate of the weakest allele, and thus compare them. The main mathematical tool is duality, which we prove to hold, also in presence of random environment (quenched and in some cases annealed), between the population's allele frequencies and genealogy, both in the case of finite population size and in the scaling limit for large size. Duality also yields new insight on properties of branching-coalescing processes in random environment, such as their long term behaviour.Comment: 36 pages; v2 corrects an error in the proof of Thm 3.

    Exact simulation of the Wright-Fisher diffusion

    Get PDF
    The Wright–Fisher family of diffusion processes is a widely used class of evolutionary models. However, simulation is difficult because there is no known closed-form formula for its transition function. In this article, we demonstrate that it is in fact possible to simulate exactly from a broad class of Wright–Fisher diffusion processes and their bridges. For those diffusions corresponding to reversible, neutral evolution, our key idea is to exploit an eigenfunction expansion of the transition function; this approach even applies to its infinite-dimensional analogue, the Fleming–Viot process. We then develop an exact rejection algorithm for processes with more general drift functions, including those modelling natural selection, using ideas from retrospective simulation. Our approach also yields methods for exact simulation of the moment dual of the Wright–Fisher diffusion, the ancestral process of an infinite-leaf Kingman coalescent tree. We believe our new perspective on diffusion simulation holds promise for other models admitting a transition eigenfunction expansion

    Duality and fixation in Ξ-Wright-Fisher processes with frequency-dependent selection

    Get PDF
    A two-types, discrete-time population model with finite, constant size is constructed, allowing for a general form of frequency-dependent selection and skewed offspring distribution. Selection is defined based on the idea that individuals first choose a (random) number of \emph{potential} parents from the previous generation and then, from the selected pool, they inherit the type of the fittest parent. The probability distribution function of the number of potential parents per individual thus parametrises entirely the selection mechanism. Using sampling- and moment-duality, weak convergence is then proved both for the allele frequency process of the selectively weak type and for the population's ancestral process. The scaling limits are, respectively, a two-types Ξ\Xi-Fleming-Viot jump-diffusion process with frequency-dependent selection, and a branching-coalescing process with general branching and simultaneous multiple collisions. Duality also leads to a characterisation of the probability of extinction of the selectively weak allele, in terms of the ancestral process' ergodic properties

    Bayesian non-parametric inference for Λ-coalescents : posterior consistency and a parametric method

    Get PDF
    We investigate Bayesian non-parametric inference of the Λ-measure of Λ-coalescent processes with recurrent mutation, parametrised by probability measures on the unit interval. We give verifiable criteria on the prior for posterior consistency when observations form a time series, and prove that any non-trivial prior is inconsistent when all observations are contemporaneous. We then show that the likelihood given a data set of size n∈N is constant across Λ-measures whose leading n−2 moments agree, and focus on inferring truncated sequences of moments. We provide a large class of functionals which can be extremised using finite computation given a credible region of posterior truncated moment sequences, and a pseudo-marginal Metropolis–Hastings algorithm for sampling the posterior. Finally, we compare the efficiency of the exact and noisy pseudo-marginal algorithms with and without delayed acceptance acceleration using a simulation study

    Duality and fixation in XiXi-Wright--Fisher processes with frequency-dependent selection

    Get PDF
    A two-types, discrete-time population model with finite, constant size is constructed, allowing for a general form of frequency-dependent selection and skewed offspring distribution. Selection is defined based on the idea that individuals first choose a (random) number of emphpotential parents from the previous generation and then, from the selected pool, they inherit the type of the fittest parent. The probability distribution function of the number of potential parents per individual thus parametrises entirely the selection mechanism. Using duality, weak convergence is then proved both for the allele frequency process of the selectively weak type and for the population's ancestral process. The scaling limits are, respectively, a two-types Ξ--Fleming-Viot jump-diffusion process with frequency-dependent selection, and a branching-coalescing process with general branching and simultaneous multiple collisions. Duality also leads to a characterisation of the probability of extinction of the selectively weak allele, in terms of the ancestral process' ergodic properties

    Bernoulli factories and duality in Wright-Fisher and Allen-Cahn models of population genetics

    Full text link
    Mathematical models of genetic evolution often come in pairs, connected by a so-called duality relation. The most seminal example are the Wright-Fisher diffusion and the Kingman coalescent, where the former describes the stochastic evolution of neutral allele frequencies in a large population forward in time, and the latter describes the genetic ancestry of randomly sampled individuals from the population backward in time. As well as providing a richer description than either model in isolation, duality often yields equations satisfied by unknown quantities of interest. We employ the so-called Bernoulli factory, a celebrated tool in simulation-based computing, to derive duality relations for broad classes of genetics models. As concrete examples, we present Wright-Fisher diffusions with general drift functions, and Allen-Cahn equations with general, nonlinear forcing terms. The drift and forcing functions can be interpreted as the action of frequency-dependent selection. To our knowledge, this work is the first time a connection has been drawn between Bernoulli factories and duality in models of population genetics.Comment: 10 page
    • …
    corecore