182 research outputs found

    The allelic partition for coalescent point processes

    Get PDF
    Assume that individuals alive at time tt in some population can be ranked in such a way that the coalescence times between consecutive individuals are i.i.d. The ranked sequence of these branches is called a coalescent point process. We have shown in a previous work that splitting trees are important instances of such populations. Here, individuals are given DNA sequences, and for a sample of nn DNA sequences belonging to distinct individuals, we consider the number SnS_n of polymorphic sites (sites at which at least two sequences differ), and the number AnA_n of distinct haplotypes (sequences differing at one site at least). It is standard to assume that mutations arrive at constant rate (on germ lines), and never hit the same site on the DNA sequence. We study the mutation pattern associated to coalescent point processes under this assumption. Here, SnS_n and AnA_n grow linearly as nn grows, with explicit rate. However, when the branch lengths have infinite expectation, SnS_n grows more rapidly, e.g. as nln(n)n \ln(n) for critical birth--death processes. Then, we study the frequency spectrum of the sample, that is, the numbers of polymorphic sites/haplotypes carried by kk individuals in the sample. These numbers are shown to grow also linearly with sample size, and we provide simple explicit formulae for mutation frequencies and haplotype frequencies. For critical birth--death processes, mutation frequencies are given by the harmonic series and haplotype frequencies by Fisher logarithmic series

    Random ultrametric trees and applications

    Full text link
    Ultrametric trees are trees whose leaves lie at the same distance from the root. They are used to model the genealogy of a population of particles co-existing at the same point in time. We show how the boundary of an ultrametric tree, like any compact ultrametric space, can be represented in a simple way via the so-called comb metric. We display a variety of examples of random combs and explain how they can be used in applications. In particular, we review some old and recent results regarding the genetic structure of the population when throwing neutral mutations on the skeleton of the tree.Comment: 20 pages, 7 figures, proceedings of MAS 2016, Grenoble, France (Stochastic modeling and Statistics Conference, French Society for Applied and Industrial Math, SMAI

    Species abundance distributions in neutral models with immigration or mutation and general lifetimes

    Full text link
    We consider a general, neutral, dynamical model of biodiversity. Individuals have i.i.d. lifetime durations, which are not necessarily exponentially distributed, and each individual gives birth independently at constant rate \lambda. We assume that types are clonally inherited. We consider two classes of speciation models in this setting. In the immigration model, new individuals of an entirely new species singly enter the population at constant rate \mu (e.g., from the mainland into the island). In the mutation model, each individual independently experiences point mutations in its germ line, at constant rate \theta. We are interested in the species abundance distribution, i.e., in the numbers, denoted I_n(k) in the immigration model and A_n(k) in the mutation model, of species represented by k individuals, k=1,2,...,n, when there are n individuals in the total population. In the immigration model, we prove that the numbers (I_t(k);k\ge 1) of species represented by k individuals at time t, are independent Poisson variables with parameters as in Fisher's log-series. When conditioning on the total size of the population to equal n, this results in species abundance distributions given by Ewens' sampling formula. In particular, I_n(k) converges as n\to\infty to a Poisson r.v. with mean \gamma /k, where \gamma:=\mu/\lambda. In the mutation model, as n\to\infty, we obtain the almost sure convergence of n^{-1}A_n(k) to a nonrandom explicit constant. In the case of a critical, linear birth--death process, this constant is given by Fisher's log-series, namely n^{-1}A_n(k) converges to \alpha^{k}/k, where \alpha :=\lambda/(\lambda+\theta). In both models, the abundances of the most abundant species are briefly discussed.Comment: 16 pages, 4 figures. To appear in Journal of Mathematical Biology. The final publication is available at http://www.springerlink.co

    The branching process with logistic growth

    Full text link
    In order to model random density-dependence in population dynamics, we construct the random analogue of the well-known logistic process in the branching process' framework. This density-dependence corresponds to intraspecific competition pressure, which is ubiquitous in ecology, and translates mathematically into a quadratic death rate. The logistic branching process, or LB-process, can thus be seen as (the mass of) a fragmentation process (corresponding to the branching mechanism) combined with constant coagulation rate (the death rate is proportional to the number of possible coalescing pairs). In the continuous state-space setting, the LB-process is a time-changed (in Lamperti's fashion) Ornstein-Uhlenbeck type process. We obtain similar results for both constructions: when natural deaths do not occur, the LB-process converges to a specified distribution; otherwise, it goes extinct a.s. In the latter case, we provide the expectation and the Laplace transform of the absorption time, as a functional of the solution of a Riccati differential equation. We also show that the quadratic regulatory term allows the LB-process to start at infinity, despite the fact that births occur infinitely often as the initial state goes to \infty. This result can be viewed as an extension of the pure-death process starting from infinity associated to Kingman's coalescent, when some independent fragmentation is added.Comment: Published at http://dx.doi.org/10.1214/105051605000000098 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Splitting trees with neutral Poissonian mutations I: Small families

    Full text link
    We consider a neutral dynamical model of biological diversity, where individuals live and reproduce independently. They have i.i.d. lifetime durations (which are not necessarily exponentially distributed) and give birth (singly) at constant rate b. Such a genealogical tree is usually called a splitting tree, and the population counting process (N_t;t\ge 0) is a homogeneous, binary Crump--Mode--Jagers process. We assume that individuals independently experience mutations at constant rate \theta during their lifetimes, under the infinite-alleles assumption: each mutation instantaneously confers a brand new type, called allele, to its carrier. We are interested in the allele frequency spectrum at time t, i.e., the number A(t) of distinct alleles represented in the population at time t, and more specifically, the numbers A(k,t) of alleles represented by k individuals at time t, k=1,2,...,N_t. We mainly use two classes of tools: coalescent point processes and branching processes counted by random characteristics. We provide explicit formulae for the expectation of A(k,t) in a coalescent point process conditional on population size, which apply to the special case of splitting trees. We separately derive the a.s. limits of A(k,t)/N_t and of A(t)/N_t thanks to random characteristics. Last, we separately compute the expected homozygosity by applying a method characterizing the dynamics of the tree distribution as the origination time of the tree moves back in time, in the spirit of backward Kolmogorov equations.Comment: 32 pages, 2 figures. Companion paper in preparation "Splitting trees with neutral Poissonian mutations II: Large or old families

    Evolution of discrete populations and the canonical diffusion of adaptive dynamics

    Get PDF
    The biological theory of adaptive dynamics proposes a description of the long-term evolution of a structured asexual population. It is based on the assumptions of large population, rare mutations and small mutation steps, that lead to a deterministic ODE describing the evolution of the dominant type, called the ``canonical equation of adaptive dynamics.'' Here, in order to include the effect of stochasticity (genetic drift), we consider self-regulated randomly fluctuating populations subject to mutation, so that the number of coexisting types may fluctuate. We apply a limit of rare mutations to these populations, while keeping the population size finite. This leads to a jump process, the so-called ``trait substitution sequence,'' where evolution proceeds by successive invasions and fixations of mutant types. Then we apply a limit of small mutation steps (weak selection) to this jump process, that leads to a diffusion process that we call the ``canonical diffusion of adaptive dynamics,'' in which genetic drift is combined with directional selection driven by the gradient of the fixation probability, also interpreted as an invasion fitness. Finally, we study in detail the particular case of multitype logistic branching populations and seek explicit formulae for the invasion fitness of a mutant deviating slightly from the resident type. In particular, second-order terms of the fixation probability are products of functions of the initial mutant frequency, times functions of the initial total population size, called the invasibility coefficients of the resident by increased fertility, defence, aggressiveness, isolation or survival.Comment: Published at http://dx.doi.org/10.1214/105051606000000628 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Macro-evolutionary models and coalescent point processes: The shape and probability of reconstructed phylogenies

    Full text link
    Forward-time models of diversification (i.e., speciation and extinction) produce phylogenetic trees that grow "vertically" as time goes by. Pruning the extinct lineages out of such trees leads to natural models for reconstructed trees (i.e., phylogenies of extant species). Alternatively, reconstructed trees can be modelled by coalescent point processes (CPP), where trees grow "horizontally" by the sequential addition of vertical edges. Each new edge starts at some random speciation time and ends at the present time; speciation times are drawn from the same distribution independently. CPP lead to extremely fast computation of tree likelihoods and simulation of reconstructed trees. Their topology always follows the uniform distribution on ranked tree shapes (URT). We characterize which forward-time models lead to URT reconstructed trees and among these, which lead to CPP reconstructed trees. We show that for any "asymmetric" diversification model in which speciation rates only depend on time and extinction rates only depend on time and on a non-heritable trait (e.g., age), the reconstructed tree is CPP, even if extant species are incompletely sampled. If rates additionally depend on the number of species, the reconstructed tree is (only) URT (but not CPP). We characterize the common distribution of speciation times in the CPP description, and discuss incomplete species sampling as well as three special model cases in detail: 1) extinction rate does not depend on a trait; 2) rates do not depend on time; 3) mass extinctions may happen additionally at certain points in the past
    corecore