247 research outputs found

    Sterile versus prolific individuals pertaining to linear-fractional Bienaym{\'e}-Galton-Watson trees

    Full text link
    In a Bienaym\'{e}-Galton-Watson process for which there is a positiveprobability for individuals of having no offspring, there is a subtlebalance and dependence between the sterile nodes (the dead nodes or leaves)and the prolific ones (the productive nodes) both at and up to the currentgeneration. We explore the many facets of this problem, especially in thecontext of an exactly solvable linear-fractional branching mechanism at allgeneration. Eased asymptotic issues are investigated. Relation of thisspecial branching process to skip-free to the left and simple random walks'excursions is then investigated. Mutual statistical information on theirshapes can be learnt from this association

    Employing phylogenetic tree shape statistics to resolve the underlying host population structure

    Get PDF
    BACKGROUND: Host population structure is a key determinant of pathogen and infectious disease transmission patterns. Pathogen phylogenetic trees are useful tools to reveal the population structure underlying an epidemic. Determining whether a population is structured or not is useful in informing the type of phylogenetic methods to be used in a given study. We employ tree statistics derived from phylogenetic trees and machine learning classification techniques to reveal an underlying population structure. RESULTS: In this paper, we simulate phylogenetic trees from both structured and non-structured host populations. We compute eight statistics for the simulated trees, which are: the number of cherries; Sackin, Colless and total cophenetic indices; ladder length; maximum depth; maximum width, and width-to-depth ratio. Based on the estimated tree statistics, we classify the simulated trees as from either a non-structured or a structured population using the decision tree (DT), K-nearest neighbor (KNN) and support vector machine (SVM). We incorporate the basic reproductive number ([Formula: see text]) in our tree simulation procedure. Sensitivity analysis is done to investigate whether the classifiers are robust to different choice of model parameters and to size of trees. Cross-validated results for area under the curve (AUC) for receiver operating characteristic (ROC) curves yield mean values of over 0.9 for most of the classification models. CONCLUSIONS: Our classification procedure distinguishes well between trees from structured and non-structured populations using the classifiers, the two-sample Kolmogorov-Smirnov, Cucconi and Podgor-Gastwirth tests and the box plots. SVM models were more robust to changes in model parameters and tree size compared to KNN and DT classifiers. Our classification procedure was applied to real -world data and the structured population was revealed with high accuracy of [Formula: see text] using SVM-polynomial classifier

    Coagulation and Fragmentation Models

    Get PDF
    Analysis of coagulation and fragmentation is crucial to understanding many processes of scientific and industrial importance. In recent years this has led to intensified research activities in the areas of differential equations, probability theory, and combinatorics. The purpose of the workshop was to bring together people from these different areas working on various aspects of coagulation and fragmentation. We believe that the insights resulting from the interactions which have been stimulated that week should lead to further advances both in the development of mathematical techniques and in new applications

    On joint subtree distributions under two evolutionary models

    Get PDF
    In population and evolutionary biology, hypotheses about micro-evolutionary and macroevolutionary processes are commonly tested by comparing the shape indices of empirical evolutionary trees with those predicted by neutral models. A key ingredient in this approach is the ability to compute and quantify distributions of various tree shape indices under random models of interest. As a step to meet this challenge, in this paper we investigate the joint distribution of cherries and pitchforks (that is, subtrees with two and three leaves) under two widely used null models: the Yule-Harding-Kingman (YHK) model and the proportional to distinguishable arrangements (PDA) model. Based on two novel recursive formulae, we propose a dynamic approach to numerically compute the exact joint distribution (and hence the marginal distributions) for trees of any size. We also obtained insights into the statistical properties of trees generated under these two models, including a constant correlation between the cherry and the pitchfork distributions under the YHK model, and the log-concavity and unimodality of the cherry distributions under both models. In addition, we show that there exists a unique change point for the cherry distributions between these two models

    Field theories for stochastic processes

    Get PDF
    This thesis is a collection of collaborative research work which uses field-theoretic techniques to approach three different areas of stochastic dynamics: Branching Processes, First-passage times of processes with are subject to both white and coloured noise, and numerical and analytical aspects of first-passage times in fractional Brownian Motion. Chapter 1 (joint work with Rosalba Garcia Millan, Johannes Pausch, and Gunnar Pruessner, appeared in Phys. Rev. E 98 (6):062107) contains an analysis of non-spatial branching processes with arbitrary offspring distribution. Here our focus lies on the statistics of the number of particles in the system at any given time. We calculate a host of observables using Doi-Peliti field theory and find that close to criticality these observables no longer depend on the details of the offspring distribution, and are thus universal. In Chapter 2 (joint work with Ignacio Bordeu, Saoirse Amarteifio, Rosalba Garcia Millan, Nanxin Wei, and Gunnar Pruessner, appeared in Sci. Rep. 9:15590) we study the number of sites visited by a branching random walk on general graphs. To do so, we introduce a fieldtheoretic tracing mechanism which keeps track of all already visited sites. We find the scaling laws of the moments of the distribution near the critical point. Chapter 3 (joint work with Gunnar Pruessner and Guillaume Salbreux, submitted, arXiv: 2006.00116) provides an analysis of the first-passage time problem for stochastic processes subject to white and coloured noise. By way of a perturbation theory, I give a systematic and controlled expansion of the moment generating function of first-passage times. In Chapter 4, we revise the tracing mechanism found earlier and use it to characterise three different extreme values, first-passage times, running maxima, and mean volume explored. By formulating these in field-theoretic language, we are able to derive new results for a class of non-Markovian stochastic processes. Chapter 5 and 6 are concerned with the first-passage time distribution of fractional Brownian Motion. Chapter 5 (joint work with Kay Wiese, appeared in Phys. Rev. E 101 (4):043312) introduces a new algorithm to sample them efficiently. Chapter 6 (joint work with Maxence Arutkin and Kay Wiese, submitted, arXiv:1908.10801) gives a field-theoretically obtained perturbative result of the first-passage time distribution in the presence of linear and non-linear drift.Open Acces

    Probabilités et biologie

    Get PDF
    Cette session présente divers aspects de la modélisation probabiliste des populations, en mettant l’accent sur le branchement : processus de branchement à taille de population aléatoire en milieux aléatoires; coexistence de la diversité dans des processus de branchement à taille de population constante; étude des longueurs de branches externes dans les arbres généalogiques de population à taille constante; modélisation de la taille d’une tumeur (population à taille aléatoire) traitée par radiothérapie

    On asymptotic joint distributions of cherries and pitchforks for random phylogenetic trees

    Get PDF
    Tree shape statistics provide valuable quantitative insights into evolutionary mechanisms underpinning phylogenetic trees, a commonly used graph representation of evolutionary relationships among taxonomic units ranging from viruses to species. We study two subtree counting statistics, the number of cherries and the number of pitchforks, for random phylogenetic trees generated by two widely used null tree models: the proportional to distinguishable arrangements (PDA) and the Yule-Harding-Kingman (YHK) models. By developing limit theorems for a version of extended Pólya urn models in which negative entries are permitted for their replacement matrices, we deduce the strong laws of large numbers and the central limit theorems for the joint distributions of these two counting statistics for the PDA and the YHK models. Our results indicate that the limiting behaviour of these two statistics, when appropriately scaled using the number of leaves in the underlying trees, is independent of the initial tree used in the tree generating process
    corecore