90,180 research outputs found
The shape of random tanglegrams
A tanglegram consists of two binary rooted trees with the same number of
leaves and a perfect matching between the leaves of the trees. We show that the
two halves of a random tanglegram essentially look like two independently
chosen random plane binary trees. This fact is used to derive a number of
results on the shape of random tanglegrams, including theorems on the number of
cherries and generally occurrences of subtrees, the root branches, the number
of automorphisms, and the height. For each of these, we obtain limiting
probabilities or distributions. Finally, we investigate the number of matched
cherries, for which the limiting distribution is identified as well
Tracing evolutionary links between species
The idea that all life on earth traces back to a common beginning dates back
at least to Charles Darwin's {\em Origin of Species}. Ever since, biologists
have tried to piece together parts of this `tree of life' based on what we can
observe today: fossils, and the evolutionary signal that is present in the
genomes and phenotypes of different organisms. Mathematics has played a key
role in helping transform genetic data into phylogenetic (evolutionary) trees
and networks. Here, I will explain some of the central concepts and basic
results in phylogenetics, which benefit from several branches of mathematics,
including combinatorics, probability and algebra.Comment: 18 pages, 6 figures (Invited review paper (draft version) for AMM
A new balance index for phylogenetic trees
Several indices that measure the degree of balance of a rooted phylogenetic
tree have been proposed so far in the literature. In this work we define and
study a new index of this kind, which we call the total cophenetic index: the
sum, over all pairs of different leaves, of the depth of their least common
ancestor. This index makes sense for arbitrary trees, can be computed in linear
time and it has a larger range of values and a greater resolution power than
other indices like Colless' or Sackin's. We compute its maximum and minimum
values for arbitrary and binary trees, as well as exact formulas for its
expected value for binary trees under the Yule and the uniform models of
evolution. As a byproduct of this study, we obtain an exact formula for the
expected value of the Sackin index under the uniform model, a result that seems
to be new in the literature.Comment: 24 pages, 2 figures, preliminary version presented at the JBI 201
Scaling properties of protein family phylogenies
One of the classical questions in evolutionary biology is how evolutionary
processes are coupled at the gene and species level. With this motivation, we
compare the topological properties (mainly the depth scaling, as a
characterization of balance) of a large set of protein phylogenies with a set
of species phylogenies. The comparative analysis shows that both sets of
phylogenies share remarkably similar scaling behavior, suggesting the
universality of branching rules and of the evolutionary processes that drive
biological diversification from gene to species level. In order to explain such
generality, we propose a simple model which allows us to estimate the
proportion of evolvability/robustness needed to approximate the scaling
behavior observed in the phylogenies, highlighting the relevance of the
robustness of a biological system (species or protein) in the scaling
properties of the phylogenetic trees. Thus, the rules that govern the
incapability of a biological system to diversify are equally relevant both at
the gene and at the species level.Comment: Replaced with final published versio
The Shape of Unlabeled Rooted Random Trees
We consider the number of nodes in the levels of unlabelled rooted random
trees and show that the stochastic process given by the properly scaled level
sizes weakly converges to the local time of a standard Brownian excursion.
Furthermore we compute the average and the distribution of the height of such
trees. These results extend existing results for conditioned Galton-Watson
trees and forests to the case of unlabelled rooted trees and show that they
behave in this respect essentially like a conditioned Galton-Watson process.Comment: 34 pages, 1 figur
- …