Skip to main content
Article thumbnail
Location of Repository

Using minimum bootstrap support for splits to construct confidence regions for trees

By Edward Susko


Many of the estimated topologies in phylogenetic studies are presented with the bootstrap support for each of the splits in the topology indicated. If phylogenetic estimation is unbiased, high bootstrap support for a split suggests that there is a good deal of certainty that the split actually is present in the tree and low bootstrap support suggests that one or more of the taxa on one side of the estimated split might in reality be located with taxa on the other side. In the latter case the follow-up questions about how many and which of the taxa could reasonably be incorrectly placed as well as where they might alternatively be placed are not addressed through the presented bootstrap support. We present here an algorithm that finds the set of all trees with minimum bootstrap support for their splits greater than some given value. The output is a ranked list of trees, ranked according to the minimum bootstrap supports for splits in the trees. The number of such trees and their topologies provides useful supplementary information in bootstrap analyses about the reasons for low bootstrap support for splits. We also present ways of quantifying low bootstrap support by considering the set of all topologies with minimum bootstrap greater than some quantity as providing a confidence region of topologies. Using a double bootstrap we are able to choose a cutoff so that the set of topologies with minimum bootstrap support for a split greater than that cutoff gives an approximate 95% confidence region. As with bootstrap support one advantage of the methods is that they are generally applicable to the wide variety of phylogenetic estimation methods

Topics: Original Research
Publisher: Libertas Academica
OAI identifier:
Provided by: PubMed Central

Suggested articles


  1. (2003). A classifi cation of consensus methods for phylogenies.
  2. (1979). A model of evolutionary change in proteins.
  3. (1993). A model search technique based on confi dence set and map of models.
  4. (1993). A randomization test of the null hypothesis that two cladograms are samples estimates of a parametric phylogenetic tree.
  5. (1998). An application of multiple comparisons techniques to model selection.
  6. (2002). An approximately unbiased test of phylogenetic tree selection.
  7. (1993). An empirical test of bootstrapping as a method for assessing confidence in phylogenetic analysis.
  8. (1993). An Introduction to the Bootstrap. Chapman and Hall, London and
  9. (1968). Atlas of protein sequence and structure 1967–1968. National Biomedical Research Foundation,
  10. (1996). Bootstrap confi dence levels for phylogenetic trees.
  11. (1993). Calibrating the bootstrap test of monophyly.
  12. (1997). Can three incongruence tests predict when data should be combined?
  13. (1998). Combinability of phylogenies and bootstrap confi dence envelopes.
  14. (1985). Confi dence limits on phylogenies: An approach using the bootstrap.
  15. (1989). Confi dence limits on phylogenies: The bootstrap revisited.
  16. (2003). Confi dence regions and hypothesis tests for topologies using generalized least squares.
  17. (1995). Estimation and confi dence in phylogeny: The complete-and-partial bootstrap technique.
  18. (2003). Estimation of ratesacross-sites distributions in phylogenetic substitution models.
  19. (1989). Evaluation of the maximum-likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in Hominoidea.
  20. (2000). Improved bootstrap confi -dence limits in large-scale phylogenies, with an example from neoastragalus (Leguminosae).
  21. (2002). Increased taxon sampling greatly reduces phylogenetic error.
  22. (1993). Is there something wrong with the bootstrap on phylogenies? A reply to Hillis and Bull.
  23. (1996). Large deviations and dispersion effects.
  24. (2000). Likelihood-based tests of topologies in phylogenetics.
  25. (1997). Mass survival of birds across the cretaceoustertiary boundary: molecular evidence.
  26. (2001). Maximum likelihood phylogenetic analysis under a covarion-like model.
  27. (1996). Model of amino acid substitution in proteins encoded by mitochondrial DNA.
  28. (1999). Multiple comparisons of log likelihoods with applications to phylogenetic inference.
  29. (1997). PAML: A program package for phylogenetic analysis by maximum likelihood.
  30. (1993). PHYLIP (Phylogeny Inference Package) version 3.6a2. Distributed by the author.
  31. (1996). Phylogenetic inference.
  32. (1997). Phylogeny of lichen- and non-lichen-forming omphalinoid mushrooms and the utility of testing for combinability among multiple data sets.
  33. (1996). Quartet puzzling: A quartet maximum likelihood method for reconstructing tree topologies.
  34. (1992). Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock.
  35. (2002). Testing a covariotide model of DNA substitution.
  36. (1998). The problem of regions.
  37. (1971). The recovery of trees from measures of dissimilarity.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.