Article thumbnail

TreeKO: a duplication-aware algorithm for the comparison of phylogenetic trees

By Marina Marcet-Houben and Toni Gabaldón


Comparisons of tree topologies provide relevant information in evolutionary studies. Most existing methods share the drawback of requiring a complete and exact mapping of terminal nodes between the compared trees. This severely limits the scope of genome-wide analyses, since trees containing duplications are pruned arbitrarily or discarded. To overcome this, we have developed treeKO, an algorithm that enables the comparison of tree topologies, even in the presence of duplication and loss events. To do so treeKO recursively splits gene trees into pruned trees containing only orthologs to subsequently compute a distance based on the combined analyses of all pruned tree comparisons. In addition treeKO, implements the possibility of computing phylome support values, and reconciliation-based measures such as the number of inferred duplication and loss events

Topics: Methods Online
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2007). A congruence index for testing topological similarity between trees.
  2. (2009). A fungal phylogeny based on 82 complete genomes using the composition vector method.
  3. (2001). A phylogenomic approach to microbial evolution.
  4. (2009). A practical method for exact computation of subtree prune and regraft distance.
  5. (2005). A survey on tree edit distance and related problems.
  6. (2007). Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes.
  7. (2009). Additions, losses, and rearrangements on the evolutionary route from a reconstructed ancestor to the modern Saccharomyces cerevisiae genome.
  8. (2008). An algebraic metric for phylogenetic trees.
  9. (1981). Comparison of phylogenetic trees.
  10. (1985). Comparison of undirected phylogenetic trees based on subtrees of four evolutionary units.
  11. (2008). DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony.
  12. (2010). Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost models.
  13. (2007). Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting: an example from Melanoplus grasshoppers.
  14. (2010). ETE: a python Environment for Tree Exploration.
  15. (1979). Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences.
  16. (2000). From gene trees to species trees.
  17. (2009). Gene tree discordance, phylogenetic inference and the multispecies coalescent.
  18. (1998). GeneTree: comparing gene and species phylogenies using reconciled trees.
  19. (2011). Genome-scale phylogenetics: inferring the plant tree of life from 18,896 gene trees.
  20. (1996). Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods.
  21. (2005). Lineage-specific gene loss following mitochondrial endosymbiosis and its potential for function prediction in eukaryotes.
  22. (2004). On the computational complexity of the rooted subtree prune and regraft distance.
  23. (2000). Phylogenetic analysis using PHYLIP.
  24. (2009). Phylogenetic and functional assessment of orthologs inference projects and methods.
  25. (2006). Phylogenetic identification of lateral genetic transfer events.
  26. (2011). PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions.
  27. (2009). PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees.
  28. (2008). Reconstruction and analysis of large-scale phylogenetic data, challenges and opportunities.
  29. (2005). String Processing and Information Retrieval,
  30. (2005). Supertree construction in the genomic age.
  31. (2007). The K tree score: quantification of differences in the relative branch length and topology of phylogenetic trees.
  32. (2009). The tree versus the forest: the fungal tree of life and the topological diversity within the yeast phylome.
  33. (2007). TOPD/ FMTS: a new software to compare phylogenetic trees.
  34. (2007). Topological variation in single-gene phylogenetic trees.
  35. (2009). Trees from trees: construction of phylogenetic supertrees using clann. Methods Mol.