Although taxonomy is often used informally to evaluate the results of
phylogenetic inference and find the root of phylogenetic trees, algorithmic
methods to do so are lacking. In this paper we formalize these procedures and
develop algorithms to solve the relevant problems. In particular, we introduce
a new algorithm that solves a "subcoloring" problem for expressing the
difference between the taxonomy and phylogeny at a given rank. This algorithm
improves upon the current best algorithm in terms of asymptotic complexity for
the parameter regime of interest; we also describe a branch-and-bound algorithm
that saves orders of magnitude in computation on real data sets. We also
develop a formalism and an algorithm for rooting phylogenetic trees according
to a taxonomy. All of these algorithms are implemented in freely-available
software.Comment: Version submitted to Algorithms for Molecular Biology. A number of
  fixes from previous versio

A Stamatakis

Aaron Gallagher

D Dalevi

D McDonald

E Bachoore

Frederick A Matsen

HL Bodlaender

J Hein

M Price

O Ponta

R Tatusov

S Moran

VB Yap

English

arXiv

Springer - Publisher Connector

Reconciling taxonomy and phylogenetic inference: formalism and algorithms for describing discord and inferring taxonomic roots

Crossref

Algorithms for Molecular Biology

Abstract Background Although taxonomy is often used informally to evaluate the results of phylogenetic inference and the root of phylogenetic trees, algorithmic methods to do so are lacking. Results In this paper we formalize these procedures and develop algorithms to solve the relevant problems. In particular, we introduce a new algorithm that solves a "subcoloring" problem to express the difference between a taxonomy and a phylogeny at a given rank. This algorithm improves upon the current best algorithm in terms of asymptotic complexity for the parameter regime of interest; we also describe a branch-and-bound algorithm that saves orders of magnitude in computation on real data sets. We also develop a formalism and an algorithm for rooting phylogenetic trees according to a taxonomy. Conclusions The algorithms in this paper, and the associated freely-available software, will help biologists better use and understand taxonomically labeled phylogenetic trees.</p

Matsen Frederick A

Gallagher Aaron

Directory of Open Access Journals

A: FastTree 2-approximately maximum-likelihood trees for large alignments. PLoS One

Bodlaender H: Convex recoloring of leaf-colored trees.

et al: The COG database: an updated version includes eukaryotes.

Hugenholtz P: An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea.

Hugenholtz P: Automated group assignment in large phylogenetic trees using GRUNT: GRouping, Ungrouping, Naming Tool.

On the complexity of comparing evolutionary trees. Discrete Applied Mathematics

Quadratic kernelization for convex recoloring of trees.

RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics

Rooting a phylogenetic tree with nonreversible substitution models. BMC Evolutionary Biology

Snir S: Convex recolorings of strings and trees: Definitions, hardness results and algorithms.

Snir S: Efficient approximation of convex recolorings.

Speeding up dynamic programming for some NP-hard graph recoloring problems.

Tassa T: Improved bounds on Bell numbers and on moments of sums of random variables.

http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.233.4288

Reconciling taxonomy and phylogenetic inference: formalism and
  algorithms for describing discord and inferring taxonomic roots

Reconciling taxonomy and phylogenetic inference: formalism and algorithms for describing discord and inferring taxonomic roots

Abstract

Similar works

Full text

Available Versions

Springer - Publisher Connector

Crossref

Directory of Open Access Journals