Search CORE

14 research outputs found

On Computing the Maximum Parsimony Score of a Phylogenetic Network

Author: Fischer Mareike
Kelk Steven
Scornavacca Celine
van Iersel Leo
Publication venue
Publication date: 01/01/2013
Field of study

Phylogenetic networks are used to display the relationship of different species whose evolution is not treelike, which is the case, for instance, in the presence of hybridization events or horizontal gene transfers. Tree inference methods such as Maximum Parsimony need to be modified in order to be applicable to networks. In this paper, we discuss two different definitions of Maximum Parsimony on networks, "hardwired" and "softwired", and examine the complexity of computing them given a network topology and a character. By exploiting a link with the problem Multicut, we show that computing the hardwired parsimony score for 2-state characters is polynomial-time solvable, while for characters with more states this problem becomes NP-hard but is still approximable and fixed parameter tractable in the parsimony score. On the other hand we show that, for the softwired definition, obtaining even weak approximation guarantees is already difficult for binary characters and restricted network topologies, and fixed-parameter tractable algorithms in the parsimony score are unlikely. On the positive side we show that computing the softwired parsimony score is fixed-parameter tractable in the level of the network, a natural parameter describing how tangled reticulate activity is in the network. Finally, we show that both the hardwired and softwired parsimony score can be computed efficiently using Integer Linear Programming. The software has been made freely available

arXiv.org e-Print Archive

CiteSeerX

TU Delft Repository

Phylogenetic Networks Do not Need to Be Complex: Using Fewer Reticulations to Represent Conflicting Clusters

Author: Bordewich
Bordewich
Choy
Daniel Huson
Gambette
Grass Phylogeny Working Group
Gusfield
Gusfield
Huson
Huson
Huson
Huson
Huynh
Jansson
Jansson
Leo van Iersel
Linz
Nakleh
Regula Rupp
Schmidt
Semple
Steven Kelk
To
van Iersel
van Iersel
van Iersel
Publication venue
Publication date: 16/10/2009
Field of study

Phylogenetic trees are widely used to display estimates of how groups of species evolved. Each phylogenetic tree can be seen as a collection of clusters, subgroups of the species that evolved from a common ancestor. When phylogenetic trees are obtained for several data sets (e.g. for different genes), then their clusters are often contradicting. Consequently, the set of all clusters of such a data set cannot be combined into a single phylogenetic tree. Phylogenetic networks are a generalization of phylogenetic trees that can be used to display more complex evolutionary histories, including reticulate events such as hybridizations, recombinations and horizontal gene transfers. Here we present the new CASS algorithm that can combine any set of clusters into a phylogenetic network. We show that the networks constructed by CASS are usually simpler than networks constructed by other available methods. Moreover, we show that CASS is guaranteed to produce a network with at most two reticulations per biconnected component, whenever such a network exists. We have implemented CASS and integrated it in the freely available Dendroscope software

arXiv.org e-Print Archive

Repository TU/e

A quadratic kernel for computing the hybridization number of multiple trees

Author: Iersel
Leo Van
Simone Linz
Publication venue
Publication date: 01/03/2012
Field of study

It has recently been shown that the NP-hard problem of calculating the minimum number of hybridization events that is needed to explain a set of rooted binary phylogenetic trees by means of a hybridization network is fixed-parameter tractable if an instance of the problem consists of precisely two such trees. In this paper, we show that this problem remains fixed-parameter tractable for an arbitrarily large set of rooted binary phylogenetic trees. In particular, we present a quadratic kernel

arXiv.org e-Print Archive

CiteSeerX

Repository TU/e

A Survey of Combinatorial Methods for Phylogenetic Networks

Author: Bandelt
Bandelt
Bandelt
Bandelt
Bandelt
Baroni
Bokhari
Bordewich
Bordewich
Bryant
Buneman
Casado
Castrucci
Celine Scornavacca
Choy
Clement
Daniel H. Huson
Delwiche
Dessimoz
Disotell
Doolittle
Doyon
Dress
Ebersberger
Excoffier
Grass Phylogeny Working Group
Griffiths
Griffiths
Grünewald
Gusfield
Gusfield
Gusfield
Gusfield
Hein
Hein
Holland
Holland
Huber
Huson
Huson
Huson
Huson
Huson
Huson
Huson
Huson
Huson
Huson
Jansson
Jin
Jin
Jin
Kanj
Kivisild
Linder
Maddison
Makarenkov
Morrison
Morrison
O'Donnell
Parida
Planet
Rieseberg
Rokas
Sneath
Song
Song
Song
Syvanen
Templeton
To
Tofigh
van Iersel
Wagele
Wang
Whidden
Whitfield
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

The evolutionary history of a set of species is usually described by a rooted phylogenetic tree. Although it is generally undisputed that bifurcating speciation events and descent with modifications are major forces of evolution, there is a growing belief that reticulate events also have a role to play. Phylogenetic networks provide an alternative to phylogenetic trees and may be more suitable for data sets where evolution involves significant amounts of reticulate events, such as hybridization, horizontal gene transfer, or recombination. In this article, we give an introduction to the topic of phylogenetic networks, very briefly describing the fundamental concepts and summarizing some of the most important combinatorial methods that are available for their computation

A tight kernel for computing the tree bisection and reconnection distance between two phylogenetic trees

Author: Kelk Steven
Linz Simone
Publication venue
Publication date: 01/01/2019
Field of study

In 2001 Allen and Steel showed that, if subtree and chain reduction rules have been applied to two unrooted phylogenetic trees, the reduced trees will have at most 28k taxa where k is the TBR (Tree Bisection and Reconnection) distance between the two trees. Here we reanalyse Allen and Steel's kernelization algorithm and prove that the reduced instances will in fact have at most 15k-9 taxa. Moreover we show, by describing a family of instances which have exactly 15k-9 taxa after reduction, that this new bound is tight. These instances also have no common clusters, showing that a third commonly-encountered reduction rule, the cluster reduction, cannot further reduce the size of the kernel in the worst case. To achieve these results we introduce and use "unrooted generators" which are analogues of rooted structures that have appeared earlier in the phylogenetic networks literature. Using similar argumentation we show that, for the minimum hybridization problem on two rooted trees, 9k-2 is a tight bound (when subtree and chain reduction rules have been applied) and 9k-4 is a tight bound (when, additionally, the cluster reduction has been applied) on the number of taxa, where k is the hybridization number of the two trees.Comment: One figure added, two small typos fixed. This version to appear in SIDMA (SIAM Journal on Discrete Mathematics

arXiv.org e-Print Archive

On unrooted and root-uncertain variants of several well-known phylogenetic network problems

Author: Boes Olivier
Kelk Steven
Stamoulis Georgios
Stougie Leen
van Iersel Leo
Publication venue
Publication date: 01/01/2016
Field of study

The hybridization number problem requires us to embed a set of binary rooted phylogenetic trees into a binary rooted phylogenetic network such that the number of nodes with indegree two is minimized. However, from a biological point of view accurately inferring the root location in a phylogenetic tree is notoriously difficult and poor root placement can artificially inflate the hybridization number. To this end we study a number of relaxed variants of this problem. We start by showing that the fundamental problem of determining whether an \emph{unrooted} phylogenetic network displays (i.e. embeds) an \emph{unrooted} phylogenetic tree, is NP-hard. On the positive side we show that this problem is FPT in reticulation number. In the rooted case the corresponding FPT result is trivial, but here we require more subtle argumentation. Next we show that the hybridization number problem for unrooted networks (when given two unrooted trees) is equivalent to the problem of computing the Tree Bisection and Reconnect (TBR) distance of the two unrooted trees. In the third part of the paper we consider the "root uncertain" variant of hybridization number. Here we are free to choose the root location in each of a set of unrooted input trees such that the hybridization number of the resulting rooted trees is minimized. On the negative side we show that this problem is APX-hard. On the positive side, we show that the problem is FPT in the hybridization number, via kernelization, for any number of input trees.Comment: 28 pages, 8 Figure

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Treewidth of display graphs: bounds, brambles and applications

Author: Janssen Remie
Jones Mark
Kelk Steven
Stamoulis Georgios
Wu Taoyang
Publication venue: 'Journal of Graph Algorithms and Applications'
Publication date: 04/09/2018
Field of study

Phylogenetic trees and networks are leaf-labelled graphs used to model evolution. Display graphs are created by identifying common leaf labels in two or more phylogenetic trees or networks. The treewidth of such graphs is bounded as a function of many common dissimilarity measures between phylogenetic trees and this has been leveraged in fixed parameter tractability results. Here we further elucidate the properties of display graphs and their interaction with treewidth. We show that it is NP-hard to recognize display graphs, but that display graphs of bounded treewidth can be recognized in linear time. Next we show that if a phylogenetic network displays (i.e. topologically embeds) a phylogenetic tree, the treewidth of their display graph is bounded by a function of the treewidth of the original network (and also by various other parameters). In fact, using a bramble argument we show that this treewidth bound is sharp up to an additive term of 1. We leverage this bound to give an FPT algorithm, parameterized by treewidth, for determining whether a network displays a tree, which is an intensively-studied problem in the field. We conclude with a discussion on the future use of display graphs and treewidth in phylogenetics

arXiv.org e-Print Archive

TU Delft Repository

A reconstruction problem for a class of phylogenetic networks with lateral gene transfers

Author: Francesc Rosselló
Gabriel Cardona
Joan Carles Pons
Publication venue: Springer Nature
Publication date: 01/01/2015
Field of study

Springer - Publisher Connector