Search CORE

5,076 research outputs found

SuperTriplets: a triplet-based supertree approach to phylogenomics

Author: A. Criscuolo
Beck
Bininda-Emonds
Blanga-Kanfi
Dixon
E. J. P. Douzery
Grunewald
Hickey
Janecka
Jeffroy
Ragan
Rambaut
Ranwez
Saitou
Sanderson
V. Ranwez
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: Phylogenetic tree-building methods use molecular data to represent the evolutionary history of genes and taxa. A recurrent problem is to reconcile the various phylogenies built from different genomic sequences into a single one. This task is generally conducted by a two-step approach whereby a binary representation of the initial trees is first inferred and then a maximum parsimony (MP) analysis is performed on it. This binary representation uses a decomposition of all source trees that is usually based on clades, but that can also be based on triplets or quartets. The relative performances of these representations have been discussed but are difficult to assess since both are limited to relatively small datasets

CiteSeerX

Crossref

PubMed Central

Elucidating the phylodynamics of endemic rabies virus in eastern Africa using whole-genome sequencing

Author: Biek Roman
Brunker Kirstyn
Cleaveland Sarah
Fooks AR
Hampson Katie
Horton Daniel L
Kazwala Rudovick
Lembo Tiziana
Marston Denise A
Mtema Zacharia J
Ngeleja Chanasa
Sambo Maganga
Sikana Lwitiko
Wilkie Gavin
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

Many of the pathogens perceived to pose the greatest risk to humans are viral zoonoses, responsible for a range of emerging and endemic infectious diseases. Phylogeography is a useful tool to understand the processes that give rise to spatial patterns and drive dynamics in virus populations. Increasingly, whole-genome information is being used to uncover these patterns, but the limits of phylogenetic resolution that can be achieved with this are unclear. Here, whole-genome variation was used to uncover fine-scale population structure in endemic canine rabies virus circulating in Tanzania. This is the first whole-genome population study of rabies virus and the first comprehensive phylogenetic analysis of rabies virus in East Africa, providing important insights into rabies transmission in an endemic system. In addition, sub-continental scale patterns of population structure were identified using partial gene data and used to determine population structure at larger spatial scales in Africa. While rabies virus has a defined spatial structure at large scales, increasingly frequent levels of admixture were observed at regional and local levels. Discrete phylogeographic analysis revealed long-distance dispersal within Tanzania, which could be attributed to human-mediated movement, and we found evidence of multiple persistent, co-circulating lineages at a very local scale in a single district, despite on-going mass dog vaccination campaigns. This may reflect the wider endemic circulation of these lineages over several decades alongside increased admixture due to human-mediated introductions. These data indicate that successful rabies control in Tanzania could be established at a national level, since most dispersal appears to be restricted within the confines of country borders but some coordination with neighbouring countries may be required to limit transboundary movements. Evidence of complex patterns of rabies circulation within Tanzania necessitates the use of whole-genome sequencing to delineate finer scale population structure that can that can guide interventions, such as the spatial scale and design of dog vaccination campaigns and dog movement controls to achieve and maintain freedom from disease

Sokoine University of Agriculture

PubMed Central

Enlighten

Surrey Research Insight

Sokoine University of Agriculture Institutional Repository

Split-based computation of majority-rule supertrees

Author: A Kupczok
A Kupczok
AG Rodrigo
Anne Kupczok
B Holland
BR Baum
C Semple
C Semple
CA Meacham
CA Phillips
CJ Creevey
CJ Creevey
D Bryant
D Fitzpatrick
D Pisani
D Wu
DF Robinson
DH Huson
DL Swofford
E Bapteste
GU Yule
HA Ross
HT Lin
J Dong
J Dong
J Dong
JA Cotton
JL Thorley
M Kennedy
M Steel
M Wilkinson
M Wilkinson
M Wilkinson
MA Ragan
MJ Sanderson
MJ Sanderson
MS Bansal
MS Waterman
N Galtier
ORP Bininda-Emonds
P Puigbò
PA Goloboff
R Beck
RB Davis
RDM Page
T Margush
WF Doolittle
WJ Baker
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Supertree methods combine overlapping input trees into a larger supertree. Here, I consider split-based supertree methods that first extract the split information of the input trees and subsequently combine this split information into a phylogeny. Well known split-based supertree methods are matrix representation with parsimony and matrix representation with compatibility. Combining input trees on the same taxon set, as in the consensus setting, is a well-studied task and it is thus desirable to generalize consensus methods to supertree methods. Results Here, three variants of majority-rule (MR) supertrees that generalize majority-rule consensus trees are investigated. I provide simple formulas for computing the respective score for bifurcating input- and supertrees. These score computations, together with a heuristic tree search minmizing the scores, were implemented in the python program PluMiST (Plus- and Minus SuperTrees) available from <url>http://www.cibiv.at/software/plumist</url>. The different MR methods were tested by simulation and on real data sets. The search heuristic was successful in combining compatible input trees. When combining incompatible input trees, especially one variant, MR(-) supertrees, performed well. Conclusions The presented framework allows for an efficient score computation of three majority-rule supertree variants and input trees. I combined the score computation with a heuristic search over the supertree space. The implementation was tested by simulation and on real data sets and showed promising results. Especially the MR(-) variant seems to be a reasonable score for supertree reconstruction. Generalizing these computations to multifurcating trees is an open problem, which may be tackled using this framework.</p

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

IST Austria: PubRep (Institute of Science and Technology)

Pattern-based phylogenetic distance estimation and tree reconstruction

Author: Höhl Michael
Ragan Mark A.
Rigoutsos Isidore
Publication venue
Publication date: 01/01/2006
Field of study

We have developed an alignment-free method that calculates phylogenetic distances using a maximum likelihood approach for a model of sequence change on patterns that are discovered in unaligned sequences. To evaluate the phylogenetic accuracy of our method, and to conduct a comprehensive comparison of existing alignment-free methods (freely available as Python package decaf+py at http://www.bioinformatics.org.au), we have created a dataset of reference trees covering a wide range of phylogenetic distances. Amino acid sequences were evolved along the trees and input to the tested methods; from their calculated distances we infered trees whose topologies we compared to the reference trees. We find our pattern-based method statistically superior to all other tested alignment-free methods on this dataset. We also demonstrate the general advantage of alignment-free methods over an approach based on automated alignments when sequences violate the assumption of collinearity. Similarly, we compare methods on empirical data from an existing alignment benchmark set that we used to derive reference distances and trees. Our pattern-based approach yields distances that show a linear relationship to reference distances over a substantially longer range than other alignment-free methods. The pattern-based approach outperforms alignment-free methods and its phylogenetic accuracy is statistically indistinguishable from alignment-based distances.Comment: 21 pages, 3 figures, 2 table

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

PubMed Central

University of Queensland eSpace