Search CORE

SuperTriplets: a triplet-based supertree approach to phylogenomics

Author: A. Criscuolo
Beck
Bininda-Emonds
Blanga-Kanfi
Dixon
E. J. P. Douzery
Grunewald
Hickey
Janecka
Jeffroy
Ragan
Rambaut
Ranwez
Saitou
Sanderson
V. Ranwez
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: Phylogenetic tree-building methods use molecular data to represent the evolutionary history of genes and taxa. A recurrent problem is to reconcile the various phylogenies built from different genomic sequences into a single one. This task is generally conducted by a two-step approach whereby a binary representation of the initial trees is first inferred and then a maximum parsimony (MP) analysis is performed on it. This binary representation uses a decomposition of all source trees that is usually based on clades, but that can also be based on triplets or quartets. The relative performances of these representations have been discussed but are difficult to assess since both are limited to relatively small datasets

CiteSeerX

An experimental study of Quartets MaxCut and other supertree methods

Author: A Ben-dor
A Dress
A Stamatakis
B Holland
B Rannala
BR Baum
C Randal Linder
CJ Creevey
D Chen
D Chen
D Thain
DL Swofford
H Bolaender
JG Burleigh
K Strimmer
KC Nixon
KS John
LR Foulds
M Bansal
M Shel Swenson
MA Ragan
MS Swenson
ORP Bininda-Emonds
ORP Bininda-Emonds
Rahul Suri
S Snir
S Snir
T Jiang
T Jiang
Tandy Warnow
V Ranwez
V Ranwez
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Supertree methods represent one of the major ways by which the Tree of Life can be estimated, but despite many recent algorithmic innovations, matrix representation with parsimony (MRP) remains the main algorithmic supertree method. Results We evaluated the performance of several supertree methods based upon the Quartets MaxCut (QMC) method of Snir and Rao and showed that two of these methods usually outperform MRP and five other supertree methods that we studied, under many realistic model conditions. However, the QMC-based methods have scalability issues that may limit their utility on large datasets. We also observed that taxon sampling impacted supertree accuracy, with poor results obtained when all of the source trees were only sparsely sampled. Finally, we showed that the popular optimality criterion of minimizing the total topological distance of the supertree to the source trees is only weakly correlated with supertree topological accuracy. Therefore evaluating supertree methods on biological datasets is problematic. Conclusions Our results show that supertree methods that improve upon MRP are possible, and that an effort should be made to produce scalable and robust implementations of the most accurate supertree methods. Also, because topological accuracy depends upon taxon sampling strategies, attempts to construct very large phylogenetic trees using supertree methods should consider the selection of source tree datasets, as well as supertree methods. Finally, since supertree topological error is only weakly correlated with the supertree's topological distance to its source trees, development and testing of supertree methods presents methodological challenges.</p

CiteSeerX

Springer - Publisher Connector

Texas ScholarWorks

PhyDesign: an online application for profiling phylogenetic informativeness

Author: BC Mahon
CL Schoch
Francesc López-Giráldez
I Mayrose
JE Stajich
Jeffrey P Townsend
JP Townsend
JP Townsend
JP Townsend
JP Townsend
MP Cummings
S Klopfstein
S Marthey
SLK Pond
V Ranwez
YI Tekle
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

Extensive Copy-Number Variation of Young Genes across Stickleback Populations

Author: A Abyzov
A Alexa
A Conesa
A Hussain
AJ Iafrate
AJ Sharp
AJ Vilella
AR Boyko
AR Quinlan
B Guo
BE Deagle
C Eizaguirre
C Eizaguirre
Christophe Eizaguirre
CL McGrath
CL Peichel
D Bryant
D Juan
D Tautz
DE Cook
DH Huson
DJ Turner
DR Schrider
DR Schrider
DR Zerbino
E Gazave
E Proux
Erich Bornberg-Bauer
FA Kondrashov
FC Jones
Frédéric J. J. Chain
G Gibson
G Orti
GC Conant
GH Perry
GH Perry
GM Cooper
H Kehrer-Sawatzki
H Li
Irene E. Samonte
J Sebat
JA Fawcett
Jianzhi Zhang
JJ Emerson
JK Colbourne
JO Korbel
JO Korbel
K Chen
K Khalturin
K Ye
KJ Lipinski
KJ Livak
KM Teshima
KM Wegner
L Xu
LC Hsing
LR Saraiva
M Hiraiwa
M Long
M Long
M Lynch
M Lynch
M Milinski
M Roesti
MA DePristo
Mahesh Panchal
Manfred Milinski
Martin Kalbe
Monika Stoll
N Ghanem
P Danecek
P Flicek
P Sjödin
PA Hohenlohe
PGD Feulner
PH Sudmant
Philine G. D. Feulner
PM Kim
R Redon
RC Iskow
S Moretti
S Sawyer
SF Altschul
SH Williamson
SM Waszak
SR Browning
T Marques-Bonet
T Rausch
TD Schmittgen
Thorsten B. H. Reusch
Tobias L. Lenz
V Guryev
V Katju
V Katju
V Ranwez
X Huang
Y Hashiguchi
Y Hashiguchi
Y Zheng
YE Zhang
YF Chan
Z Yang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

OceanRep

Queen Mary Research Online

Bern Open Repository and Information System (BORIS)

MPG.PuRe

FigShare

Genome-wide signatures of convergent evolution in echolocating mammals

Author: A Schneider
A Stamatakis
A Stamatakis
A Terrinoni
AF Ryan
AG Clark
AJ Drummond
AM Hancock
BP Lewis
EB Kim
EC Teeling
EC Teeling
Elia Stupka
G Jones
G Jones
G Li
G Parra
G Parra
G Zhang
Georgia Tsagkogeorga
HB Zhao
HB Zhao
J Castresana
James A. Cotton
JF Hughes
JI Fasick
Joe Parker
JP Bielawski
JZ Zhang
K Katoh
K Kriener
K Lindblad-Toh
KG Becker
KTJ Davies
M Kanehisa
M Soskine
M Vater
N Lartillot
OR Bininda-Emonds
Paolo Provero
PR Grant
R Li
R She
RC Edgar
RR Hoy
Stephen J. Rossiter
T Junier
TA Castoe
TSK Prasad
V Ranwez
W Huang da
WJ Murphy
WM Fitch
WS Wong
WWL Au
X Zhou
Y Benjamini
Y Liu
Y Liu
Y Liu
Y-B Sun
Y-Y Shen
Yuan Liu
Z Yang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/09/2013
Field of study

Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes(1-3). However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures(4,5). Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level(6-9). Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution(9,10) although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised

Southampton (e-Prints Soton)

Queen Mary Research Online

Enlighten

Invariant based quartet puzzling

Author: A Rambaut
Brian Hipp
D Maddison
D Robinson
D Vazquez
E Allman
H Schmidt
J Cavender
J Huelsenbeck
J Lake
J Sumner
J Sumner
Joseph P Rusinko
K Strimmer
K Tamura
L Jin
M Casanellas
M Casanellas
M Casanellas
M Donten-Bury
M Steel
N Eriksson
PSRJE Allman
S Coughlan
S Evans
S Snir
S Snir
S Snir
S Sturmfels
V Berry
V Ranwez
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Public Library of Science (PLOS)

Fast and Robust Characterization of Time-Heterogeneous Sequence Evolutionary Processes Using Substitution Mapping

Author: A Hobolth
A Hobolth
B Boussau
Bastien Boussau
D Liberles
David Liberles
DM Robinson
E Proux
Emeric Figuet
Emmanuel J. P. Douzery
GC Nickel
I Mayrose
IN Shindyalov
J De Magalhaes
J Dutheil
J Dutheil
J Dutheil
J Dutheil
J Dutheil
J Felsenstein
J Felsenstein
J Romiguier
Jonathan Romiguier
JS Escobar
Julien Y. Dutheil
K Popadin
K Tamura
KS Pollard
L Duret
L Duret
M Neiman
MW Dimmic
N Galtier
N Galtier
N Lartillot
N Lartillot
N Rodrigue
Nicolas Galtier
P Tataru
R Nielsen
R Nielsen
RW Jobson
S Paland
SLK Pond
T Ohta
T Pupko
TG Barraclough
TH Jukes
V Ranwez
Vincent Ranwez
VN Minin
W Zhai
Z Yang
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Genes and genomes do not evolve similarly in all branches of the tree of life. Detecting and characterizing the heterogeneity in time, and between lineages, of the nucleotide (or amino acid) substitution process is an important goal of current molecular evolutionary research. This task is typically achieved through the use of non-homogeneous models of sequence evolution, which being highly parametrized and computationally-demanding are not appropriate for large-scale analyses. Here we investigate an alternative methodological option based on probabilistic substitution mapping. The idea is to first reconstruct the substitutional history of each site of an alignment under a homogeneous model of sequence evolution, then to characterize variations in the substitution process across lineages based on substitution counts. Using simulated and published datasets, we demonstrate that probabilistic substitution mapping is robust in that it typically provides accurate reconstruction of sequence ancestry even when the true process is heterogeneous, but a homogeneous model is adopted. Consequently, we show that the new approach is essentially as efficient as and extremely faster than (up to 25 000 times) existing methods, thus paving the way for a systematic survey of substitution process heterogeneity across genes and lineages

INRIA a CCSD electronic archive server

Public Library of Science (PLOS)

MACSE: Multiple Alignment of Coding SEquences Accounting for Frameshifts and Stop Codons

Author: A Löytynoja
B Chevreux
B Morgenstern
C Notredame
CNS Pedersen
D Huchon
D Przybylski
D Sankoff
D Zheng
DG Higgins
E Dermitzakis
Emmanuel J. P. Douzery
F Abascal
F Delsuc
Frédéric Delsuc
H Philippe
H Zhao
J Hein
J Kececioglu
J Kececioglu
J Raes
JD Thompson
K Katoh
KM Wong
L Arvestad
L Salmela
M Dayhoff
M Gouy
M Kircher
M Margulies
M Suyama
MT Gilbert
N Galtier
OR Bininda-Emonds
P Sneath
PJ Farabaugh
R Wernersson
RC Edgar
RC Edgar
RK Bradley
RR Stocsits
RW Meredith
S Henikoff
S Needleman
SF Altschul
SF Altschul
SS Steiger
Sébastien Harispe
T Smith
TA Demere
TJ Hubbard
TJ Wheeler
V Ranwez
Vincent Ranwez
William J. Murphy
X Guan
X Huang
Y Van de Peer
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Until now the most efficient solution to align nucleotide sequences containing open reading frames was to use indirect procedures that align amino acid translation before reporting the inferred gap positions at the codon level. There are two important pitfalls with this approach. Firstly, any premature stop codon impedes using such a strategy. Secondly, each sequence is translated with the same reading frame from beginning to end, so that the presence of a single additional nucleotide leads to both aberrant translation and alignment

Discovery and functional characterisation of a luqin-type neuropeptide signalling system in a deuterostome

Author: C Collin
C Zatylny-Gaudin
CD Keating
CP Tensen
DC Semmens
DC Semmens
DC Semmens
DC Semmens
EA Stemmler
F Hauser
FM Bree de
G Jékely
JA Veenstra
JA Veenstra
K Fujimoto
L Li
L Li
M Conzelmann
M Lin
M Lin
M Shyamala
ML Rowe
MR Elphick
MR Elphick
MR Elphick
ND Pattengale
O Mirabeau
P Bauknecht
PM Lanctot
Q Fu
R Melarange
RC Edgar
RS Aloyz
S Guindon
S Guindon
S Kumar
S Tian
S Tian
T Ida
T Maeda
T Mekata
U Omasits
V Baubet
V Ranwez
X Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

The results presented in this paper have not been published previously in whole or in part. The work reported in this paper was supported by grants from the BBSRC awarded to M.R.E (BB/M001644/1) and J.H.S. (BB/M001032/1). L.A.Y.G is supported by a PhD studentship awarded by the Mexican Council of Science and Technology (CONACyT studentship no. 418612) and Queen Mary University of London. We are grateful to Philipp Bauknecht and Gáspár Jékely (Max Planck Institute for Developmental Biology, Tübingen, Germany) for providing the Gα16 plasmid and the CHO-G5A cells, which were originally generated by Baubet et al. (Proc Natl Acad Sci USA 97:7260–7265). We are also grateful to Phil Edwards for his help with collecting starfish, Paul Fletcher for maintaining our seawater aquarium and Maria Eugenia Guerra for creating the silhouettes of animals used in Figure 7