156 research outputs found

Interactions in the microbiome: communities of organisms and communities of genes

Author: Beiko R.G.
Boon E.
Langille M.G.I.
Meehan Conor J.
Whidden C.
Wong D. H.-J.
Publication venue: Wiley
Publication date: 10/09/2019

A central challenge in microbial community ecology is the delineation of appropriate units of biodiversity, which can be taxonomic, phylogenetic, or functional in nature. The term ‘community’ is applied ambiguously; in some cases, the term refers simply to a set of observed entities, while in other cases, it requires that these entities interact with one another. Microorganisms can rapidly gain and lose genes, potentially decoupling community roles from taxonomic and phylogenetic groupings. Trait-based approaches offer a useful alternative, but many traits can be defined based on gene functions, metabolic modules, and genomic properties, and the optimal set of traits to choose is often not obvious. An analysis that considers taxon assignment and traits in concert may be ideal, with the strengths of each approach offsetting the weaknesses of the other. Individual genes also merit consideration as entities in an ecological analysis, with characteristics such as diversity, turnover, and interactions modeled using genes rather than organisms as entities. We identify some promising avenues of research that are likely to yield a deeper understanding of microbial communities that shift from observation-based questions of ‘Who is there?’ and ‘What are they doing?’ to the mechanistically driven question of ‘How will they respond?’

Bradford Scholars

PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees

Author: A Bateman
A Levasseur
AK Wright
BE Engelhardt
CA Paulding
CM Zmasek
D Barker
D Durand
DH Huson
DHD Warren
J Felsenstein
J McCarthy
J Ruan
JD Thompson
JF Dufayard
JS Farris
Julie D Thompson
L Arvestad
N Krishnamurthy
O Sakarya
P Gouret
Philippe Gouret
Pierre Pontarotti
RG Beiko
T Blomme
T Dobzhansky
TJ Hubbard
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. Results: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user-defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. Conclusion: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However, any workflow that relies on phylogenetic tree analysis could be automated with PhyloPattern.

Crossref

HAL AMU

Springer - Publisher Connector

Directory of Open Access Journals

HAL-Inserm

PubMed Central

PhyloNet: a software package for analyzing and reconstructing reticulate evolutionary relationships

Author: B Moret
C Buerkle
C Linder
C Semple
C Than
C Than
C Than
Cuong Than
D Bryant
D Huson
D MacLeod
D Penny
D Posada
D Posada
D Robinson
D Ruths
D Ruths
D Ruths
Derek Ruths
DH Huson
DL Swofford
G Jin
H Ochman
H Shimodaira
I Kanj
J Felsenstein
J Felsenstein
J Mower
L Nakhleh
L Nakhleh
L Nakhleh
L Nakhleh
L Rieseberg
Luay Nakhleh
M Baroni
M Bordewich
M Hallett
M Steel
MM Morin
N Ellstrand
R Beiko
U Bergthorsson
U Bergthorsson
U Bergthorsson
V Makarenkov
Publication venue: BioMed Central
Publication date: 01/07/2008

Background: Phylogenies, i.e., the evolutionary histories of groups of taxa, play a major role in representing the interrelationships among biological entities. Many software tools for reconstructing and evaluating such phylogenies have been proposed, almost all of which assume the underlying evolutionary history to be a tree. While trees give a satisfactory first-order approximation for many families of organisms, other families exhibit evolutionary mechanisms that cannot be represented by trees. Processes such as horizontal gene transfer (HGT), hybrid speciation, and interspecific recombination, collectively referred to as reticulate evolutionary events, result in networks, rather than trees, of relationships. Various software tools have been recently developed to analyze reticulate evolutionary relationships, which include SplitsTree4, LatTrans, EEEP, HorizStory, and T-REX. Results: In this paper, we report on the PhyloNet software package, which is a suite of tools for analyzing reticulate evolutionary relationships, or evolutionary networks, which are rooted, directed, acyclic graphs, leaf-labeled by a set of taxa. These tools can be classified into four categories: (1) evolutionary network representation: reading/writing evolutionary networks in a newly devised compact form; (2) evolutionary network characterization: analyzing evolutionary networks in terms of three basic building blocks (trees, clusters, and tripartitions); (3) evolutionary network comparison: comparing two evolutionary networks in terms of topological dissimilarities, as well as fitness to sequence evolution under a maximum parsimony criterion; and (4) evolutionary network reconstruction: reconstructing an evolutionary network from a species tree and a set of gene trees. Conclusion: The software package, PhyloNet, offers an array of utilities to allow for efficient and accurate analysis of evolutionary networks. The software package will help significantly in analyzing large data sets, as well as in studying the performance of evolutionary network reconstruction methods. Further, the software package supports the proposed eNewick format for compact representation of evolutionary networks, a feature that allows for efficient interoperability of evolutionary network software tools. Currently, all utilities in PhyloNet are invoked on the command line.
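
The abstract above mentions characterizing and comparing evolutionary histories in terms of clusters. As a rough illustration of that idea only, the sketch below computes the clusters (leaf sets below each internal node) of a rooted tree and counts the clusters on which two trees disagree. It assumes plain trees written as nested Python tuples rather than networks, and the function names `clusters` and `cluster_distance` are hypothetical; none of this reflects PhyloNet's actual interface or its network measures.

```python
def clusters(tree):
    """Return the non-trivial leaf clusters of a rooted tree.

    The tree is a nested tuple whose leaves are taxon-name strings
    (an illustrative encoding, not the eNewick format).
    """
    found = set()

    def leaves(node):
        # A leaf contributes only its own name and defines no cluster here.
        if isinstance(node, str):
            return frozenset([node])
        # An internal node's cluster is the union of its children's leaf sets.
        below = frozenset().union(*(leaves(child) for child in node))
        found.add(below)
        return below

    all_leaves = leaves(tree)
    # The root cluster (all taxa) is shared by every tree on the same taxa.
    found.discard(all_leaves)
    return found


def cluster_distance(tree_a, tree_b):
    """Count clusters present in exactly one of the two trees."""
    return len(clusters(tree_a) ^ clusters(tree_b))


if __name__ == "__main__":
    t1 = (("A", "B"), ("C", "D"))
    t2 = (("A", "C"), ("B", "D"))
    print(cluster_distance(t1, t2))
```

Comparing networks rather than trees needs more care, because a reticulate node has more than one parent and the set of clusters a network displays is defined differently; the sketch is only meant to make the tree case concrete.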

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A new, fast algorithm for detecting protein coevolution using maximum compatible cliques

Author: A Rodionov
A Valencia
AK Ramani
Alex Rodionov
Alexandr Bezginov
AM Altenhoff
D MacLeod
D Robinson
Elisabeth RM Tillier
ERM Tillier
ERM Tillier
F Pazos
F Pazos
GW Clark
J Felsenstein
J Felsenstein
Jonathan Rose
K Katoh
MK Kuhner
PRJ Östergård
R Jothi
RG Beiko
RM Karp
S Razick
T Sato
V Soria-Carrasco
W Li
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The MatrixMatchMaker algorithm was recently introduced to detect the similarity between phylogenetic trees and thus the coevolution between proteins. MMM finds the largest common submatrices between pairs of phylogenetic distance matrices, and has numerous advantages over existing methods of coevolution detection. However, these advantages came at the cost of a very long execution time. Results: In this paper, we show that the problem of finding the maximum submatrix reduces to a multiple maximum clique subproblem on a graph of protein pairs. This allowed us to develop a new algorithm and program implementation, MMMvII, which achieved more than 600× speedup with comparable accuracy to the original MMM. Conclusions: MMMvII will thus allow for more extensive and intricate analyses of coevolution. Availability: An implementation of the MMMvII algorithm is available at: http://www.uhnresearch.ca/labs/tillier/MMMWEBvII/MMMWEBvII.php
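
The reduction from a largest common submatrix to a maximum clique can be illustrated with a much-simplified sketch. It assumes each protein family has exactly one sequence per taxon, that the two distance matrices list taxa in the same order, and that two distances "match" when they differ by at most a relative `tolerance`. These are illustrative assumptions, not the MMMvII formulation, which works on a graph of protein pairs. The function names are hypothetical, and the clique search is a textbook Bron–Kerbosch with pivoting rather than the authors' algorithm.

```python
from itertools import combinations


def compatibility_graph(dist_a, dist_b, tolerance=0.1):
    """Link taxa i and j when both matrices agree on their distance."""
    n = len(dist_a)
    adj = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        a, b = dist_a[i][j], dist_b[i][j]
        # Relative agreement test; the tolerance value is an assumption.
        if abs(a - b) <= tolerance * max(a, b):
            adj[i].add(j)
            adj[j].add(i)
    return adj


def max_clique(adj):
    """Return one maximum clique (sorted) via Bron-Kerbosch with pivoting."""
    best = []

    def expand(r, p, x):
        nonlocal best
        if not p and not x:
            # r is a maximal clique; keep it if it is the largest so far.
            if len(r) > len(best):
                best = sorted(r)
            return
        # Pivot on the vertex with the most neighbours among the candidates.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return best


if __name__ == "__main__":
    dist_a = [[0, 1, 2, 3], [1, 0, 1.5, 2.5], [2, 1.5, 0, 2], [3, 2.5, 2, 0]]
    dist_b = [[0, 1, 2, 9], [1, 0, 1.5, 8], [2, 1.5, 0, 7], [9, 8, 7, 0]]
    print(max_clique(compatibility_graph(dist_a, dist_b)))
```

Every pair of taxa in the returned clique has matching distances in both matrices, so the clique indexes a common submatrix, and a maximum clique gives a largest one. Maximum clique is NP-hard in general, so this exhaustive search is practical only for small inputs.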

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Phylogenetic Detection of Recombination with a Bayesian Prior on the Distance between Trees

Author: A Gelman
AC Siepel
AJ Drummond
BL Allen
C Wiuf
CX Chan
D Bryant
D Husmeier
D Husmeier
D Husmeier
D MacLeod
D Posada
D Posada
DF Robinson
DL Swofford
F Al-Awadhi
F Fang
F Ge
F Ronquist
G Altekar
G Hickey
G McVean
GB Golding
GF Weiller
H Kishino
Hirohisa Kishino
I DiMatteo
J Felsenstein
J Felsenstein
J Hein
J Hein
J Hey
JL Thorne
JP Huelsenbeck
L Nakhleh
L Nakhleh
Leonardo de Oliveira Martins
M Hasegawa
M Sierra
M Steel
MA Suchard
MA Suchard
Mark Isalan
MHS Jotun Hein
MK Kuhner
ML Rajaram
MO Salminen
MT Hallett
NC Grassly
P Awadalla
P Fearnhead
P Lefeuvre
R Nielsen
RC Griffiths
RG Beiko
RG Beiko
RR Hudson
RR Hudson
VN Minin
VN Minin
W Hordijk
YS Song
YS Song
YS Song
Z Yang
Élcio Leal
Publication venue: Public Library of Science
Publication date: 07/06/2008

Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number of taxa. If, however, the detection analysis is conducted independently between each putative recombinant sequence and a set of reference parentals, potential recombinations between the recombinants are neglected. In this context, a recombination hotspot can be inferred in phylogenetic analyses if we observe several consecutive breakpoints. We developed a distance measure between unrooted topologies that closely resembles the number of recombinations. By introducing a prior distribution on these recombination distances, a Bayesian hierarchical model was devised to detect phylogenetic inconsistencies occurring due to recombinations. This model relaxes the assumption of known parental sequences, still common in HIV analysis, allowing the entire dataset to be analyzed at once. On simulated datasets with up to 16 taxa, our method correctly detected recombination breakpoints and the number of recombination events for each breakpoint. The procedure is robust to rate and transition∶transversion heterogeneities for simulations with and without recombination. This recombination distance is related to recombination hotspots. Applying this procedure to a genomic HIV-1 dataset, we found evidence for hotspots and de novo recombination

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Repositório Institucional UNIFESP

Directory of Open Access Journals

PubMed Central

Spiral - Imperial College Digital Repository

Evidence, Content and Corroboration and the Tree of Life

Author: AG Kluge
AG Kluge
AG Kluge
AG Kluge
AG Kluge
AG Kluge
AG Kluge
C Darwin
D Agosti
DP Faith
DP Faith
DP Faith
DP Faith
E Bapteste
E. Kurt Lienau
EK Lienau
EK Lienau
J Felsenstein
J Felsenstein
J Felsenstein
J Felsenstein
J Gatesy
J Gatesy
J Gatesy
JM Carpenter
JS Farris
JS Farris
JS Farris
K Queiroz de
KG Helfenbein
KR Popper
KR Popper
MC Pinna de
ME Siddall
MF Mickevich
MJ Novacek
MP Simmons
O Rieppel
O Rieppel
ORP Bininda-Emonds
RG Beiko
RG Beiko
Rob DeSalle
T Grant
W Fitch
W Hennig
W Wheeler
WC Wheeler
WF Doolittle
Publication venue: Springer Netherlands
Publication date: 01/01/2008

We examine three critical aspects of Popper’s formulation of the ‘Logic of Scientific Discovery’—evidence, content and degree of corroboration—and place these concepts in the context of the Tree of Life (ToL) problem with particular reference to molecular systematics. Content, in the sense discussed by Popper, refers to the breadth and scope of existence that a hypothesis purports to explain. Content, in conjunction with the amount of available and relevant evidence, determines the testability, or potential degree of corroboration, of a statement; content distinguishes scientific hypotheses from metaphysical assertions. Degree of corroboration refers to the relative and tentative confidence assigned to one hypothesis over another, based upon the performance of each under critical tests. Here we suggest that systematists attempt to maximize content and evidence to increase the potential degree of corroboration in all phylogenetic endeavors. Discussion of this “total evidence” approach leads to several interesting conclusions about generating ToL hypotheses

Crossref

Springer - Publisher Connector

PubMed Central

Identifying Currents in the Gene Pool for Bacterial Populations Using an Integrative Approach

Author: A Mascioni
B Silverman
C Chan
C Fraser
C Robert
Christophe Fraser
D Alber
D Bryant
D Falush
D Falush
D Hartl
DM Vu
EC Holmes
EJ Feil
EJ Feil
J Corander
J Corander
J Corander
J Corander
J Corander
J Corander
Jing Tang
Jukka Corander
K Tamura
KA Jolley
KA Jolley
KA Jolley
KA Jolley
KY Yeung
M Suchard
MCJ Maiden
P Marttinen
Philip E. Bourne
RG Beiko
RJ Whitaker
RR Hudson
SK Sheppard
VN Minin
William P. Hanage
WP Hanage
X Didelot
Publication venue: Public Library of Science
Publication date: 01/08/2009

The evolution of bacterial populations has recently become considerably better understood due to large-scale sequencing of population samples. It has become clear that DNA sequences from a multitude of genes, as well as a broad sample coverage of a target population, are needed to obtain a relatively unbiased view of its genetic structure and the patterns of ancestry connected to the strains. However, the traditional statistical methods for evolutionary inference, such as phylogenetic analysis, are associated with several difficulties under such an extensive sampling scenario, in particular when a considerable amount of recombination is anticipated to have taken place. To meet the needs of large-scale analyses of population structure for bacteria, we introduce here several statistical tools for the detection and representation of recombination between populations. Also, we introduce a model-based description of the shape of a population in sequence space, in terms of its molecular variability and affinity towards other populations. Extensive real data from the genus Neisseria are utilized to demonstrate the potential of an approach where these population genetic tools are combined with a phylogenetic analysis. The statistical tools introduced here are freely available in BAPS 5.2 software, which can be downloaded from http://web.abo.fi/fak/mnf/mate/jc/software/baps.html

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Phylogenomic Analysis of Marine Roseobacters

Author: A Buchan
A Stamatakis
C Dutta
Carl Kingsford
Cathy H. Wu
CH Wu
CJ Creevey
CM Thomas
D Posada
DF Robinso
E Bapteste
E Lerat
E Susko
F Abascal
G Bouxin
G Talavera
GT Taylor
H Ochman
H Shimodaira
H Shimodaira
HA Schmidt
Hongzhan Huang
I Wagner-Dobler
I Wagner-Dobler
J Bergsten
J Castresana
J Felsenstein
JA Eisen
JD Thompson
JP Gogarten
JP Gogarten
JP Huelsenbeck
JR Brown
Kai Tang
KH Tang
L Li
LM Schouls
MA Moran
MS Poptsova
N Galtier
Nianzhi Jiao
NZ Jiao
O Zhaxybayeva
O Zhaxybayeva
R Jain
R Seshadri
RD Page
RG Beiko
RG Beiko
RL Charlebois
RL Tatusov
RS Poretsky
S Guindon
SF Altschul
SJ Sorensen
SM Sowell
T Brinkhoff
T Shi
TR Miller
V Daubin
VM Markowitz
Y Zhang
Y Zhao
ZS Kolber
Publication venue: Public Library of Science
Publication date: 01/01/2010

Background: Members of the Roseobacter clade, which play a key role in the biogeochemical cycles of the ocean, are diverse and abundant, comprising 10–25% of the bacterioplankton in most marine surface waters. The rapid accumulation of whole-genome sequence data for the Roseobacter clade allows us to obtain a clearer picture of its evolution. Methodology/Principal Findings: In this study about 1,200 likely orthologous protein families were identified from 17 Roseobacter bacteria genomes. Functional annotations for these genes are provided by iProClass. Phylogenetic trees were constructed for each gene using maximum likelihood (ML) and neighbor joining (NJ). Putative organismal phylogenetic trees were built with phylogenomic methods. These trees were compared and analyzed using principal coordinates analysis (PCoA), approximately unbiased (AU) and Shimodaira–Hasegawa (SH) tests. A core set of 694 genes with vertical descent signal that are resistant to horizontal gene transfer (HGT) is used to reconstruct a robust organismal phylogeny. In addition, we also discovered the most likely 109 HGT genes. The core set contains genes that encode ribosomal apparatus, ABC transporters and chaperones often found in the environmental metagenomic and metatranscriptomic data. These genes in the core set are spread out uniformly among the various functional classes and biological processes. Conclusions/Significance: Here we report a new multigene-derived phylogenetic tree of the Roseobacter clade. Of particular interest is the HGT of eleven genes involved in vitamin B12 synthesis as well as key enzymes fo

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Xiamen University Institutional Repository

A new sensitive PCR assay for one-step detection of 12 IDH1/2 mutations in glioma

Author: A Borodovsky
A von Deimling
AC Tsiatis
B Angulo
B Boisselier
C Hartmann
C Horbinski
C Houillier
CM Ida
D Capper
D Capper
D Loussouarn
D Rohle
DN Louis
H Yan
HJ Dubbink
J Beiko
J Meyer
JG Cairncross
JW Taylor
L Thomas
M Bujko
M Preusser
M Preusser
M Weller
M Weller
MJ van den Bent
MJ van den Bent
NCCN Clinical Practice Guidelines in Oncology (NCCN Guidelines)
P Setty
PA Wayne
PA Wayne
R Gupta
R Soffietti
S Turcan
W Wick
W Wick
Publication venue: Springer Science and Business Media LLC

Crossref

Bioinformatics and Structural Characterization of a Hypothetical Protein from Streptococcus mutans: Implication of Antibiotic Resistance

Author: AM Waterhouse
B Nyvad
BJ Paster
BW Matthews
C Vonrhein
CI Kado
D Ajdic
D Wilson
DM Deng
E Hoshino
E Krissinel
Erik Brostromer
G Niu
H Ochman
H Ochman
J Carlsson
J Dundas
J Wen
JD Peterson
JE Mogensen
JG Lawrence
JG Lawrence
Jie Nan
K Cowtan
L Holm
L Holm
L Jaroszewski
M Chevalier
MA Ragan
Ole Kristensen
PD Adams
PV Afonine
RC Edgar
RD Finn
RD Page
RG Beiko
RJ Zawada
SF Altschul
SP Wilkinson
TA Jones
TC Terwilliger
Wenqing Xu
WJ Loesche
WL DeLano
XD Su
Xiang-Yu Liu
Xiao-Dong Su
Z Otwinowski
Publication venue: Public Library of Science
Publication date: 01/01/2009

As an oral bacterial pathogen, Streptococcus mutans has been known as the aetiologic agent of human dental caries. Among a total of 1960 identified proteins within the genome of this organism, there are about 500 without any known functions. One of these proteins, SMU.440, has very few homologs in the current protein databases and it does not fall into any protein functional families. Phylogenetic studies showed that SMU.440 is related to a particular ecological niche and conserved specifically in some oral pathogens, due to lateral gene transfer. The co-occurrence of a MarR protein within the same operon among these oral pathogens suggests that SMU.440 may be associated with antibiotic resistance. The structure determination of SMU.440 revealed that it shares the same fold and a similar pocket as polyketide cyclases, which indicated that it is very likely to bind some polyketide-like molecules. From the interlinking structural and bioinformatics studies, we have concluded that SMU.440 could be involved in polyketide-like antibiotic resistance, providing a better understanding of this hypothetical protein. Besides, the combination of multiple methods in this study can be used as a general approach for functional studies of a protein with unknown function

CiteSeerX

Public Library of Science (PLOS)

Lund University Publications

Crossref

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System