Search CORE

48,528 research outputs found

Analysis of 1,000+ Type-Strain Genomes Substantially Improves Taxonomic Classification of Alphaproteobacteria.

Author: Gronow Sabine
Göker Markus
Hördt Anton
Kyrpides Nikos C
López Marina García
Meier-Kolthoff Jan P
Schleuning Marcel
Tindall Brian J
Weinhold Lisa-Maria
Woyke Tanja
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

The class Alphaproteobacteria is comprised of a diverse assemblage of Gram-negative bacteria that includes organisms of varying morphologies, physiologies and habitat preferences many of which are of clinical and ecological importance. Alphaproteobacteria classification has proved to be difficult, not least when taxonomic decisions rested heavily on a limited number of phenotypic features and interpretation of poorly resolved 16S rRNA gene trees. Despite progress in recent years regarding the classification of bacteria assigned to the class, there remains a need to further clarify taxonomic relationships. Here, draft genome sequences of a collection of genomes of more than 1000 Alphaproteobacteria and outgroup type strains were used to infer phylogenetic trees from genome-scale data using the principles drawn from phylogenetic systematics. The majority of taxa were found to be monophyletic but several orders, families and genera, including taxa recognized as problematic long ago but also quite recent taxa, as well as a few species were shown to be in need of revision. According proposals are made for the recognition of new orders, families and genera, as well as the transfer of a variety of species to other genera and of a variety of genera to other families. In addition, emended descriptions are given for many species mainly involving information on DNA G+C content and (approximate) genome size, both of which are confirmed as valuable taxonomic markers. Similarly, analysis of the gene content was shown to provide valuable taxonomic insights in the class. Significant incongruities between 16S rRNA gene and whole genome trees were not found in the class. The incongruities that became obvious when comparing the results of the present study with existing classifications appeared to be caused mainly by insufficiently resolved 16S rRNA gene trees or incomplete taxon sampling. Another probable cause of misclassifications in the past is the partially low overall fit of phenotypic characters to the sequence-based tree. Even though a significant degree of phylogenetic conservation was detected in all characters investigated, the overall fit to the tree varied considerably

OPUS Augsburg

eScholarship - University of California

Analysis of 81 Genes From 64 Plastid Genomes Resolves Relationships in Angiosperms and Identifies Genome-Scale Evolutionary Patterns

Author: Boore Jeffrey L.
Cai Zhenggiu
Chumley Timothy W.
Daniell Henry
dePamphilis Claude W.
Guisinger-Bellian Mary
Haberie Rosemarie C.
Hansen Anne K.
Jansen Robert K.
Keuhl Jennifer V.
Lee Seung-Bum
Leebens-Mack James
McNeal Joel R.
Muller Kai F.
Peery Rhiannon
Raubeson Linda A.
Publication venue: ScholarlyCommons
Publication date: 01/01/2007
Field of study

Angiosperms are the largest and most successful clade of land plants with \u3e250,000 species distributed in nearly every terrestrial habitat. Many phylogenetic studies have been based on DNA sequences of one to several genes, but, despite decades of intensive efforts, relationships among early diverging lineages and several of the major clades remain either incompletely resolved or weakly supported. We performed phylogenetic analyses of 81 plastid genes in 64 sequenced genomes, including 13 new genomes, to estimate relationships among the major angiosperm clades, and the resulting trees are used to examine the evolution of gene and intron content. Phylogenetic trees from multiple methods, including model-based approaches, provide strong support for the position of Amborella as the earliest diverging lineage of flowering plants, followed by Nymphaeales and Austrobaileyales. The plastid genome trees also provide strong support for a sister relationship between eudicots and monocots, and this group is sister to a clade that includes Chloranthales and magnoliids. Resolution of relationships among the major clades of angiosperms provides the necessary framework for addressing numerous evolutionary questions regarding the rapid diversification of angiosperms. Gene and intron content are highly conserved among the early diverging angiosperms and basal eudicots, but 62 independent gene and intron losses are limited to the more derived monocot and eudicot clades. Moreover, a lineage-specific correlation was detected between rates of nucleotide substitutions, indels, and genomic rearrangements

ScholarWorks at Central Washington University

PubMed Central

ScholarlyCommons@Penn

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Genome-scale phylogenetic analysis finds extensive gene transfer among Fungi

Author: Boussau Bastien
Daubin Vincent
Davín Adrián Arellano
Szöllősi Gergely J.
Tannier Eric
Publication venue
Publication date: 01/01/2015
Field of study

Although the role of lateral gene transfer is well recognized in the evolution of bacteria, it is generally assumed that it has had less influence among eukaryotes. To explore this hypothesis we compare the dynamics of genome evolution in two groups of organisms: Cyanobacteria and Fungi. Ancestral genomes are inferred in both clades using two types of methods. First, Count, a gene tree unaware method that models gene duplications, gains and losses to explain the observed numbers of genes present in a genome. Second, ALE, a more recent gene tree-aware method that reconciles gene trees with a species tree using a model of gene duplication, loss, and transfer. We compare their merits and their ability to quantify the role of transfers, and assess the impact of taxonomic sampling on their inferences. We present what we believe is compelling evidence that gene transfer plays a significant role in the evolution of Fungi

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PubMed Central

HAL

Repository of the Academy's Library

ELTE Digital Institutional Repository (EDIT)

Hal-Diderot

The effect of primer choice and short read sequences on the outcome of 16S rRNA gene based diversity studies

Author: De Vos Paul
Ghyselinck Jonas
Heylen Kim
Pfeiffer Stefan
Sessitsch Angela
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Different regions of the bacterial 16S rRNA gene evolve at different evolutionary rates. The scientific outcome of short read sequencing studies therefore alters with the gene region sequenced. We wanted to gain insight in the impact of primer choice on the outcome of short read sequencing efforts. All the unknowns associated with sequencing data, i.e. primer coverage rate, phylogeny, OTU-richness and taxonomic assignment, were therefore implemented in one study for ten well established universal primers (338f/r, 518f/r, 799f/r, 926f/r and 1062f/r) targeting dispersed regions of the bacterial 16S rRNA gene. All analyses were performed on nearly full length and in silico generated short read sequence libraries containing 1175 sequences that were carefully chosen as to present a representative substitute of the SILVA SSU database. The 518f and 799r primers, targeting the V4 region of the 16S rRNA gene, were found to be particularly suited for short read sequencing studies, while the primer 1062r, targeting V6, seemed to be least reliable. Our results will assist scientists in considering whether the best option for their study is to select the most informative primer, or the primer that excludes interferences by host-organelle DNA. The methodology followed can be extrapolated to other primers, allowing their evaluation prior to the experiment

CiteSeerX

Ghent University Academic Bibliography

Directory of Open Access Journals

PubMed Central

FigShare

A MOSAIC of methods: Improving ortholog detection through integration of algorithmic diversity

Author: Hernandez Ryan D.
Maher M. Cyrus
Publication venue
Publication date: 18/04/2014
Field of study

Ortholog detection (OD) is a critical step for comparative genomic analysis of protein-coding sequences. In this paper, we begin with a comprehensive comparison of four popular, methodologically diverse OD methods: MultiParanoid, Blat, Multiz, and OMA. In head-to-head comparisons, these methods are shown to significantly outperform one another 12-30% of the time. This high complementarity motivates the presentation of the first tool for integrating methodologically diverse OD methods. We term this program MOSAIC, or Multiple Orthologous Sequence Analysis and Integration by Cluster optimization. Relative to component and competing methods, we demonstrate that MOSAIC more than quintuples the number of alignments for which all species are present, while simultaneously maintaining or improving functional-, phylogenetic-, and sequence identity-based measures of ortholog quality. Further, we demonstrate that this improvement in alignment quality yields 40-280% more confidently aligned sites. Combined, these factors translate to higher estimated levels of overall conservation, while at the same time allowing for the detection of up to 180% more positively selected sites. MOSAIC is available as python package. MOSAIC alignments, source code, and full documentation are available at http://pythonhosted.org/bio-MOSAIC

arXiv.org e-Print Archive

FigShare

PLAZA 4.0 : an integrative resource for functional, evolutionary and comparative plant genomics

Author: Botzki Alexander
Coppens Frederik
Diels Tim
Kreft Lukasz
Van Bel Michiel
Van de Peer Yves
Vancaester Emmelien
Vandepoele Klaas
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018
Field of study

PLAZA (https://bioinformatics.psb.ugent.be/plaza) is a plant-oriented online resource for comparative, evolutionary and functional genomics. The PLAZA platform consists of multiple independent instances focusing on different plant clades, while also providing access to a consistent set of reference species. Each PLAZA instance contains structural and functional gene annotations, gene family data and phylogenetic trees and detailed gene colinearity information. A user-friendly web interface makes the necessary tools and visualizations accessible, specific for each data type. Here we present PLAZA 4.0, the latest iteration of the PLAZA framework. This version consists of two new instances (Dicots 4.0 and Monocots 4.0) providing a large increase in newly available species, and offers access to updated and newly implemented tools and visualizations, helping users with the ever-increasing demands for complex and in-depth analyzes. The total number of species across both instances nearly doubles from 37 species in PLAZA 3.0 to 71 species in PLAZA 4.0, with a much broader coverage of crop species (e.g. wheat, palm oil) and species of evolutionary interest (e.g. spruce, Marchantia). The new PLAZA instances can also be accessed by a programming interface through a RESTful web service, thus allowing bioinformaticians to optimally leverage the power of the PLAZA platform

Crossref

Ghent University Academic Bibliography

Archivsystem Ask23

UPSpace at the University of Pretoria