3,142 research outputs found

The genome of the medieval Black Death agent (extended abstract)

Author: Chauve Cedric
Rajaraman Ashok
Tannier Eric
Publication date: 29/07/2013

The genome of a 650-year-old Yersinia pestis bacterium, responsible for the medieval Black Death, was recently sequenced and assembled into 2,105 contigs from the main chromosome. According to the point mutation record, the medieval bacterium could be an ancestor of most extant Yersinia pestis strains, which opens the way to reconstructing the organization of these contigs using a comparative approach. We show that recent computational paleogenomics methods, which aim to reconstruct the organization of ancestral genomes from the comparison of extant genomes, can be used to correct, order and complete the contig set of the Black Death agent genome, providing a full chromosome sequence, at the nucleotide scale, of this ancient bacterium. This sequence suggests that a burst of mobile element insertions predated the Black Death, leading to exceptional genome plasticity and an increase in rearrangement rate. Comment: Extended abstract of a talk presented at the conference JOBIM 2013, https://colloque.inra.fr/jobim2013_eng/. Full paper submitte

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL

Hal-Diderot

The inference of gene trees with species trees

Author: Bastien Boussau
Eric Tannier
Gergely J. Szöllősi
Vincent Daubin
Publication date: 04/11/2013

Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species. Although the histories of genes and species are tightly linked, they are seldom identical, because genes duplicate, are lost or horizontally transferred, and because alleles can co-exist in populations for periods that may span several speciation events. Building models describing the relationship between gene and species trees can thus improve the reconstruction of gene trees when a species tree is known, and vice versa. Several approaches have been proposed to solve the problem in one direction or the other, but in general neither gene trees nor species trees are known. Only a few studies have attempted to jointly infer gene trees and species trees. In this article we review the various models that have been used to describe the relationship between gene trees and species trees. These models account for gene duplication and loss, transfer or incomplete lineage sorting. Some of them consider several types of events together, but none currently exists that considers the full repertoire of processes that generate gene trees along the species tree. Simulations as well as empirical studies on genomic data show that combining gene tree-species tree models with models of sequence evolution improves gene tree reconstruction. In turn, these better gene trees provide a better basis for studying genome evolution or reconstructing ancestral chromosomes and ancestral gene sequences. We predict that gene tree-species tree methods that can deal with genomic data sets will be instrumental to advancing our understanding of genomic evolution. Comment: Review article in relation to the "Mathematical and Computational Evolutionary Biology" conference, Montpellier, 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

PubMed Central

HAL

Repository of the Academy's Library

ELTE Digital Institutional Repository (EDIT)

Hal-Diderot

Comparative pan-genome analysis of Piscirickettsia salmonis reveals genomic divergences within genogroups

Author: Cárcamo J.G.
Espinoza-Rojas D.A.
Figueroa J.E.
Mancilla M.
Maracaja-Coutinho V.
Molina C.F.
Nourdin-Galindo G.
Oliver C.
Ruiz P.
Sánchez P.
Vargas-Chacoff L.
Yañez A.J.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017

Indexed in: Scopus. Piscirickettsia salmonis is the etiological agent of salmonid rickettsial septicemia, a disease that seriously affects the salmonid industry. Despite efforts to genomically characterize P. salmonis, functional information on the life cycle, pathogenesis mechanisms, diagnosis, treatment, and control of this fish pathogen remains lacking. To address this knowledge gap, the present study conducted an in silico pan-genome analysis of 19 P. salmonis strains from distinct geographic locations and genogroups. Results revealed an expected open pan-genome of 3,463 genes and a core-genome of 1,732 genes. Two marked genogroups were identified, as confirmed by phylogenetic and phylogenomic relationships to the LF-89 and EM-90 reference strains, as well as by assessments of genomic structures. Different structural configurations were found for the six identified copies of the ribosomal operon in the P. salmonis genome, indicating translocation throughout the genetic material. Chromosomal divergences in genomic localization and quantity of genetic cassettes were also found for the Dot/Icm type IVB secretion system. To determine divergences between core-genomes, additional pan-genome descriptions were compiled for the so-termed LF and EM genogroups. Open pan-genomes composed of 2,924 and 2,778 genes and core-genomes composed of 2,170 and 2,228 genes were respectively found for the LF and EM genogroups. The core-genomes were functionally annotated using the Gene Ontology, KEGG, and Virulence Factor databases, revealing the presence of several shared groups of genes related to basic functions of intracellular survival and bacterial pathogenesis. Additionally, the specific pan-genomes for the LF and EM genogroups were defined, resulting in the identification of 148 and 273 exclusive proteins, respectively. Notably, specific virulence factors linked to adherence, colonization, invasion factors, and endotoxins were established.
The obtained data suggest that these genes could be directly associated with inter-genogroup differences in pathogenesis and host-pathogen interactions, information that could be useful in designing novel strategies for diagnosing and controlling P. salmonis infection. © 2017 Nourdin-Galindo, Sánchez, Molina, Espinoza-Rojas, Oliver, Ruiz, Vargas-Chacoff, Cárcamo, Figueroa, Mancilla, Maracaja-Coutinho and Yañez. https://www.frontiersin.org/articles/10.3389/fcimb.2017.00459/ful

Frontiers - Publisher Connector

Repositorio Institucional Académico Universidad Andrés Bello

Bacterial microevolution and the Pangenome

Author: A Bankevich
AE Darling
AJ Page
AO Kislyuk
B Charlesworth
C Buckee
C Collins
C Wiuf
CM Thomas
CS Pepperell
DJ Wilson
DR Zerbino
E Jacox
F Lassalle
GE Sims
GJ Szollosi
H Ochman
IJ Wilson
J Hedge
J Lawrence
JB Joy
JFC Kingman
KAA Jolley
KE Dingle
KT Konstantinidis
L Li
L Petersen
M Csurös
M Nordborg
M Pagel
M Steinegger
M Touchon
M Vos
MJ Ward
MTG Holden
NA Rosenberg
NJ Croucher
P Donnelly
PAP Moran
R Griffiths
RC Griffiths
RG Everitt
RK Aziz
S Castillo-Ramírez
S Kurtz
S Wright
SF Altschul
SK Sheppard
SS Abby
SV Angiuoli
T Ohta
T Seemann
TG Vaughan
WP Maddison
X Didelot
Z Yang
Z Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2020

The comparison of multiple genome sequences sampled from a bacterial population reveals considerable diversity in both the core and the accessory parts of the pangenome. This diversity can be analysed in terms of microevolutionary events that took place since the genomes shared a common ancestor, especially deletion, duplication, and recombination. We review the basic modelling ingredients used implicitly or explicitly when performing such a pangenome analysis. In particular, we describe a basic neutral phylogenetic framework of bacterial pangenome microevolution, which is not incompatible with evaluating the role of natural selection. We survey the different ways in which pangenome data is summarised in order to be included in microevolutionary models, as well as the main methodological approaches that have been proposed to reconstruct pangenome microevolutionary history

Crossref

Warwick Research Archives Portal Repository

Global Functional Atlas of Escherichia coli Encompassing Previously Uncharacterized Proteins

Author: Ali Mehrab
Babu Mohan
Butland Gareth
Chandran Shamanta
Christopolous Constantine
Emili Andrew
Eroukova Veronika
Golshani Ashkan
Greenblatt Jack F.
Guao Xinghua
Hu Pingzhao
Janga Sarah Chandra
Moreno-Hagelsieb Gabriel
Musso Gabriela
Nazarians-Armavil Anaies
Nazemof Nazila
Paccanaro Alberto
Phanse Sadhna
Pogoutse Oxana
Wong Peter
Yang Wenhong
Publication venue: Scholars Commons @ Laurier
Publication date: 01/04/2009

One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans’ biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a “systems-wide” functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins

Wilfrid Laurier University

Structure, Function, and Evolution of the Thiomonas spp. Genome

Author: A Derbise
A Hartwig
A Schluter
AB Olshen
AM Earl
AM Nascimento
Audrey Heinrich-Salmeron
Aurélie Lieutaud
B Lesic
C Baker-Austin
C Casiot
C Fraser
C Workman
Caroline Michel
Caroline Proux
Catherine Joulian
CG Bryan
CG Friedrich
Christopher G. Bryan
Claudine Médigue
Céline Brochier-Armanet
D Costechareyre
D Medini
D Moreira
D Muller
D Vallenet
Daniel Muller
David Roche
DB Johnson
Didier Lièvremont
Djamila Slyemi
E Maiques
EC Berglund
Emmanuel Talla
Evelyne Krin
F Battaglia-Brunet
F Boyer
Fabienne Battaglia-Brunet
Florence Arsène-Ploetze
Florence Hommais
G Perriere
G Zeidner
GC Kettler
Grégory Salvignol
H Brussow
H Tettelin
HL Hamilton
J Chen
J De Ley
J Hacker
J London
JA Schrader
JD Thompson
Jean Weissenbach
Jean-Yves Coppée
Jessica Cleiss-Arnold
JJ Harrison
JL Slonczewski
JP Benzécri
K Coupland
K Duquesne
K Stingl
KB Hallberg
LG Wayne
LJ Rothschild
M Achtman
M Juhas
M Li
Marie Marchal
Mathieu Erhardt
Michael Chandler
ML Coleman
Mohamed Barakat
Nancy A. Moran
NU Frigaard
O Bruneel
Odile Bruneel
Patricia Siguier
Philippe N. Bertin
Philippe Ortet
PM Sharp
R Hengge
R Iyer
S Bentley
S Fukiya
S Silver
Sandrine Koechler
Stéphane Cruveiller
Stéphanie Weiss
T Dammeyer
T Schirmer
T Schwerdtle
TJ Treangen
TT Binnewies
V Burrus
V Sentchilo
Valérie Barbe
VAR Huss
Violaine Bonnefoy
Y Katayama
YL Meng
Zoé Rouy
Publication venue: Public Library of Science
Publication date: 01/01/2010

Bacteria of the Thiomonas genus are ubiquitous in extreme environments, such as arsenic-rich acid mine drainage (AMD). The genome of one of these strains, Thiomonas sp. 3As, was sequenced, annotated, and examined, revealing specific adaptations allowing this bacterium to survive and grow in its highly toxic environment. In order to explore genomic diversity as well as genetic evolution in Thiomonas spp., a comparative genomic hybridization (CGH) approach was used on eight different strains of the Thiomonas genus, including five strains of the same species. Our results suggest that the Thiomonas genome has evolved through the gain or loss of genomic islands and that this evolution is influenced by the specific environmental conditions in which the strains live

HAL AMU

Directory of Open Access Journals

Public Library of Science (PLOS)

Red de Bibliotecas Virtuales de Ciencias Sociales de América Latina y El Caribe

Horizon / Pleins textes

HAL-Pasteur

Genes Translocated into the Plastid Inverted Repeat Show Decelerated Substitution Rates and Elevated GC Content.

Author: Kuo Li-Yaung
Li Fay-Wei
Pryer Kathleen
Rothfels Carl
Publication venue: eScholarship, University of California
Publication date: 10/07/2016

Plant chloroplast genomes (plastomes) are characterized by an inverted repeat (IR) region and two larger single copy (SC) regions. Patterns of molecular evolution in the IR and SC regions differ, most notably by a reduced rate of nucleotide substitution in the IR compared to the SC region. In addition, the organization and structure of plastomes is fluid, and rearrangements through time have repeatedly shuffled genes into and out of the IR, providing recurrent natural experiments on how chloroplast genome structure can impact rates and patterns of molecular evolution. Here we examine four loci (psbA, ycf2, rps7, and rps12 exon 2-3) that were translocated from the SC into the IR during fern evolution. We use a model-based method, within a phylogenetic context, to test for substitution rate shifts. All four loci show a significant, 2- to 3-fold deceleration in their substitution rate following translocation into the IR, a phenomenon not observed in any other, nontranslocated plastid genes. Also, we show that after translocation, the GC content of the third codon position and of the noncoding regions is significantly increased, implying that gene conversion within the IR is GC-biased. Taken together, our results suggest that the IR region not only reduces substitution rates, but also impacts nucleotide composition. This finding highlights a potential vulnerability of correlating substitution rate heterogeneity with organismal life history traits without knowledge of the underlying genome structure
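The GC content of the third codon position mentioned in this abstract can be illustrated with a minimal Python sketch. It assumes an in-frame coding sequence containing only unambiguous bases; the function name `gc3_content` and the toy sequence are hypothetical and are not taken from the paper, whose model-based rate tests are not reproduced here.

```python
def gc3_content(cds: str) -> float:
    """Fraction of G or C among third codon positions of an in-frame CDS.

    Assumption: the sequence starts at codon position 1 and contains only
    A, C, G, T. A trailing partial codon is ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3   # drop any incomplete final codon
    third = seq[2:usable:3]            # every third base, starting at index 2
    if not third:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in third) / len(third)


# Hypothetical toy sequence: codons ATG, GCC, AAA have third bases G, C, A.
example = gc3_content("ATGGCCAAA")
```

Comparing this value for a locus before and after its translocation into the IR would be one simple way to look at the compositional shift the abstract describes, although the study itself relies on formal statistical tests.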

PubMed Central

eScholarship - University of California

Pan-genome Analysis, Visualization and Exploration

Author: Ding Wei
Publication venue: Universität Tübingen
Publication date: 01/01/2017

The dynamics of prokaryotic genomes are driven by the intricate interplay of different evolutionary forces such as gene duplication, gene loss and horizontal transfer. Even closely related strains can exhibit remarkable genetic diversity and substantial gene presence/absence variation. The pan-genome, namely the complete inventory of genes in a collection of strains, can be several times larger than the genome of any single strain. Although several tools for pan-genome analysis have been published, there is still much room for algorithmic improvement, as well as a need for applications that better support interactive visualization and exploration of pan-genomes. Therefore, we have developed panX, an automated computational pipeline for efficient identification of orthologous gene clusters in the pan-genome. PanX identifies homologous relationships among genes using DIAMOND and MCL and then harnesses phylogeny-based post-processing to separate orthologs from paralogs. Furthermore, we take advantage of a divide-and-conquer strategy to achieve an approximately linear runtime on large datasets. The analysis result can be visualized by the accompanying software, an easy-to-use and powerful web-based visualization application for interactive exploration of the pan-genome. The visualization dashboard encompasses a variety of connected components that allow rapid searching, filtering and sorting of genes and flexible investigation of evolutionary relationships among strains and their genes. PanX seamlessly interlinks gene clusters with their alignments and gene phylogenies, maps mutations on the branches of the gene tree and highlights gene gain and loss events on the core-genome phylogeny, which can also be colored by metadata associated with strains. In benchmarks on 120 simulated pan-genome datasets and in comparisons of clustering results between different tools on real datasets, panX exhibits overall good performance across a large range of diversities.
PanX is available at pangenome.de, with a wide range of microbial pan-genomes established. Besides, user-provided pan-genomes can be visualized either via a web server or by running panX locally as a web-based application
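The distinction between pan-genome and core genome used in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: genes are taken to be already grouped into ortholog clusters, each strain is represented as a set of cluster IDs, and the names `pan_and_core`, `toy` and the cluster labels are hypothetical. It does not reproduce the panX pipeline (DIAMOND, MCL and phylogeny-based post-processing).

```python
from typing import Dict, Set, Tuple


def pan_and_core(strain_genes: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Return (pan_genome, core_genome) from per-strain gene-cluster sets.

    pan_genome  = clusters present in at least one strain (union)
    core_genome = clusters present in every strain (intersection)
    """
    gene_sets = list(strain_genes.values())
    if not gene_sets:
        return set(), set()
    pan = set().union(*gene_sets)
    core = set.intersection(*gene_sets)
    return pan, core


# Hypothetical toy input: three strains, five gene clusters.
toy = {
    "strain_A": {"g1", "g2", "g3"},
    "strain_B": {"g1", "g2", "g4"},
    "strain_C": {"g1", "g3", "g5"},
}
pan, core = pan_and_core(toy)
accessory = pan - core  # clusters missing from at least one strain
```

In this toy case the pan-genome holds five clusters while each strain carries only three, which mirrors the statement that a pan-genome can be larger than the genome of any single strain.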

Publikationsserver der Universität Tübingen

PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

Author: Brittnacher Mitchell J
Fong Christine
Radey Matthew
Rohmer Laurence
Wasnick Michael
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes.
Results: PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function.
Conclusion: PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client side software setup or installation required. Source code is freely available to researchers interested in setting up a local version of PSAT for analysis of genomes not available through the public server. Access to the public web server and instructions for obtaining source code can be found at http://www.nwrce.org/psat.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central