8,002 research outputs found

    Bacterial riboproteogenomics : the era of N-terminal proteoform existence revealed

    Get PDF
    With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene annotation became a necessity. Multiple lines of evidence, however, suggest that current bacterial genome annotations may contain inconsistencies and are incomplete, even for so-called well-annotated genomes. We here discuss underexplored sources of protein diversity and new methodologies for high-throughput genome re-annotation. The expression of multiple molecular forms of proteins (proteoforms) from a single gene, particularly driven by alternative translation initiation, is gaining interest as a prominent contributor to bacterial protein diversity. In consequence, riboproteogenomic pipelines were proposed to comprehensively capture proteoform expression in prokaryotes by the complementary use of (positional) proteomics and the direct readout of translated genomic regions using ribosome profiling. To complement these discoveries, tailored strategies are required for the functional characterization of newly discovered bacterial proteoforms

    Initiator tRNA genes template the 3\u27 CCA end at high frequencies in bacteria.

    Get PDF
    BACKGROUND: While the CCA sequence at the mature 3\u27 end of tRNAs is conserved and critical for translational function, a genetic template for this sequence is not always contained in tRNA genes. In eukaryotes and Archaea, the CCA ends of tRNAs are synthesized post-transcriptionally by CCA-adding enzymes. In Bacteria, tRNA genes template CCA sporadically. RESULTS: In order to understand the variation in how prokaryotic tRNA genes template CCA, we re-annotated tRNA genes in tRNAdb-CE database version 0.8. Among 132,129 prokaryotic tRNA genes, initiator tRNA genes template CCA at the highest average frequency (74.1%) over all functional classes except selenocysteine and pyrrolysine tRNA genes (88.1% and 100% respectively). Across bacterial phyla and a wide range of genome sizes, many lineages exist in which predominantly initiator tRNA genes template CCA. Convergent and parallel retention of CCA templating in initiator tRNA genes evolved in independent histories of reductive genome evolution in Bacteria. Also, in a majority of cyanobacterial and actinobacterial genera, predominantly initiator tRNA genes template CCA. We also found that a surprising fraction of archaeal tRNA genes template CCA. CONCLUSIONS: We suggest that cotranscriptional synthesis of initiator tRNA CCA 3\u27 ends can complement inefficient processing of initiator tRNA precursors, bootstrap rapid initiation of protein synthesis from a non-growing state, or contribute to an increase in cellular growth rates by reducing overheads of mass and energy to maintain nonfunctional tRNA precursor pools. More generally, CCA templating in structurally non-conforming tRNA genes can afford cells robustness and greater plasticity to respond rapidly to environmental changes and stimuli

    Initiator tRNA genes template the 3' CCA end at high frequencies in bacteria.

    Get PDF
    BackgroundWhile the CCA sequence at the mature 3' end of tRNAs is conserved and critical for translational function, a genetic template for this sequence is not always contained in tRNA genes. In eukaryotes and Archaea, the CCA ends of tRNAs are synthesized post-transcriptionally by CCA-adding enzymes. In Bacteria, tRNA genes template CCA sporadically.ResultsIn order to understand the variation in how prokaryotic tRNA genes template CCA, we re-annotated tRNA genes in tRNAdb-CE database version 0.8. Among 132,129 prokaryotic tRNA genes, initiator tRNA genes template CCA at the highest average frequency (74.1%) over all functional classes except selenocysteine and pyrrolysine tRNA genes (88.1% and 100% respectively). Across bacterial phyla and a wide range of genome sizes, many lineages exist in which predominantly initiator tRNA genes template CCA. Convergent and parallel retention of CCA templating in initiator tRNA genes evolved in independent histories of reductive genome evolution in Bacteria. Also, in a majority of cyanobacterial and actinobacterial genera, predominantly initiator tRNA genes template CCA. We also found that a surprising fraction of archaeal tRNA genes template CCA.ConclusionsWe suggest that cotranscriptional synthesis of initiator tRNA CCA 3' ends can complement inefficient processing of initiator tRNA precursors, "bootstrap" rapid initiation of protein synthesis from a non-growing state, or contribute to an increase in cellular growth rates by reducing overheads of mass and energy to maintain nonfunctional tRNA precursor pools. More generally, CCA templating in structurally non-conforming tRNA genes can afford cells robustness and greater plasticity to respond rapidly to environmental changes and stimuli

    High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource

    Get PDF
    The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed

    Comparative Genomics of a Parthenogenesis-Inducing Wolbachia Symbiont.

    Get PDF
    Wolbachia is an intracellular symbiont of invertebrates responsible for inducing a wide variety of phenotypes in its host. These host-Wolbachia relationships span the continuum from reproductive parasitism to obligate mutualism, and provide a unique system to study genomic changes associated with the evolution of symbiosis. We present the genome sequence from a parthenogenesis-inducing Wolbachia strain (wTpre) infecting the minute parasitoid wasp Trichogramma pretiosum The wTpre genome is the most complete parthenogenesis-inducing Wolbachia genome available to date. We used comparative genomics across 16 Wolbachia strains, representing five supergroups, to identify a core Wolbachia genome of 496 sets of orthologous genes. Only 14 of these sets are unique to Wolbachia when compared to other bacteria from the Rickettsiales. We show that the B supergroup of Wolbachia, of which wTpre is a member, contains a significantly higher number of ankyrin repeat-containing genes than other supergroups. In the wTpre genome, there is evidence for truncation of the protein coding sequences in 20% of ORFs, mostly as a result of frameshift mutations. The wTpre strain represents a conversion from cytoplasmic incompatibility to a parthenogenesis-inducing lifestyle, and is required for reproduction in the Trichogramma host it infects. We hypothesize that the large number of coding frame truncations has accompanied the change in reproductive mode of the wTpre strain

    A genomic view of food-related and probiotic Enterococcus strains

    Get PDF
    The study of enterococcal genomes has grown considerably in recent years. While special attentionis paid to comparative genomic analysis among clinical relevant isolates, in this study we performedan exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential tobe used as probiotics. Beyond common genetic features, we especially aimed to identify those thatare specific to enterococcal strains isolated from a certain food-related source as well as features presentin a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7different species, were examined and compared. Their phylogenetic relationship was reconstructedbased on orthologous proteins and whole genomes. Likewise, markers associated with a successfulcolonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clusteredregularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features.At the same time, a search for antibiotic resistance genes was carried out, since they are of bigconcern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteomeuniversally present among the genera and to determine that most of the accessory genes codefor hypothetical proteins, providing reasonable hints to support their functional characterization.Fil: Bonacina, Julieta. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tucuman. Centro de Referencia Para Lactobacilos; ArgentinaFil: Suárez, Nadia Elina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tucuman. Centro de Referencia Para Lactobacilos; ArgentinaFil: Hormigo, Daniel Ricardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tucuman. Centro de Referencia Para Lactobacilos; ArgentinaFil: Fadda, Silvina G.. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tucuman. Centro de Referencia Para Lactobacilos; ArgentinaFil: Lechner, Marcus. University Marburg; AlemaniaFil: Saavedra, Maria Lucila. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Tucuman. Centro de Referencia Para Lactobacilos; Argentin

    Genomic and experimental evidence for multiple metabolic functions in the RidA/YjgF/YER057c/UK114 (Rid) protein family.

    Get PDF
    BackgroundIt is now recognized that enzymatic or chemical side-reactions can convert normal metabolites to useless or toxic ones and that a suite of enzymes exists to mitigate such metabolite damage. Examples are the reactive imine/enamine intermediates produced by threonine dehydratase, which damage the pyridoxal 5'-phosphate cofactor of various enzymes causing inactivation. This damage is pre-empted by RidA proteins, which hydrolyze the imines before they do harm. RidA proteins belong to the YjgF/YER057c/UK114 family (here renamed the Rid family). Most other members of this diverse and ubiquitous family lack defined functions.ResultsPhylogenetic analysis divided the Rid family into a widely distributed, apparently archetypal RidA subfamily and seven other subfamilies (Rid1 to Rid7) that are largely confined to bacteria and often co-occur in the same organism with RidA and each other. The Rid1 to Rid3 subfamilies, but not the Rid4 to Rid7 subfamilies, have a conserved arginine residue that, in RidA proteins, is essential for imine-hydrolyzing activity. Analysis of the chromosomal context of bacterial RidA genes revealed clustering with genes for threonine dehydratase and other pyridoxal 5'-phosphate-dependent enzymes, which fits with the known RidA imine hydrolase activity. Clustering was also evident between Rid family genes and genes specifying FAD-dependent amine oxidases or enzymes of carbamoyl phosphate metabolism. Biochemical assays showed that Salmonella enterica RidA and Rid2, but not Rid7, can hydrolyze imines generated by amino acid oxidase. Genetic tests indicated that carbamoyl phosphate overproduction is toxic to S. enterica cells lacking RidA, and metabolomic profiling of Rid knockout strains showed ten-fold accumulation of the carbamoyl phosphate-related metabolite dihydroorotate.ConclusionsLike the archetypal RidA subfamily, the Rid2, and probably the Rid1 and Rid3 subfamilies, have imine-hydrolyzing activity and can pre-empt damage from imines formed by amine oxidases as well as by pyridoxal 5'-phosphate enzymes. The RidA subfamily has an additional damage pre-emption role in carbamoyl phosphate metabolism that has yet to be biochemically defined. Finally, the Rid4 to Rid7 subfamilies appear not to hydrolyze imines and thus remain mysterious
    corecore