27,239 research outputs found
Recommended from our members
Shotgun protein sequencing.
A novel experimental and computational technique based on multiple enzymatic digestion of a protein or protein mixture that reconstructs protein sequences from sequences of overlapping peptides is described in this SAND report. This approach, analogous to shotgun sequencing of DNA, is to be used to sequence alternative spliced proteins, to identify post-translational modifications, and to sequence genetically engineered proteins
Recommended from our members
Extensive horizontal gene transfer in cheese-associated bacteria.
Acquisition of genes through horizontal gene transfer (HGT) allows microbes to rapidly gain new capabilities and adapt to new or changing environments. Identifying widespread HGT regions within multispecies microbiomes can pinpoint the molecular mechanisms that play key roles in microbiome assembly. We sought to identify horizontally transferred genes within a model microbiome, the cheese rind. Comparing 31 newly sequenced and 134 previously sequenced bacterial isolates from cheese rinds, we identified over 200 putative horizontally transferred genomic regions containing 4733 protein coding genes. The largest of these regions are enriched for genes involved in siderophore acquisition, and are widely distributed in cheese rinds in both Europe and the US. These results suggest that HGT is prevalent in cheese rind microbiomes, and that identification of genes that are frequently transferred in a particular environment may provide insight into the selective forces shaping microbial communities
PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration
An increasing amount of studies integrate mRNA sequencing data into MS-based proteomics to complement the translation product search space. However, several factors, including extensive regulation of mRNA translation and the need for three- or six-frame-translation, impede the use of mRNA-seq data for the construction of a protein sequence search database. With that in mind, we developed the PROTEOFORMER tool that automatically processes data of the recently developed ribosome profiling method (sequencing of ribosome-protected mRNA fragments), resulting in genome-wide visualization of ribosome occupancy. Our tool also includes a translation initiation site calling algorithm allowing the delineation of the open reading frames (ORFs) of all translation products. A complete protein synthesis-based sequence database can thus be compiled for mass spectrometry-based identification. This approach increases the overall protein identification rates with 3% and 11% (improved and new identifications) for human and mouse, respectively, and enables proteome-wide detection of 5'-extended proteoforms, upstream ORF translation and near-cognate translation start sites. The PROTEOFORMER tool is available as a stand-alone pipeline and has been implemented in the galaxy framework for ease of use
Recommended from our members
The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene content of complex community samples from shotgun metagenomic data, but there has been little work establishing the optimum sequencing depth, data processing and analysis methods for these samples. In this study we used shotgun metagenomics and sequencing of cultured isolates from the same samples to address these issues. We sampled three potential environmental AMR gene reservoirs (pig caeca, river sediment, effluent) and sequenced samples with shotgun metagenomics at high depth (~ 200 million reads per sample). Alongside this, we cultured single-colony isolates of Enterobacteriaceae from the same samples and used hybrid sequencing (short- and long-reads) to create high- quality assemblies for comparison to the metagenomic data. To automate data processing, we developed an open- source software pipeline, âResPipeâ
Recommended from our members
Gut bacteria responding to dietary change encode sialidases that exhibit preference for red meat-associated carbohydrates.
Dietary habits have been associated with alterations of the human gut resident microorganisms contributing to obesity, diabetes and cancer1. In Western diets, red meat is a frequently eaten food2, but long-term consumption has been associated with increased risk of disease3,4. Red meat is enriched in N-glycolylneuraminic acid (Neu5Gc) that cannot be synthesized by humans5. However, consumption can cause Neu5Gc incorporation into cell surface glycans6, especially in carcinomas4,7. As a consequence, an inflammatory response is triggered when Neu5Gc-containing glycans encounter circulating anti-Neu5Gc antibodies8,9. Although bacteria can use free sialic acids as a nutrient source10-12, it is currently unknown if gut microorganisms contribute to releasing Neu5Gc from food. We found that a Neu5Gc-rich diet induces changes in the gut microbiota, with Bacteroidales and Clostridiales responding the most. Genome assembling of mouse and human shotgun metagenomic sequencing identified bacterial sialidases with previously unobserved substrate preference for Neu5Gc-containing glycans. X-ray crystallography revealed key amino acids potentially contributing to substrate preference. Additionally, we verified that mouse and human sialidases were able to release Neu5Gc from red meat. The release of Neu5Gc from red meat using bacterial sialidases could reduce the risk of inflammatory diseases associated with red meat consumption, including colorectal cancer4 and atherosclerosis13
Reference genome and comparative genome analysis for the WHO reference strain for Mycobacterium bovis BCG Danish, the present tuberculosis vaccine
Background: Mycobacterium bovis bacillus Calmette-Guerin (M. bovis BCG) is the only vaccine available against tuberculosis (TB). In an effort to standardize the vaccine production, three substrains, i.e. BCG Danish 1331, Tokyo 172-1 and Russia BCG-1 were established as the WHO reference strains. Both for BCG Tokyo 172-1 as Russia BCG-1, reference genomes exist, not for BCG Danish. In this study, we set out to determine the completely assembled genome sequence for BCG Danish and to establish a workflow for genome characterization of engineering-derived vaccine candidate strains.ResultsBy combining second (Illumina) and third (PacBio) generation sequencing in an integrated genome analysis workflow for BCG, we could construct the completely assembled genome sequence of BCG Danish 1331 (07/270) (and an engineered derivative that is studied as an improved vaccine candidate, a SapM KO), including the resolution of the analytically challenging long duplication regions. We report the presence of a DU1-like duplication in BCG Danish 1331, while this tandem duplication was previously thought to be exclusively restricted to BCG Pasteur. Furthermore, comparative genome analyses of publicly available data for BCG substrains showed the absence of a DU1 in certain BCG Pasteur substrains and the presence of a DU1-like duplication in some BCG China substrains. By integrating publicly available data, we provide an update to the genome features of the commonly used BCG strains.
Conclusions: We demonstrate how this analysis workflow enables the resolution of genome duplications and of the genome of engineered derivatives of the BCG Danish vaccine strain. The BCG Danish WHO reference genome will serve as a reference for future engineered strains and the established workflow can be used to enhance BCG vaccine standardization
Recommended from our members
Shotgun metagenomic analysis of microbial communities from the Loxahatchee nature preserve in the Florida Everglades.
BackgroundCurrently, much is unknown about the taxonomic diversity and the mechanisms of methane metabolism in the Florida Everglades ecosystem. The Loxahatchee National Wildlife Refuge is a section of the Florida Everglades that is almost entirely unstudied in regard to taxonomic profiling. This short report analyzes the metagenome of soil samples from this Refuge to investigate the predominant taxa, as well as the abundance of genes involved in environmentally significant metabolic pathways related to methane production (nitrogen fixation and dissimilatory sulfite reduction).MethodsShotgun metagenomic sequencing using the Illumina platform was performed on 17 soil samples from four different sites within the Loxahatchee National Wildlife Refuge, and underwent quality control, assembly, and annotation. The soil from each sample was tested for water content and concentrations of organic carbon and nitrogen.ResultsThe three most common phyla of bacteria for every site were Actinobacteria, Acidobacteria, and Proteobacteria; however, there was variation in relative phylum composition. The most common phylum of Archaea was Euryarchaeota for all sites. Alpha and beta diversity analyses indicated significant congruity in taxonomic diversity in most samples from Sites 1, 3, and 4 and negligible congruity between Site 2 and the other sites. Shotgun metagenomic sequencing revealed the presence of biogeochemical biomarkers of particular interest (e.g., mrcA, nifH, and dsrB) within the samples. The normalized abundances of mcrA, nifH, and dsrB exhibited a positive correlation with nitrogen concentration and water content, and a negative correlation with organic carbon concentration.ConclusionThis Everglades soil metagenomic study allowed examination of wetlands biological processes and showed expected correlations between measured organic constituents and prokaryotic gene frequency. Additionally, the taxonomic profile generated gives a basis for the diversity of prokaryotic microbial life throughout the Everglades
Mass spectrometry and ribosome profiling, a perfect combination towards a more comprehensive identification strategy of true in vivo protein forms
An increasing number of studies involve integrative analysis of gene and protein expression data, taking advantage of new technologies such as next-generation transcriptome sequencing (RNA-Seq) and highly sensitive mass spectrometry (MS). Recently, a strategy, termed ribosome profiling, based on deep sequencing of ribosome-protected mRNA fragments, indirectly monitoring protein synthesis, has been described. In contrast to routinely employed protein databases in proteomics searches, RIBO-seq derived data gives a more representative expression state and accounts for sequence variation information and alternative translation initiation.
To verify the potential of ribosome profiling in providing us with a true snapshot of the translational landscape, we devised a proteogenomic approach generating a database of translation products based on ribosome profiling experiments. The raw and untreated RIBO-seq data is analyzed for both splice isoforms and single nucleotide polymorphisms, as such taking into account transcriptional variation. Next to that, RIBO-seq data for translation start site discovery (treated with harringtonine, lactomidomycin or puromycin) is used to obtain a genome wide blueprint of all possible translation initiation sites and as such taking into account translation variation. By adding protein-DB annotation to the genomic RIBO-seq derived data and after in silico translation a protein database is constructed reflecting the full complexity of the proteome.
Using a first version of our proteogenomic approach on an undifferentiated mouse embryonic stem cell line (E14) we could demonstrate an increase of the overall protein identification rate with 2.5% as compared to only searching UniProtKB-SwissProt. Furthermore, identification of N-terminal COFRADIC data resulted in detection of 16 alternative start sites giving rise to N-terminally extended protein variants besides the identification of four translated uORFs
Recommended from our members
Gut Microbial Metabolism and Nonalcoholic Fatty Liver Disease.
The gut microbiome, the multispecies community of microbes that exists in the gastrointestinal tract, encodes several orders of magnitude more functional genes than the human genome. It also plays a pivotal role in human health, in part due to metabolism of environmental, dietary, and host-derived substrates, which produce bioactive metabolites. Perturbations to the composition and associated metabolic output of the gut microbiome have been associated with a number of chronic liver diseases, including nonalcoholic fatty liver disease (NAFLD). Here, we review the rapidly evolving suite of next-generation techniques used for studying gut microbiome composition, functional gene content, and bioactive products and discuss relationships with the pathogenesis of NAFLD
- âŠ