174 research outputs found

    Benchmarking viromics: An in silico

    Get PDF
    Background Viral metagenomics (viromics) is increasingly used to obtain uncultivated viral genomes, evaluate community diversity, and assess ecological hypotheses. While viromic experimental methods are relatively mature and widely accepted by the research community, robust bioinformatics standards remain to be established. Here we used in silico mock viral communities to evaluate the viromic sequence-to-ecological-inference pipeline, including (i) read pre-processing and metagenome assembly, (ii) thresholds applied to estimate viral relative abundances based on read mapping to assembled contigs, and (iii) normalization methods applied to the matrix of viral relative abundances for alpha and beta diversity estimates. Results Tools specifically designed for metagenomes, specifically metaSPAdes, MEGAHIT, and IDBA-UD, were the most effective at assembling viromes. Read pre-processing, such as partitioning, had virtually no impact on assembly output, but may be useful when hardware is limited. Viral populations with 2–5 × coverage typically assembled well, whereas lesser coverage led to fragmented assembly. Strain heterogeneity within populations hampered assembly, especially when strains were closely related (average nucleotide identity, or ANI ≥97%) and when the most abundant strain represented <50% of the population. Viral community composition assessments based on read recruitment were generally accurate when the following thresholds for detection were applied: (i) ≥10 kb contig lengths to define populations, (ii) coverage defined from reads mapping at ≥90% identity, and (iii) ≥75% of contig length with ≥1 × coverage. Finally, although data are limited to the most abundant viruses in a community, alpha and beta diversity patterns were robustly estimated (±10%) when comparing samples of similar sequencing depth, but more divergent (up to 80%) when sequencing depth was uneven across the dataset. In the latter cases, the use of normalization methods specifically developed for metagenomes provided the best estimates. Conclusions These simulations provide benchmarks for selecting analysis cut-offs and establish that an optimized sample-to-ecological-inference viromics pipeline is robust for making ecological inferences from natural viral communities. Continued development to better accessing RNA, rare, and/or diverse viral populations and improved reference viral genome availability will alleviate many of viromics remaining limitations

    Substrate availability and not thermal acclimation controls microbial temperature sensitivity response to long‐term warming

    Get PDF
    Microbes are responsible for cycling carbon (C) through soils, and predicted changes in soil C stocks under climate change are highly sensitive to shifts in the mechanisms assumed to control the microbial physiological response to warming. Two mechanisms have been suggested to explain the long-term warming impact on microbial physiology: microbial thermal acclimation and changes in the quantity and quality of substrates available for microbial metabolism. Yet studies disentangling these two mechanisms are lacking. To resolve the drivers of changes in microbial physiology in response to long-term warming, we sampled soils from 13- and 28-year-old soil warming experiments in different seasons. We performed short-term laboratory incubations across a range of temperatures to measure the relationships between temperature sensitivity of physiology (growth, respiration, carbon use efficiency, and extracellular enzyme activity) and the chemical composition of soil organic matter. We observed apparent thermal acclimation of microbial respiration, but only in summer, when warming had exacerbated the seasonally-induced, already small dissolved organic matter pools. Irrespective of warming, greater quantity and quality of soil carbon increased the extracellular enzymatic pool and its temperature sensitivity. We propose that fresh litter input into the system seasonally cancels apparent thermal acclimation of C-cycling processes to decadal warming. Our findings reveal that long-term warming has indirectly affected microbial physiology via reduced C availability in this system, implying that earth system models including these negative feedbacks may be best suited to describe long-term warming effects on these soils

    Position-Specific Metabolic Probing and Metagenomics of Microbial Communities Reveal Conserved Central Carbon Metabolic Network Activities at High Temperatures

    Full text link
    Temperature is a primary driver of microbial community composition and taxonomic diversity; however, it is unclear to what extent temperature affects characteristics of central carbon metabolic pathways (CCMPs) at the community level. In this study, 16S rRNA gene amplicon and metagenome sequencing were combined with 13C-labeled metabolite probing of the CCMPs to assess community carbon metabolism along a temperature gradient (60–95°C) in Great Boiling Spring, NV. 16S rRNA gene amplicon diversity was inversely proportional to temperature, and Archaea were dominant at higher temperatures. KO richness and diversity were also inversely proportional to temperature, yet CCMP genes were similarly represented across the temperature gradient and many individual metagenome-assembled genomes had complete pathways. In contrast, genes encoding cellulosomes and many genes involved in plant matter degradation and photosynthesis were absent at higher temperatures. In situ 13C-CO2 production from labeled isotopomer pairs of glucose, pyruvate, and acetate suggested lower relative oxidative pentose phosphate pathway activity and/or fermentation at 60°C, and a stable or decreased maintenance energy demand at higher temperatures. Catabolism of 13C-labeled citrate, succinate, L-alanine, L-serine, and L-cysteine was observed at 85°C, demonstrating broad heterotrophic activity and confirming functioning of the TCA cycle. Together, these results suggest that temperature-driven losses in biodiversity and gene content in geothermal systems may not alter CCMP function or maintenance energy demands at a community level

    Insights into the Dynamics Between Viruses and their Hosts in a Hot Spring Microbial Mat

    Get PDF
    © 2020, The Author(s). Our current knowledge of host–virus interactions in biofilms is limited to computational predictions based on laboratory experiments with a small number of cultured bacteria. However, natural biofilms are diverse and chiefly composed of uncultured bacteria and archaea with no viral infection patterns and lifestyle predictions described to date. Herein, we predict the first DNA sequence-based host–virus interactions in a natural biofilm. Using single-cell genomics and metagenomics applied to a hot spring mat of the Cone Pool in Mono County, California, we provide insights into virus–host range, lifestyle and distribution across different mat layers. Thirty-four out of 130 single cells contained at least one viral contig (26%), which, together with the metagenome-assembled genomes, resulted in detection of 59 viruses linked to 34 host species. Analysis of single-cell amplification kinetics revealed a lack of active viral replication on the single-cell level. These findings were further supported by mapping metagenomic reads from different mat layers to the obtained host–virus pairs, which indicated a low copy number of viral genomes compared to their hosts. Lastly, the metagenomic data revealed high layer specificity of viruses, suggesting limited diffusion to other mat layers. Taken together, these observations indicate that in low mobility environments with high microbial abundance, lysogeny is the predominant viral lifestyle, in line with the previously proposed “Piggyback-the-Winner” theory

    Optimizing de novo genome assembly from PCR-amplified metagenomes

    Get PDF
    Background Metagenomics has transformed our understanding of microbial diversity across ecosystems, with recent advances enabling de novo assembly of genomes from metagenomes. These metagenome-assembled genomes are critical to provide ecological, evolutionary, and metabolic context for all the microbes and viruses yet to be cultivated. Metagenomes can now be generated from nanogram to subnanogram amounts of DNA. However, these libraries require several rounds of PCR amplification before sequencing, and recent data suggest these typically yield smaller and more fragmented assemblies than regular metagenomes. Methods Here we evaluate de novo assembly methods of 169 PCR-amplified metagenomes, including 25 for which an unamplified counterpart is available, to optimize specific assembly approaches for PCR-amplified libraries. We first evaluated coverage bias by mapping reads from PCR-amplified metagenomes onto reference contigs obtained from unamplified metagenomes of the same samples. Then, we compared different assembly pipelines in terms of assembly size (number of bp in contigs ≥ 10 kb) and error rates to evaluate which are the best suited for PCR-amplified metagenomes. Results Read mapping analyses revealed that the depth of coverage within individual genomes is significantly more uneven in PCR-amplified datasets versus unamplified metagenomes, with regions of high depth of coverage enriched in short inserts. This enrichment scales with the number of PCR cycles performed, and is presumably due to preferential amplification of short inserts. Standard assembly pipelines are confounded by this type of coverage unevenness, so we evaluated other assembly options to mitigate these issues. We found that a pipeline combining read deduplication and an assembly algorithm originally designed to recover genomes from libraries generated after whole genome amplification (single-cell SPAdes) frequently improved assembly of contigs ≥10 kb by 10 to 100-fold for low input metagenomes. Conclusions PCR-amplified metagenomes have enabled scientists to explore communities traditionally challenging to describe, including some with extremely low biomass or from which DNA is particularly difficult to extract. Here we show that a modified assembly pipeline can lead to an improved de novo genome assembly from PCR-amplified datasets, and enables a better genome recovery from low input metagenomes
    corecore