775 research outputs found

    The Effects of Alignment Quality, Distance Calculation Method, Sequence Filtering, and Region on the Analysis of 16S rRNA Gene-Based Studies

    Pyrosequencing of PCR-amplified fragments that target variable regions within the 16S rRNA gene has quickly become a powerful method for analyzing the membership and structure of microbial communities. This approach has revealed and introduced questions that were not fully appreciated by those carrying out traditional Sanger sequencing-based methods. These include the effects of alignment quality, the best method of calculating pairwise genetic distances for 16S rRNA genes, whether it is appropriate to filter variable regions, and how the choice of variable region relates to the genetic diversity observed in full-length sequences. I used a diverse collection of 13,501 high-quality full-length sequences to assess each of these questions. First, alignment quality had a significant impact on distance values and downstream analyses. Specifically, the greengenes alignment, which does a poor job of aligning variable regions, predicted higher genetic diversity, richness, and phylogenetic diversity than the SILVA and RDP-based alignments. Second, the effect of different gap treatments in determining pairwise genetic distances was strongly affected by the variation in sequence length for a region; however, the effect of different calculation methods was subtle when determining the sample's richness or phylogenetic diversity for a region. Third, applying a sequence mask to remove variable positions had a profound impact on genetic distances by muting the observed richness and phylogenetic diversity. Finally, the genetic distances calculated for each of the variable regions did a poor job of correlating with the full-length gene. Thus, while it is tempting to apply traditional cutoff levels derived for full-length sequences to these shorter sequences, it is not advisable. Analysis of β-diversity metrics showed that each of these factors can have a significant impact on the comparison of community membership and structure. 
    Taken together, these results urge caution in the design and interpretation of analyses using pyrosequencing data.
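
The effect of gap treatment on pairwise distances described in this abstract can be illustrated with a minimal sketch. The function name `pairwise_distance` and the three mode names (`"ignore"`, `"each"`, `"one"`) are assumptions made for illustration and are not taken from the paper. The sketch computes an uncorrected distance between two pre-aligned sequences and, as a simplification, treats any consecutive run of gapped columns as a single event in `"one"` mode.

```python
def pairwise_distance(a, b, gap_mode="ignore"):
    """Uncorrected distance between two aligned sequences of equal length.

    gap_mode (illustrative names, not from the paper):
      "ignore" - columns with a gap in either sequence are skipped
      "each"   - every gapped column counts as one mismatch
      "one"    - a run of consecutive gapped columns counts as one mismatch
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if gap_mode not in ("ignore", "each", "one"):
        raise ValueError("unknown gap_mode")

    mismatches = 0
    compared = 0
    in_gap_run = False
    for x, y in zip(a, b):
        x_gap, y_gap = x == "-", y == "-"
        if x_gap and y_gap:
            continue  # gap in both sequences: no information
        if x_gap or y_gap:
            if gap_mode == "ignore":
                continue
            if gap_mode == "one" and in_gap_run:
                continue  # still inside a run that was already counted
            in_gap_run = True
            mismatches += 1
            compared += 1
        else:
            in_gap_run = False
            compared += 1
            if x != y:
                mismatches += 1
    return mismatches / compared if compared else 0.0


seq_a = "ACGT--ACGT"
seq_b = "ACGTTTACGA"
for mode in ("ignore", "each", "one"):
    print(mode, round(pairwise_distance(seq_a, seq_b, mode), 3))
```

For this toy pair the three modes give 0.125, 0.3 and about 0.222, which shows how a single two-base indel changes the distance depending on the treatment chosen.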

    Robust estimation of microbial diversity in theory and in practice

    Quantifying diversity is of central importance for the study of structure, function and evolution of microbial communities. The estimation of microbial diversity has received renewed attention with the advent of large-scale metagenomic studies. Here, we consider what the diversity observed in a sample tells us about the diversity of the community being sampled. First, we argue that one cannot reliably estimate the absolute and relative number of microbial species present in a community without making unsupported assumptions about species abundance distributions. The reason for this is that sample data do not contain information about the number of rare species in the tail of species abundance distributions. We illustrate the difficulty in comparing species richness estimates by applying Chao's estimator of species richness to a set of in silico communities: they are ranked incorrectly in the presence of large numbers of rare species. Next, we extend our analysis to a general family of diversity metrics ("Hill diversities"), and construct lower and upper estimates of diversity values consistent with the sample data. The theory generalizes Chao's estimator, which we retrieve as the lower estimate of species richness. We show that Shannon and Simpson diversity can be robustly estimated for the in silico communities. We analyze nine metagenomic data sets from a wide range of environments, and show that our findings are relevant for empirically-sampled communities. Hence, we recommend the use of Shannon and Simpson diversity rather than species richness in efforts to quantify and compare microbial diversity. Comment: To be published in The ISME Journal. Main text: 16 pages, 5 figures. Supplement: 16 pages, 4 figures.
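
The quantities named in this abstract can be sketched in a few lines. The example below is a minimal illustration, assuming the classic Chao1 formula and simple plug-in Hill numbers computed from observed counts. It does not implement the lower and upper estimates that the paper constructs, and the function names `chao1` and `hill_number` are chosen here for illustration.

```python
import math


def chao1(counts):
    """Classic Chao1 richness estimate from per-species counts.

    Uses S_obs + f1^2 / (2 * f2), falling back to the bias-corrected
    form S_obs + f1 * (f1 - 1) / 2 when there are no doubletons.
    """
    counts = [c for c in counts if c > 0]
    s_obs = len(counts)
    f1 = sum(1 for c in counts if c == 1)  # singletons
    f2 = sum(1 for c in counts if c == 2)  # doubletons
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    return s_obs + f1 * (f1 - 1) / 2


def hill_number(counts, q):
    """Plug-in Hill number of order q from per-species counts.

    q = 0 gives observed richness, q = 1 the exponential of Shannon
    entropy, and q = 2 the inverse Simpson index.
    """
    counts = [c for c in counts if c > 0]
    total = sum(counts)
    props = [c / total for c in counts]
    if q == 1:
        return math.exp(-sum(p * math.log(p) for p in props))
    return sum(p ** q for p in props) ** (1 / (1 - q))


sample = [10, 5, 2, 2, 1, 1, 1]
print("Chao1:", chao1(sample))
for order in (0, 1, 2):
    print("Hill order", order, "=", round(hill_number(sample, order), 3))
```

Orders 1 and 2 weight species by abundance, so rare species in the unseen tail contribute little to them. This matches the abstract's recommendation of Shannon and Simpson diversity over richness.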

    Prospecting environmental mycobacteria: combined molecular approaches reveal unprecedented diversity

    Background: Environmental mycobacteria (EM) include species commonly found in various terrestrial and aquatic environments, encompassing animal and human pathogens in addition to saprophytes. Approximately 150 EM species can be separated into fast and slow growers based on sequence and copy number differences of their 16S rRNA genes. Cultivation methods are not appropriate for diversity studies; few studies have investigated EM diversity in soil despite their importance as potential reservoirs of pathogens and their hypothesized role in masking or blocking the M. bovis BCG vaccine. Methods: We report here the development, optimization and validation of molecular assays targeting the 16S rRNA gene to assess diversity and prevalence of fast- and slow-growing EM in representative soils from semi-tropical and temperate areas. New primer sets were also designed to uniquely target slow-growing mycobacteria and were used with PCR-DGGE, tag-encoded Titanium amplicon pyrosequencing and quantitative PCR. Results: PCR-DGGE and pyrosequencing provided a consensus of EM diversity; for example, a high abundance of pyrosequencing reads and DGGE bands corresponded to M. moriokaense, M. colombiense and M. riyadhense. As expected, pyrosequencing provided more comprehensive information; additional prevalent species included M. chlorophenolicum, M. neglectum, M. gordonae and M. aemonae. Prevalence of the total Mycobacterium genus in the soil samples ranged from 2.3×10⁷ to 2.7×10⁸ gene targets g⁻¹; the prevalence of slow growers ranged from 2.9×10⁵ to 1.2×10⁷ cells g⁻¹. Conclusions: This combined molecular approach enabled an unprecedented qualitative and quantitative assessment of EM across soil samples. Good concordance was found between methods and the bioinformatics analysis was validated by random resampling.
    Sequences from most pathogenic groups associated with slow growth were identified in extenso in all soils tested with a specific assay, allowing them to be unmasked from the whole Mycobacterium genus, in which, as minority members, they would have remained undetected.

    Ovine pedomics: the first study of the ovine foot 16S rRNA-based microbiome

    We report the first study of the bacterial microbiome of ovine interdigital skin based on 16S rRNA by pyrosequencing and conventional cloning with Sanger-sequencing. Three flocks were selected: one flock with no signs of footrot or interdigital dermatitis, a second flock with interdigital dermatitis alone and a third flock with both interdigital dermatitis and footrot. The sheep were classified as having either healthy interdigital skin (H), interdigital dermatitis (ID) or virulent footrot (VFR). The ovine interdigital skin bacterial community varied significantly by flock and clinical condition. The diversity and richness of operational taxonomic units was greater in tissue from sheep with ID than in tissue from H or VFR affected sheep. Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria were the most abundant phyla, comprising 25 genera. Peptostreptococcus, Corynebacterium and Staphylococcus were associated with H, ID and VFR respectively. Sequences of Dichelobacter nodosus, the causal agent of ovine footrot, were not amplified due to mismatches in the 16S rRNA universal forward primer (27F). A specific real time PCR assay was used to demonstrate the presence of D. nodosus, which was detected in all samples including the flock with no signs of ID or VFR. Sheep with ID had significantly higher numbers of D. nodosus (10⁴–10⁹ cells/g tissue) than those with H or VFR feet.

    Dirichlet Multinomial Mixtures: Generative Models for Microbial Metagenomics

    We introduce Dirichlet multinomial mixtures (DMM) for the probabilistic modelling of microbial metagenomics data. This data can be represented as a frequency matrix giving the number of times each taxa is observed in each sample. The samples have different size, and the matrix is sparse, as communities are diverse and skewed to rare taxa. Most methods used previously to classify or cluster samples have ignored these features. We describe each community by a vector of taxa probabilities. These vectors are generated from one of a finite number of Dirichlet mixture components each with different hyperparameters. Observed samples are generated through multinomial sampling. The mixture components cluster communities into distinct ‘metacommunities’, and, hence, determine envirotypes or enterotypes, groups of communities with a similar composition. The model can also deduce the impact of a treatment and be used for classification. We wrote software for the fitting of DMM models using the ‘evidence framework’ (http://code.google.com/p/microbedmm/). This includes the Laplace approximation of the model evidence. We applied the DMM model to human gut microbe genera frequencies from Obese and Lean twins. From the model evidence four clusters fit this data best. Two clusters were dominated by Bacteroides and were homogenous; two had a more variable community composition. We could not find a significant impact of body mass on community structure. However, Obese twins were more likely to derive from the high variance clusters. We propose that obesity is not associated with a distinct microbiota but increases the chance that an individual derives from a disturbed enterotype. This is an example of the ‘Anna Karenina principle (AKP)’ applied to microbial communities: disturbed states having many more configurations than undisturbed. 
    We verify this by showing that in a study of inflammatory bowel disease (IBD) phenotypes, ileal Crohn's disease (ICD) is associated with a more variable community.
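
The generative process described in this abstract (choose a mixture component, draw taxa probabilities from its Dirichlet distribution, then draw reads by multinomial sampling) can be sketched as below. This is only an illustration of the forward model under assumed values: the function names, mixture weights and hyperparameters are hypothetical, and the sketch covers neither model fitting nor the Laplace approximation of the evidence used by the authors' software.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one probability vector from Dirichlet(alpha) via gamma variates."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_community(weights, alphas, n_reads, rng):
    """Generate one sample (component index, taxa counts) from a DMM.

    weights : mixture weights, one per component (hypothetical values)
    alphas  : Dirichlet hyperparameters, one vector per component
    n_reads : number of reads, i.e. the sample size
    """
    # 1. Pick the metacommunity (mixture component).
    k = rng.choices(range(len(weights)), weights=weights)[0]
    # 2. Draw this community's taxa probabilities.
    probs = sample_dirichlet(alphas[k], rng)
    # 3. Multinomial sampling of reads into taxa.
    counts = [0] * len(probs)
    for taxon in rng.choices(range(len(probs)), weights=probs, k=n_reads):
        counts[taxon] += 1
    return k, counts


rng = random.Random(0)
weights = [0.5, 0.5]
# Component 0: large hyperparameters, so communities vary little.
# Component 1: small hyperparameters, so communities vary widely.
alphas = [[50.0, 30.0, 20.0], [0.5, 0.3, 0.2]]
for _ in range(4):
    print(sample_community(weights, alphas, n_reads=100, rng=rng))
```

Components with small hyperparameters produce highly variable communities, which is one way to picture the high-variance clusters that the abstract contrasts with the homogeneous ones.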

    Timbre, Genre, and Polystylism in Sonic the Hedgehog 3

    In the soundtrack for the Sega Genesis game Sonic the Hedgehog 3 (1992), the genres represented include calypso, funk, carnival, new wave, prog rock, and more. Soundtracks for video games frequently shift genres this way, to create aesthetic themes for different levels or characters. Turning toward an account of the game’s soundtrack as a unified and continuous work, I posit that the music of Sonic the Hedgehog 3 might be understood as analogous to a series of “samples” within a polystylistic whole, following Leydon 2010. Leydon notes that instrumentation “bears the bulk of the semiotic burden” in communicating genre, but stops short of detailing how different instrumental timbres themselves might signify these genres. In my close analysis of two specific levels from Sonic the Hedgehog 3—Ice Cap Zone and Marble Garden Zone—I detail how timbre, as a musical parameter separate from instrumentation, can evoke specific inter-textual and extramusical associations from a listener, based on implied genres in the soundtrack. In doing this, I will show how timbre, a musical parameter that remains overlooked in a great deal of music analysis, might inform and enhance dialogue in music analyses of genre within video game music and more broadly.

    The Diversity of Coral Reefs: What Are We Missing?

    Tropical reefs shelter one quarter to one third of all marine species but one third of the coral species that construct reefs are now at risk of extinction. Because traditional methods for assessing reef diversity are extremely time consuming, taxonomic expertise for many groups is lacking, and marine organisms are thought to be less vulnerable to extinction, most discussions of reef conservation focus on maintenance of ecosystem services rather than biodiversity loss. In this study involving the three major oceans with reef growth, we provide new biodiversity estimates based on quantitative sampling and DNA barcoding. We focus on crustaceans, which are the second most diverse group of marine metazoans. We show exceptionally high numbers of crustacean species associated with coral reefs relative to sampling effort (525 species from a combined, globally distributed sample area of 6.3 m²). The high prevalence of rare species (38% encountered only once), the low level of spatial overlap (81% found in only one locality) and the biogeographic patterns of diversity detected (Indo-West Pacific > Central Pacific > Caribbean) are consistent with results from traditional survey methods, making this approach a reliable and efficient method for assessing and monitoring biodiversity. The finding of such large numbers of species in a small total area suggests that coral reef diversity is seriously under-detected using traditional survey methods, and by implication, underestimated.

    CORE: A Phylogenetically-Curated 16S rDNA Database of the Core Oral Microbiome

    Comparing bacterial 16S rDNA sequences to GenBank and other large public databases via BLAST often provides results of little use for identification and taxonomic assignment of the organisms of interest. The human microbiome, and in particular the oral microbiome, includes many taxa, and accurate identification of sequence data is essential for studies of these communities. For this purpose, a phylogenetically curated 16S rDNA database of the core oral microbiome, CORE, was developed. The goal was to include a comprehensive and minimally redundant representation of the bacteria that regularly reside in the human oral cavity with computationally robust classification at the level of species and genus. Clades of cultivated and uncultivated taxa were formed based on sequence analyses using multiple criteria, including maximum-likelihood-based topology and bootstrap support, genetic distance, and previous naming. A number of classification inconsistencies for previously named species, especially at the level of genus, were resolved. The performance of the CORE database for identifying clinical sequences was compared to that of three publicly available databases, GenBank nr/nt, RDP and HOMD, using a set of sequencing reads that had not been used in creation of the database. CORE offered improved performance compared to other public databases for identification of human oral bacterial 16S sequences by a number of criteria. In addition, the CORE database and phylogenetic tree provide a framework for measures of community divergence, and the focused size of the database offers advantages of efficiency for BLAST searching of large datasets. The CORE database is available as a searchable interface and for download at http://microbiome.osu.edu

    Amp-PCR: Combining a Random Unbiased Phi29-Amplification with a Specific Real-Time PCR, Performed in One Tube to Increase PCR Sensitivity

    In clinical situations where a diagnostic real-time PCR assay is not sensitive enough, leading to low or falsely negative results, or where detection earlier in a disease progression would benefit the patient, an unbiased pre-amplification prior to the real-time PCR could be beneficial. In Amp-PCR, an unbiased random Phi29 pre-amplification is combined with a specific real-time PCR reaction. The two reactions are separated physically by a wax layer (AmpliWax®) and are run in sequence in the same sealed tube. Amp-PCR can increase the specific PCR signal at least 100×10⁶-fold and make it possible to detect positive samples normally under the detection limit of the specific real-time PCR. The risk of contamination is eliminated and Amp-PCR could replace nested PCR in situations where increased sensitivity is needed, e.g. in routine PCR diagnostic analysis. We show that Amp-PCR works on clinical samples containing circular and linear viral dsDNA genomes, and it can work well on DNA of any origin, both from non-cellular (virus) and cellular sources (bacteria, archaea, eukaryotes).