
    Viral population estimation using pyrosequencing

    The diversity of virus populations within single infected hosts presents a major difficulty for the natural immune response as well as for vaccine design and antiviral drug therapy. Recently developed pyrophosphate-based sequencing technologies (pyrosequencing) can be used to quantify this diversity by ultra-deep sequencing of virus samples. We present computational methods for the analysis of such sequence data and apply these techniques to pyrosequencing data obtained from HIV populations within patients harboring drug-resistant virus strains. Our main result is the estimation of the population structure of the sample from the pyrosequencing reads. This inference is based on a statistical approach to error correction, followed by a combinatorial algorithm for constructing a minimal set of haplotypes that explain the data. Using this set of explaining haplotypes, we apply a statistical model to infer the frequencies of the haplotypes in the population via an EM algorithm. We demonstrate that pyrosequencing reads allow for effective population reconstruction by extensive simulations and by comparison to 165 sequences obtained directly from clonal sequencing of four independent, diverse HIV populations. Thus, pyrosequencing can be used for cost-effective estimation of the structure of virus populations, promising new insights into viral evolutionary dynamics and disease control strategies. Comment: 23 pages, 13 figures
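
    The frequency-estimation step described in this abstract can be illustrated with a minimal EM sketch. It is not the authors' model: it assumes error-free reads, a fixed candidate haplotype set, and uniform read sampling. The function name `em_haplotype_frequencies` and the `compat` input (for each read, the indices of the haplotypes it is consistent with) are hypothetical names chosen for illustration.

```python
def em_haplotype_frequencies(compat, n_hap, n_iter=500, tol=1e-12):
    """Estimate haplotype frequencies from read/haplotype compatibility.

    compat[r] lists the haplotype indices that read r is consistent with
    (assumption: reads are error-free and sampled uniformly).
    """
    freqs = [1.0 / n_hap] * n_hap  # start from the uniform distribution
    for _ in range(n_iter):
        counts = [0.0] * n_hap
        # E-step: split each read among its compatible haplotypes
        # in proportion to the current frequency estimates.
        for row in compat:
            total = sum(freqs[h] for h in row)
            for h in row:
                counts[h] += freqs[h] / total
        # M-step: new frequencies are the normalised expected counts.
        new_freqs = [c / len(compat) for c in counts]
        change = max(abs(a - b) for a, b in zip(new_freqs, freqs))
        freqs = new_freqs
        if change < tol:
            break
    return freqs


# Six reads match only haplotype 0, two match only haplotype 1,
# and two are ambiguous between both.
reads = [[0]] * 6 + [[1]] * 2 + [[0, 1]] * 2
estimated = em_haplotype_frequencies(reads, n_hap=2)
```

    In this toy input the ambiguous reads carry no information about the ratio, so the estimate settles at the ratio set by the unambiguous reads (6:2).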

    PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets

    As 16S rRNA gene targeted massively parallel sequencing has become a common tool for microbial diversity investigations, numerous advances have been made to minimize the influence of sequencing and chimeric PCR artifacts through rigorous quality control measures. However, there has been little effort towards understanding the effect of multi-template PCR biases on microbial community structure. In this study, we used three bacterial and three archaeal mock communities consisting of, respectively, 33 bacterial and 24 archaeal 16S rRNA gene sequences combined in different proportions to compare the influences of (1) sequencing depth, (2) sequencing artifacts (sequencing errors and chimeric PCR artifacts), and (3) biases in multi-template PCR, towards the interpretation of community structure in pyrosequencing datasets. We also assessed the influence of each of these three variables on α- and β-diversity metrics that rely on the number of OTUs alone (richness) and those that include both membership and the relative abundance of detected OTUs (diversity). As part of this study, we redesigned bacterial and archaeal primer sets that target the V3–V5 region of the 16S rRNA gene, along with multiplexing barcodes, to permit simultaneous sequencing of PCR products from the two domains. We conclude that the benefits of deeper sequencing efforts extend beyond greater OTU detection and result in higher precision in β-diversity analyses by reducing the variability between replicate libraries, despite the presence of more sequencing artifacts. Additionally, spurious OTUs resulting from sequencing errors have a significant impact on richness or shared-richness based α- and β-diversity metrics, whereas metrics that utilize community structure (including both richness and relative abundance of OTUs) are minimally affected by spurious OTUs. 
    However, the greatest obstacle to accurately evaluating community structure is the error in the estimated mean relative abundance of each detected OTU caused by biases in multi-template PCR.
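
    The contrast this abstract draws between richness-based metrics and metrics that include relative abundance can be illustrated with a small sketch. The OTU counts below are invented for illustration and are not taken from the study; the Shannon index is used here as one common abundance-aware metric, which is an assumption about which metrics the authors evaluated.

```python
import math


def richness(counts):
    """Number of OTUs observed at least once."""
    return sum(1 for c in counts if c > 0)


def shannon(counts):
    """Shannon diversity index (natural log) from OTU read counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


# Hypothetical community: three real OTUs.
true_counts = [500, 300, 200]
# Same community plus ten spurious singleton OTUs from sequencing errors.
noisy_counts = true_counts + [1] * 10

richness_true, richness_noisy = richness(true_counts), richness(noisy_counts)
shannon_true, shannon_noisy = shannon(true_counts), shannon(noisy_counts)
relative_shift = (shannon_noisy - shannon_true) / shannon_true
```

    Richness jumps from 3 to 13 OTUs, while the Shannon index shifts by well under a tenth of its value, because each singleton contributes very little relative abundance.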

    Bacterial exchange in household washing machines

    Household washing machines (WMs) launder soiled clothes and textiles, but do not sterilize them. We investigated the microbial exchange occurring in five household WMs. Samples from a new cotton T-shirt were laundered together with a normal laundry load. Analyses were performed on the influent water and the ingoing cotton samples, as well as the greywater and the washed cotton samples. The number of living bacteria was generally not lower in the WM effluent water than in the influent water. The laundering process caused a microbial exchange of influent water bacteria, skin- and clothes-related bacteria, and biofilm-related bacteria in the WM. A variety of biofilm-producing bacteria were enriched in the effluent after laundering, although their presence in the cotton sample was low. Nearly all bacterial genera detected on the initial cotton sample were still present in the washed cotton samples. A selection for typical skin- and clothes-related microbial species occurred in the cotton samples after laundering. Accordingly, malodour-causing microbial species might be further distributed to other clothes. The bacteria on the ingoing textiles made up a large part of the microbiome found in the textiles after laundering.

    Bacterial diversity assessment in Antarctic terrestrial and aquatic microbial mats : a comparison between bidirectional pyrosequencing and cultivation

    The application of high-throughput sequencing of the 16S rRNA gene has increased the size of microbial diversity datasets by several orders of magnitude, providing improved access to the rare biosphere compared with cultivation-based approaches and more established cultivation-independent techniques. By contrast, cultivation-based approaches allow the retrieval of both common and uncommon bacteria that can grow in the conditions used and provide access to strains for biotechnological applications. We performed bidirectional pyrosequencing of the bacterial 16S rRNA gene diversity in two terrestrial and seven aquatic Antarctic microbial mat samples previously studied by heterotrophic cultivation. While, not unexpectedly, 77.5% of genera recovered by pyrosequencing were not among the isolates, 25.6% of the genera picked up by cultivation were not detected by pyrosequencing. To allow comparison between both techniques, we focused on the five phyla (Proteobacteria, Actinobacteria, Bacteroidetes, Firmicutes and Deinococcus-Thermus) recovered by heterotrophic cultivation. Four of these phyla were among the most abundantly recovered by pyrosequencing. Strikingly, there was relatively little overlap between cultivation and the forward and reverse pyrosequencing-based datasets at the genus (17.1–22.2%) and OTU (3.5–3.6%) level (defined on a 97% similarity cut-off level). Comparison of the V1–V2 and V3–V2 datasets of the 16S rRNA gene revealed remarkable differences in number of OTUs and genera recovered. The forward dataset missed 33% of the genera from the reverse dataset despite comprising 50% more OTUs, while the reverse dataset did not contain 40% of the genera of the forward dataset. Similar observations were evident when comparing the forward and reverse cultivation datasets. Our results indicate that the region under consideration can have a large impact on perceived diversity, and should be considered when comparing different datasets. 
    Finally, a high number of OTUs could not be classified using the RDP reference database, suggesting the presence of a large amount of novel diversity.

    Contact transmission of influenza virus between ferrets imposes a looser bottleneck than respiratory droplet transmission allowing propagation of antiviral resistance

    Influenza viruses cause annual seasonal epidemics and occasional pandemics. It is important to elucidate the stringency of bottlenecks during transmission to shed light on mechanisms that underlie the evolution and propagation of antigenic drift, host range switching or drug resistance. The virus spreads between people by different routes, including through the air in droplets and aerosols, and by direct contact. By housing ferrets under different conditions, it is possible to mimic various routes of transmission. Here, we inoculated donor animals with a mixture of two viruses whose genomes differed by one or two reverse-engineered synonymous mutations, and measured the transmission of the mixture to exposed sentinel animals. Transmission through the air imposed a tight bottleneck, since most recipient animals became infected by only one virus. In contrast, a direct contact transmission chain propagated a mixture of viruses, suggesting that the dose transferred by this route was higher. From animals with a mixed infection of viruses that were resistant and sensitive to the antiviral drug oseltamivir, resistance was propagated through contact transmission but not by air. These data imply that transmission events with a looser bottleneck can propagate minority variants and may be an important route for influenza evolution.

    Recovering the state sequence of hidden Markov models using mean-field approximations

    Inferring the sequence of states from observations is one of the most fundamental problems in Hidden Markov Models. In statistical physics language, this problem is equivalent to computing the marginals of a one-dimensional model with a random external field. While this task can be accomplished through transfer matrix methods, it quickly becomes intractable when the underlying state space is large. This paper develops several low-complexity approximate algorithms to address this inference problem when the state space becomes large. The new algorithms are based on various mean-field approximations of the transfer matrix. Their performance is studied in detail on a simple realistic model for DNA pyrosequencing. Comment: 43 pages, 41 figures
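
    For orientation, the exact computation that this abstract calls intractable for large state spaces can be sketched as the standard forward-backward recursion. This is the textbook exact method, not the paper's mean-field approximation; the function name `posterior_marginals` and the toy parameters are illustrative assumptions.

```python
def posterior_marginals(init, trans, emit, obs):
    """Exact posterior state marginals of an HMM via forward-backward.

    init[s]: initial probability of state s
    trans[r][s]: probability of moving from state r to state s
    emit[s][o]: probability of emitting symbol o in state s
    obs: list of observed symbol indices
    """
    n, T = len(init), len(obs)

    def normalise(vec):
        z = sum(vec)
        return [v / z for v in vec]

    # Forward pass: filtered distributions, rescaled at each step.
    fwd = [normalise([init[s] * emit[s][obs[0]] for s in range(n)])]
    for t in range(1, T):
        prev = fwd[-1]
        fwd.append(normalise([
            emit[s][obs[t]] * sum(prev[r] * trans[r][s] for r in range(n))
            for s in range(n)
        ]))

    # Backward pass: likelihood of future observations given each state.
    bwd = [[1.0] * n for _ in range(T)]
    for t in range(T - 2, -1, -1):
        bwd[t] = normalise([
            sum(trans[s][r] * emit[r][obs[t + 1]] * bwd[t + 1][r]
                for r in range(n))
            for s in range(n)
        ])

    # Combine both passes into per-position posterior marginals.
    return [normalise([fwd[t][s] * bwd[t][s] for s in range(n)])
            for t in range(T)]


# Toy two-state model with noisy emissions (values chosen for illustration).
init = [0.5, 0.5]
trans = [[0.9, 0.1], [0.2, 0.8]]
emit = [[0.8, 0.2], [0.3, 0.7]]
marginals = posterior_marginals(init, trans, emit, [0, 0, 1, 1])
```

    Each step sums over all pairs of states, so the cost grows as the sequence length times the square of the number of states, which is the scaling that motivates approximate methods when the state space is large.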

    Amino acid changes in the spike protein of feline coronavirus correlate with systemic spread of virus from the intestine and not with feline infectious peritonitis

    Recent evidence suggests that a mutation in the spike protein gene of feline coronavirus (FCoV), which results in an amino acid change from methionine to leucine at position 1058, may be associated with feline infectious peritonitis (FIP). Tissue and faecal samples collected post mortem from cats diagnosed with or without FIP were subjected to RNA extraction and quantitative reverse-transcriptase polymerase chain reaction (qRT-PCR) to detect FCoV RNA. In cats with FIP, 95% of tissue and 81% of faecal samples were PCR-positive, as opposed to 22% of tissue and 60% of faecal samples in cats without FIP. Relative FCoV copy numbers were significantly higher in the cats with FIP, both in tissues (P < 0.001) and faeces (P = 0.02). PCR-positive samples underwent pyrosequencing encompassing position 1058 of the FCoV spike protein. This identified a methionine codon at position 1058, consistent with the shedding of an enteric form of FCoV, in 77% of the faecal samples from cats with FIP, and in 100% of the samples from cats without FIP. In contrast, 91% of the tissue samples from cats with FIP and 89% from cats without FIP had a leucine codon at position 1058, consistent with a systemic form of FCoV. These results suggest that the methionine to leucine substitution at position 1058 in the FCoV spike protein is indicative of systemic spread of FCoV from the intestine, rather than a virus with the potential to cause FIP.

    Evaluating detection limits of next-generation sequencing for the surveillance and monitoring of international marine pests

    Most surveillance programmes for marine invasive species (MIS) require considerable taxonomic expertise, are laborious, and are unable to identify species at larval or juvenile stages. Therefore, marine pests may go undetected at the initial stages of incursions when population densities are low. In this study, we evaluated the ability of the benchtop GS Junior™ 454 pyrosequencing system to detect the presence of MIS in complex sample matrices. An initial in-silico evaluation of the mitochondrial cytochrome c oxidase subunit I (COI) and the nuclear small subunit ribosomal DNA (SSU) genes found that multiple primer sets (targeting a ca. 400 base pair region) would be required to obtain species-level identification within the COI gene. In contrast, a single universal primer set was designed to target the V1–V3 region of SSU, allowing simultaneous PCR amplification of a wide taxonomic range of MIS. To evaluate the limits of detection of this method, contrived communities (10 species from 5 taxonomic groups) were created using varying concentrations of known DNA samples and PCR products. Environmental samples (water and sediment) spiked with one or five 160-hour-old Asterias amurensis larvae were also examined. Pyrosequencing was able to recover DNA/PCR products of individual species present at greater than 0.64% abundance from all tested contrived communities. Additionally, single A. amurensis larvae were detected from both water and sediment samples despite the co-occurrence of a large array of environmental eukaryotes, indicating a sensitivity equivalent to that of quantitative PCR. NGS technology has tremendous potential for the early detection of marine invasive species worldwide.