114 research outputs found

    Cross-study analyses of microbial abundance using generalized common factor methods

    Full text link
    By creating networks of biochemical pathways, communities of micro-organisms are able to modulate the properties of their environment and even the metabolic processes within their hosts. Next-generation high-throughput sequencing has led to a new frontier in microbial ecology, promising the ability to leverage the microbiome to make crucial advancements in the environmental and biomedical sciences. However, this is challenging, as genomic data are high-dimensional, sparse, and noisy. Much of this noise reflects the exact conditions under which sequencing took place, and is so significant that it limits consensus-based validation of study results. We propose an ensemble approach for cross-study exploratory analyses of microbial abundance data in which we first estimate the variance-covariance matrix of the underlying abundances from each dataset on the log scale assuming Poisson sampling, and subsequently model these covariances jointly so as to find a shared low-dimensional subspace of the feature space. By viewing the projection of the latent true abundances onto this common structure, the variation is pared down to that which is shared among all datasets, and is likely to reflect more generalizable biological signal than can be inferred from individual datasets. We investigate several ways of achieving this, and demonstrate that they work well on simulated and real metagenomic data in terms of signal retention and interpretability

    Infectious Complications Are Associated With Alterations in the Gut Microbiome in Pediatric Patients With Acute Lymphoblastic Leukemia

    Get PDF
    Acute lymphoblastic leukemia is the most common pediatric cancer. Fortunately, survival rates exceed 90%, however, infectious complications remain a significant issue that can cause reductions in the quality of life and prognosis of patients. Recently, numerous studies have linked shifts in the gut microbiome composition to infection events in various hematological malignances including acute lymphoblastic leukemia (ALL). These studies have been limited to observing broad taxonomic changes using 16S rRNA gene profiling, while missing possible differences within microbial functions encoded by individual species. In this study we present the first combined 16S rRNA gene and metagenomic shotgun sequencing study on the gut microbiome of an independent pediatric ALL cohort during treatment. In this study we found distinctive differences in alpha diversity and beta diversity in samples from patients with infectious complications in the first 6 months of therapy. We were also able to find specific species and functional pathways that were significantly different in relative abundance between samples that came from patients with infectious complications. Finally, machine learning models based on patient metadata and bacterial species were able to classify samples with high accuracy (84.09%), with bacterial species being the most important classifying features. This study strengthens our understanding of the association between infection and pediatric acute lymphoblastic leukemia treatment and warrants further investigation in the future

    The Association of Virulence Factors with Genomic Islands

    Get PDF
    Background: It has been noted that many bacterial virulence factor genes are located within genomic islands (GIs; clusters of genes in a prokaryotic genome of probable horizontal origin). However, such studies have been limited to single genera or isolated observations. We have performed the first large-scale analysis of multiple diverse pathogens to examine this association. We additionally identified genes found predominantly in pathogens, but not non-pathogens, across multiple genera using 631 complete bacterial genomes, and we identified common trends in virulence for genes in GIs. Furthermore, we examined the relationship between GIs and clustered regularly interspaced palindromic repeats (CRISPRs) proposed to confer resistance to phage. Methodology/Principal Findings: We show quantitatively that GIs disproportionately contain more virulence factors than the rest of a given genome (p,1E-40 using three GI datasets) and that CRISPRs are also over-represented in GIs. Virulence factors in GIs and pathogen-associated virulence factors are enriched for proteins having more ‘‘offensive’ ’ functions, e.g. active invasion of the host, and are disproportionately components of type III/IV secretion systems or toxins. Numerous hypothetical pathogen-associated genes were identified, meriting further study. Conclusions/Significance: This is the first systematic analysis across diverse genera indicating that virulence factors are disproportionately associated with GIs. ‘‘Offensive’ ’ virulence factors, as opposed to host-interaction factors, may more ofte

    Detection of Helicobacter pylori Microevolution and Multiple Infection from Gastric Biopsies by Housekeeping Gene Amplicon Sequencing

    Get PDF
    Despite the great efforts devoted to research on Helicobacter pylori, the prevalence of single-strain infection or H. pylori mixed infection and its implications in the mode of transmission of this bacterium are still controversial. In this study, we explored the usefulness of housekeeping gene amplicon sequencing in the detection of H. pylori microevolution and multiple infections. DNA was extracted from five gastric biopsies from four patients infected with distinct histopathological diagnoses. PCR amplification of six H. pylori-specific housekeeping genes was then assessed on each sample. Optimal results were obtained for the cgt and luxS genes, which were selected for amplicon sequencing. A total of 11,833 cgt and 403 luxS amplicon sequences were obtained, 2042 and 112 of which were unique sequences, respectively. All cgt and luxS sequences were clustered at 97% to 9 and 13 operational taxonomic units (OTUs), respectively. For each sample from a different patient, a single OTU comprised the majority of sequences in both genes, but more than one OTU was detected in all samples. These results suggest that multiple infections with a predominant strain together with other minority strains are the main way by which H. pylori colonizes the human stomach

    Microbial hitchhikers harbouring antimicrobial-resistance genes in the riverine plastisphere

    Get PDF
    Background: The widespread nature of plastic pollution has given rise to wide scientific and social concern regarding the capacity of these materials to serve as vectors for pathogenic bacteria and reservoirs for Antimicrobial Resistance Genes (ARG). In- and ex-situ incubations were used to characterise the riverine plastisphere taxonomically and functionally in order to determine whether antibiotics within the water influenced the ARG profiles in these microbiomes and how these compared to those on natural surfaces such as wood and their planktonic counterparts. Results: We show that plastics support a taxonomically distinct microbiome containing potential pathogens and ARGs. While the plastisphere was similar to those biofilms that grew on wood, they were distinct from the surrounding water microbiome. Hence, whilst potential opportunistic pathogens (i.e. Pseudomonas aeruginosa, Acinetobacter and Aeromonas) and ARG subtypes (i.e. those that confer resistance to macrolides/lincosamides, rifamycin, sulfonamides, disinfecting agents and glycopeptides) were predominant in all surface-related microbiomes, especially on weathered plastics, a completely different set of potential pathogens (i.e. Escherichia, Salmonella, Klebsiella and Streptococcus) and ARGs (i.e. aminoglycosides, tetracycline, aminocoumarin, fluoroquinolones, nitroimidazole, oxazolidinone and fosfomycin) dominated in the planktonic compartment. Our genome-centric analysis allowed the assembly of 215 Metagenome Assembled Genomes (MAGs), linking ARGs and other virulence-related genes to their host. Interestingly, a MAG belonging to Escherichia –that clearly predominated in water– harboured more ARGs and virulence factors than any other MAG, emphasising the potential virulent nature of these pathogenic-related groups. Finally, ex-situ incubations using environmentally-relevant concentrations of antibiotics increased the prevalence of their corresponding ARGs, but different riverine compartments –including plastispheres– were affected differently by each antibiotic. Conclusions: Our results provide insights into the capacity of the riverine plastisphere to harbour a distinct set of potentially pathogenic bacteria and function as a reservoir of ARGs. The environmental impact that plastics pose if they act as a reservoir for either pathogenic bacteria or ARGs is aggravated by the persistence of plastics in the environment due to their recalcitrance and buoyancy. Nevertheless, the high similarities with microbiomes growing on natural co-occurring materials and even more worrisome microbiome observed in the surrounding water highlights the urgent need to integrate the analysis of all environmental compartments when assessing risks and exposure to pathogens and ARGs in anthropogenically-impacted ecosystems. 1SQe33MjkWBo3cdx_C_SmDVideo Abstrac

    BioTorrents: A File Sharing Service for Scientific Data

    Get PDF
    The transfer of scientific data has emerged as a significant challenge, as datasets continue to grow in size and demand for open access sharing increases. Current methods for file transfer do not scale well for large files and can cause long transfer times. In this study we present BioTorrents, a website that allows open access sharing of scientific data and uses the popular BitTorrent peer-to-peer file sharing technology. BioTorrents allows files to be transferred rapidly due to the sharing of bandwidth across multiple institutions and provides more reliable file transfers due to the built-in error checking of the file sharing technology. BioTorrents contains multiple features, including keyword searching, category browsing, RSS feeds, torrent comments, and a discussion forum. BioTorrents is available at http://www.biotorrents.net

    Research Article Genomic Analysis of a Serotype 5 Streptococcus pneumoniae Outbreak in British Columbia

    Get PDF
    Background. Streptococcus pneumoniae can cause a wide spectrum of disease, including invasive pneumococcal disease (IPD). From 2005 to 2009 an outbreak of IPD occurred in Western Canada, caused by a S. pneumoniae strain with multilocus sequence type (MLST) 289 and serotype 5. We sought to investigate the incidence of IPD due to this S. pneumoniae strain and to characterize the outbreak in British Columbia using whole-genome sequencing. Methods. IPD was defined according to Public Health Agency of Canada guidelines. Two isolates representing the beginning and end of the outbreak were whole-genome sequenced. The sequences were analyzed for single nucleotide variants (SNVs) and putative genomic islands. Results. The peak of the outbreak in British Columbia was in 2006, when 57% of invasive S. pneumoniae isolates were serotype 5. Comparison of two whole-genome sequenced strains showed only 10 SNVs between them. A 15.5 kb genomic island was identified in outbreak strains, allowing the design of a PCR assay to track the spread of the outbreak strain. Discussion. We show that the serotype 5 MLST 289 strain contains a distinguishing genomic island, which remained genetically consistent over time. Whole-genome sequencing holds great promise for real-time characterization of outbreaks in the future and may allow responses tailored to characteristics identified in the genome

    The evolutionary signal in metagenome phyletic profiles predicts many gene functions

    Get PDF
    Background. The function of many genes is still not known even in model organisms. An increasing availability of microbiome DNA sequencing data provides an opportunity to infer gene function in a systematic manner. Results. We evaluated if the evolutionary signal contained in metagenome phyletic profiles (MPP) is predictive of a broad array of gene functions. The MPPs are an encoding of environmental DNA sequencing data that consists of relative abundances of gene families across metagenomes. We find that such MPPs can accurately predict 826 Gene Ontology functional categories, while drawing on human gut microbiomes, ocean metagenomes, and DNA sequences from various other engineered and natural environments. Overall, in this task, the MPPs are highly accurate, and moreover they provide coverage for a set of Gene Ontology terms largely complementary to standard phylogenetic profiles, derived from fully sequenced genomes. We also find that metagenomes approximated from taxon relative abundance obtained via 16S rRNA gene sequencing may provide surprisingly useful predictive models. Crucially, the MPPs derived from different types of environments can infer distinct, non-overlapping sets of gene functions and therefore complement each other. Consistently, simulations on > 5000 metagenomes indicate that the amount of data is not in itself critical for maximizing predictive accuracy, while the diversity of sampled environments appears to be the critical factor for obtaining robust models. Conclusions. In past work, metagenomics has provided invaluable insight into ecology of various habitats, into diversity of microbial life and also into human health and disease mechanisms. We propose that environmental DNA sequencing additionally constitutes a useful tool to predict biological roles of genes, yielding inferences out of reach for existing comparative genomics approaches

    The genome sequence of E. coli W (ATCC 9637): comparative genome analysis and an improved genome-scale reconstruction of E. coli

    Get PDF
    Background: Escherichia coli is a model prokaryote, an important pathogen, and a key organism for industrial biotechnology. E. coli W (ATCC 9637), one of four strains designated as safe for laboratory purposes, has not been sequenced. E. coli W is a fast-growing strain and is the only safe strain that can utilize sucrose as a carbon source. Lifecycle analysis has demonstrated that sucrose from sugarcane is a preferred carbon source for industrial bioprocesses
    corecore