91 research outputs found

    Dissemination of metaldehyde catabolic pathways is driven by mobile genetic elements in Proteobacteria

    Get PDF
    Bioremediation of metaldehyde from drinking water using metaldehyde-degrading strains has recently emerged as a promising alternative. Whole-genome sequencing was used to obtain full genomes for metaldehyde degraders Acinetobacter calcoaceticus E1 and Sphingobium CMET-H. For the former, the genetic context of the metaldehyde-degrading genes had not been explored, while for the latter, none of the degrading genes themselves had been identified. In A. calcoaceticus E1, IS91 and IS6-family insertion sequences (ISs) were found surrounding the metaldehyde-degrading gene cluster located in plasmid pAME76. This cluster was located in closely-related plasmids and associated to identical ISs in most metaldehyde-degrading ?-and ?-Proteobacteria, indicating horizontal gene transfer (HGT). For Sphingobium CMET-H, sequence analysis suggested a phytanoyl-CoA family oxygenase as a metaldehyde-degrading gene candidate due to its close homology to a previously identified metaldehyde-degrading gene known as mahX. Heterologous gene expression in Escherichia coli alongside degradation tests verified its functional significance and the degrading gene homolog was henceforth called mahS. It was found that mahS is hosted within the conjugative plasmid pSM1 and its genetic context suggested a crossover between the metaldehyde and acetoin degradation pathways. Here, specific replicons and ISs responsible for maintaining and dispersing metaldehyde-degrading genes in ?, ? and ?-Proteobacteria through HGT were identified and described. In addition, a homologous gene implicated in the first step of metaldehyde utilisation in an ?-Proteobacteria was uncovered. Insights into specific steps of this possible degradation pathway are provided.Funding information: VCG was supported by the University of Costa Rica with a Scholarship for Doctoral Studies Abroad and the University of York with a Scholarship for Overseas Students. MPG-B is supported by grant PID2020-117923GB-I00 from the Spanish Ministry of Science and Innovation. JM is grateful for NERC award (NE/N009061/1) and Thames Water for supporting a scholarship for EF

    Evaluating techniques for metagenome annotation using simulated sequence data

    Get PDF
    The advent of next-generation sequencing has allowed huge amounts of DNA sequence data to be produced, advancing the capabilities of microbial ecosystem studies. The current challenge is identifying from which microorganisms and genes the DNA originated. Several tools and databases are available for annotating DNA sequences. The tools, databases and parameters used can have a significant impact on the results: naïve choice of these factors can result in a false representation of community composition and function. We use a simulated metagenome to show how different parameters affect annotation accuracy by evaluating the sequence annotation performances of MEGAN, MG-RAST, One Codex and Megablast. This simulated metagenome allowed the recovery of known organism and function abundances to be quantitatively evaluated, which is not possible for environmental metagenomes. The performance of each program and database varied, e.g. One Codex correctly annotated many sequences at the genus level, whereas MG-RAST RefSeq produced many false positive annotations. This effect decreased as the taxonomic level investigated increased. Selecting more stringent parameters decreases the annotation sensitivity, but increases precision. Ultimately, there is a trade-off between taxonomic resolution and annotation accuracy. These results should be considered when annotating metagenomes and interpreting results from previous studies

    Dissemination of metaldehyde catabolic pathways is driven by mobile genetic elements in Proteobacteria

    Get PDF
    Bioremediation of metaldehyde from drinking water using metaldehyde-degrading strains has recently emerged as a promising alternative. Whole-genome sequencing was used to obtain full genomes for metaldehyde degraders Acinetobacter calcoaceticus E1 and Sphingobium CMET-H. For the former, the genetic context of the metaldehyde-degrading genes had not been explored, while for the latter, none of the degrading genes themselves had been identified. In A. calcoaceticus E1, IS91 and IS6-family insertion sequences (ISs) were found surrounding the metaldehyde-degrading gene cluster located in plasmid pAME76. This cluster was located in closely-related plasmids and associated to identical ISs in most metaldehyde-degrading β- and γ-Proteobacteria, indicating horizontal gene transfer (HGT). For Sphingobium CMET-H, sequence analysis suggested a phytanoyl-CoA family oxygenase as a metaldehyde-degrading gene candidate due to its close homology to a previously identified metaldehyde-degrading gene known as mahX. Heterologous gene expression in Escherichia coli alongside degradation tests verified its functional significance and the degrading gene homolog was henceforth called mahS. It was found that mahS is hosted within the conjugative plasmid pSM1 and its genetic context suggested a crossover between the metaldehyde and acetoin degradation pathways. Here, specific replicons and ISs responsible for maintaining and dispersing metaldehyde-degrading genes in α, β and γ-Proteobacteria through HGT were identified and described. In addition, a homologous gene implicated in the first step of metaldehyde utilisation in an α-Proteobacteria was uncovered. Insights into specific steps of this possible degradation pathway are provided

    Whole genome characterization of sequence diversity of 15,220 Icelanders

    Get PDF
    Understanding of sequence diversity is the cornerstone of analysis of genetic disorders, population genetics, and evolutionary biology. Here, we present an update of our sequencing set to 15,220 Icelanders who we sequenced to an average genome-wide coverage of 34X. We identified 39,020,168 autosomal variants passing GATK filters: 31,079,378 SNPs and 7,940,790 indels. Calling de novo mutations (DNMs) is a formidable challenge given the high false positive rate in sequencing datasets relative to the mutation rate. Here we addressed this issue by using segregation of alleles in three-generation families. Using this transmission assay, we controlled the false positive rate and identified 108,778 high quality DNMs. Furthermore, we used our extended family structure and read pair tracing of DNMs to a panel of phased SNPs, to determine the parent of origin of 42,961 DNMs.Peer Reviewe
    corecore