22 research outputs found

    Gentle Masking of Low-Complexity Sequences Improves Homology Search

    Get PDF
    Detection of sequences that are homologous, i.e. descended from a common ancestor, is a fundamental task in computational biology. This task is confounded by low-complexity tracts (such as atatatatatat), which arise frequently and independently, causing strong similarities that are not homologies. There has been much research on identifying low-complexity tracts, but little research on how to treat them during homology search. We propose to find homologies by aligning sequences with “gentle” masking of low-complexity tracts. Gentle masking means that the match score involving a masked letter is , where is the unmasked score. Gentle masking slightly but noticeably improves the sensitivity of homology search (compared to “harsh” masking), without harming specificity. We show examples in three useful homology search problems: detection of NUMTs (nuclear copies of mitochondrial DNA), recruitment of metagenomic DNA reads to reference genomes, and pseudogene detection. Gentle masking is currently the best way to treat low-complexity tracts during homology search

    Mining for viral fragments in methylation enriched sequencing data

    Get PDF
    Most next generation sequencing experiments generate more data than is required for the experimental set up. For example, methyl-CpG binding domain (MBD) affinity purification based sequencing is often used for DNA-methylation profiling, but up to 30% of the sequenced fragments cannot be mapped uniquely to the reference genome. Here we present and evaluate a methodology for the identification of viruses in these otherwise unused paired-end MBD-seq data. Viral detection is accomplished by mapping non-reference alignable reads to a comprehensive set of viral genomes. As viruses play an important role in epigenetics and cancer development, 92 (pre)malignant and benign samples, originating from two different collections of cervical samples and related cell lines, were used in this study. These samples include primary carcinomas (n=22), low- & high-grade cervical intrapeithelial neoplasia (CIN1 & CIN2/3 - n=2/n=30) and normal tissue (n=20), as well as control samples (n=17). Viruses that were detected include phages, adenoviruses, herpesviridae and HPV. HPV, which causes virtually all cervical cancers, was identified in 95% of the carcinomas, 100% of the CIN2/3 samples, both CIN1 samples and in 55% of the normal samples. Comparing the amount of mapped fragments on HPV for each HPV-infected sample yielded a significant difference between normal samples and carcinomas or CIN2/3 samples (adjusted p-values resp. < 10^-5, < 10^-5), reflecting different viral loads and/or methylation degrees in non-normal samples. Fragments originating from different HPV types could be distinguished and were independently validated by PCR-based assays with a specificity of 98% and a sensitivitity of 66%. In conclusion, although limited by the a priori knowledge of viral reference genome sequences, the proposed methodology can provide a first but substantial insight into the presence, concentration and types of methylated viral sequences in MBD-seq data without additional costs

    Perturbation Dynamics of the Rumen Microbiota in Response to Exogenous Butyrate

    Get PDF
    The capacity of the rumen microbiota to produce volatile fatty acids (VFAs) has important implications in animal well-being and production. We investigated temporal changes of the rumen microbiota in response to butyrate infusion using pyrosequencing of the 16S rRNA gene. Twenty one phyla were identified in the rumen microbiota of dairy cows. The rumen microbiota harbored 54.5±6.1 genera (mean ± SD) and 127.3±4.4 operational taxonomic units (OTUs), respectively. However, the core microbiome comprised of 26 genera and 82 OTUs. Butyrate infusion altered molar percentages of 3 major VFAs. Butyrate perturbation had a profound impact on the rumen microbial composition. A 72 h-infusion led to a significant change in the numbers of sequence reads derived from 4 phyla, including 2 most abundant phyla, Bacteroidetes and Firmicutes. As many as 19 genera and 43 OTUs were significantly impacted by butyrate infusion. Elevated butyrate levels in the rumen seemingly had a stimulating effect on butyrate-producing bacteria populations. The resilience of the rumen microbial ecosystem was evident as the abundance of the microorganisms returned to their pre-disturbed status after infusion withdrawal. Our findings provide insight into perturbation dynamics of the rumen microbial ecosystem and should guide efforts in formulating optimal uses of probiotic bacteria treating human diseases

    Metagenome Plasticity of the Bovine Abomasal Microbiota in Immune Animals in Response to Ostertagia Ostertagi Infection

    Get PDF
    Infections in cattle by the abomasal nematode Ostertagia ostertagi result in impaired gastrointestinal function. Six partially immune animals were developed using multiple drug-attenuated infections, and these animals displayed reduced worm burdens and a slightly elevated abomasal pH upon reinfection. In this study, we characterized the abomasal microbiota in response to reinfection using metagenomic tools. Compared to uninfected controls, infection did not induce a significant change in the microbial community composition in immune animals. 16S rRNA gene-based phylogenetic analysis identified 15 phyla in the bovine abomasal microbiota with Bacteroidetes (60.5%), Firmicutes (27.1%), Proteobacteria (7.2%), Spirochates (2.9%), and Fibrobacteres (1.5%) being the most predominant. The number of prokaryotic genera and operational taxonomic units (OTU) identified in the abomasal microbial community was 70.8±19.8 (mean ± SD) and 90.3±2.9, respectively. However, the core microbiome comprised of 32 genera and 72 OTU. Infection seemingly had a minimal impact on the abomasal microbial diversity at a genus level in immune animals. Proteins predicted from whole genome shotgun (WGS) DNA sequences were assigned to 5,408 Pfam and 3,381 COG families, demonstrating dazzling arrays of functional diversity in bovine abomasal microbial communities. However, none of COG functional classes were significantly impacted by infection. Our results demonstrate that immune animals may develop abilities to maintain proper stability of their abomasal microbial ecosystem. A minimal disruption in the bovine abomasal microbiota by reinfection may contribute equally to the restoration of gastric function in immune animals

    Bacteria Isolated From the Antarctic Sponge Iophon sp. Reveals Mechanisms of Symbiosis in Sporosarcina, Cellulophaga, and Nesterenkonia

    Get PDF
    Antarctic sponges harbor a diverse range of microorganisms that perform unique metabolic functions for nutrient cycles. Understanding how microorganisms establish functional sponge–microbe interactions in the Antarctic marine ecosystem provides clues about the success of these ancient animals in this realm. Here, we use a culture-dependent approach and genome sequencing to investigate the molecular determinants that promote a dual lifestyle in three bacterial genera Sporosarcina, Cellulophaga, and Nesterenkonia. Phylogenomic analyses showed that four sponge-associated isolates represent putative novel bacterial species within the Sporosarcina and Nesterenkonia genera and that the fifth bacterial isolate corresponds to Cellulophaga algicola. We inferred that isolated sponge-associated bacteria inhabit similarly marine sponges and also seawater. Comparative genomics revealed that these sponge-associated bacteria are enriched in symbiotic lifestyle-related genes. Specific adaptations related to the cold Antarctic environment are features of the bacterial strains isolated here. Furthermore, we showed evidence that the vitamin B5 synthesis-related gene, panE from Nesterenkonia E16_7 and E16_10, was laterally transferred within Actinobacteria members. Together, these findings indicate that the genomes of sponge-associated strains differ from other related genomes based on mechanisms that may contribute to the life in association with sponges and the extreme conditions of the Antarctic environment
    corecore