36 research outputs found

    A reexamination of information theory-based methods for DNA-binding site identification

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Searching for transcription factor binding sites in genome sequences is still an open problem in bioinformatics. Despite substantial progress, search methods based on information theory remain a standard in the field, even though the full validity of their underlying assumptions has only been tested in artificial settings. Here we use newly available data on transcription factors from different bacterial genomes to make a more thorough assessment of information theory-based search methods.</p> <p>Results</p> <p>Our results reveal that conventional benchmarking against artificial sequence data leads frequently to overestimation of search efficiency. In addition, we find that sequence information by itself is often inadequate and therefore must be complemented by other cues, such as curvature, in real genomes. Furthermore, results on skewed genomes show that methods integrating skew information, such as <it>Relative Entropy</it>, are not effective because their assumptions may not hold in real genomes. The evidence suggests that binding sites tend to evolve towards genomic skew, rather than against it, and to maintain their information content through increased conservation. Based on these results, we identify several misconceptions on information theory as applied to binding sites, such as negative entropy, and we propose a revised paradigm to explain the observed results.</p> <p>Conclusion</p> <p>We conclude that, among information theory-based methods, the most unassuming search methods perform, on average, better than any other alternatives, since heuristic corrections to these methods are prone to fail when working on real data. A reexamination of information content in binding sites reveals that information content is a compound measure of search and binding affinity requirements, a fact that has important repercussions for our understanding of binding site evolution.</p

    Tracing back ancient oral microbiomes and oral pathogens using dental pulps from ancient teeth

    Get PDF
    International audienceAncient dental pulps are highly precious samples because they conserve DNA from humans and blood-borne pathogens for ages. However, little is known about the microbial communities present in dental pulps. Here, we analyzed ancient and modern dental pulp samples from different time periods and geographic regions and found that they are colonized by distinct microbial communities, which can be differentiated from other oral cavity samples. We found that despite the presence of environmental bacteria, ancient dental pulps conserve a clear and well-conserved record of oral microbes. We were able to detect several different oral pathogens in ancient and modern dental pulps, which are commonly associated with periodontal diseases. We thus showed that ancient dental pulps are not only valuable sources of DNA from humans and systemic infections, but also an open window for the study of ancient oral microbiomes
    corecore