2,224 research outputs found
On the comparison of regulatory sequences with multiple resolution Entropic Profiles
Enhancers are stretches of DNA (100-1000 bp) that play a major role in development gene expression, evolution and disease. It has been recently shown that in high-level eukaryotes enhancers rarely work alone, instead they collaborate by forming clusters of cis-regulatory modules (CRMs). Although the binding of transcription factors is sequence-specific, the identification of functionally similar enhancers is very difficult and it cannot be carried out with traditional alignment-based techniques
Local Renyi entropic profiles of DNA sequences
<p>Abstract</p> <p>Background</p> <p>In a recent report the authors presented a new measure of continuous entropy for DNA sequences, which allows the estimation of their randomness level. The definition therein explored was based on the RĂ©nyi entropy of probability density estimation (pdf) using the Parzen's window method and applied to Chaos Game Representation/Universal Sequence Maps (CGR/USM). Subsequent work proposed a fractal pdf kernel as a more exact solution for the iterated map representation. This report extends the concepts of continuous entropy by defining DNA sequence entropic profiles using the new pdf estimations to refine the density estimation of motifs.</p> <p>Results</p> <p>The new methodology enables two results. On the one hand it shows that the entropic profiles are directly related with the statistical significance of motifs, allowing the study of under and over-representation of segments. On the other hand, by spanning the parameters of the kernel function it is possible to extract important information about the scale of each conserved DNA region. The computational applications, developed in Matlab m-code, the corresponding binary executables and additional material and examples are made publicly available at <url>http://kdbio.inesc-id.pt/~svinga/ep/</url>.</p> <p>Conclusion</p> <p>The ability to detect local conservation from a scale-independent representation of symbolic sequences is particularly relevant for biological applications where conserved motifs occur in multiple, overlapping scales, with significant future applications in the recognition of foreign genomic material and inference of motif structures.</p
Nonspecific Transcription-Factor-DNA Binding Influences Nucleosome Occupancy in Yeast
AbstractQuantitative understanding of the principles regulating nucleosome occupancy on a genome-wide level is a central issue in eukaryotic genomics. Here, we address this question using budding yeast, Saccharomyces cerevisiae, as a model organism. We perform a genome-wide computational analysis of the nonspecific transcription factor (TF)-DNA binding free-energy landscape and compare this landscape with experimentally determined nucleosome-binding preferences. We show that DNA regions with enhanced nonspecific TF-DNA binding are statistically significantly depleted of nucleosomes. We suggest therefore that the competition between TFs with histones for nonspecific binding to genomic sequences might be an important mechanism influencing nucleosome-binding preferences in vivo. We also predict that poly(dA:dT) and poly(dC:dG) tracts represent genomic elements with the strongest propensity for nonspecific TF-DNA binding, thus allowing TFs to outcompete nucleosomes at these elements. Our results suggest that nonspecific TF-DNA binding might provide a barrier for statistical positioning of nucleosomes throughout the yeast genome. We predict that the strength of this barrier increases with the concentration of DNA binding proteins in a cell. We discuss the connection of the proposed mechanism with the recently discovered pathway of active nucleosome reconstitution
Recommended from our members
Multiscale Simulations of Intrinsically Disordered Proteins
Intrinsically disordered proteins (IDPs) lack stable secondary and/or tertiary structures under physiological conditions. The have now been recognized to play important roles in numerous biological processes, particularly cellular signaling and regulation. Mutation of IDPs are frequently associated with human diseases, such as cancers and neuron degenerative diseases. Therefore, it is important to understand the structure, dynamics, and interactions of IDPs, so as to establish the mechanistic basis of how intrinsic disorder mediates versatile functions and how such mechanisms may fail in human diseases. However, the heterogeneous structural ensembles of IDPs are not amenable to high resolution characterization solely through experimental measurements, and molecular modelling and simulation are required to study IDP structures, dynamics, and interactions at the atomistic levels.
Here, we first applied the state-of-the-art explicit solvent atomistic simulations to an anti-apoptotic protein Bcl-xL and demonstrated how inherent structural disorder may provide a physical basis of protein regulated unfolding in signaling transduction. We have also constructed a series of efficient coarse-grained models to directly simulate the interactions between IDPs and unveiled how the preexisting structural elements accelerate binding of ACTR to NCBD by promoting efficient folding upon encounter. These studies shed important light on how IDPs perform functions in the cellular regulatory network, but also reveal the necessity of new sampling techniques for more efficient simulations of IDPs.
We have thus developed a novel sampling technique, called multiscale enhanced sampling (MSES). MSES couples the atomistic model with coarse-grained ones, to accelerate the sampling of atomistic conformational space. Bias from coupling to a coarse-grained model can be removed using Hamiltonian replica exchange. To achieve the best possible efficiency of MSES simulations, we have developed a new hybrid resolution protein model that could capture the essential features of IDP structures, so as to generate local and long-range fluctuations that are largely consistent with those at the atomistic level. We have also developed an advanced replica exchange protocol, to allow the fast conformational transitions observed in the coupled conditions to be rapidly exchanged to the unbiased limit. Application of these strategies to characterize the structural ensembles of a few non-trivial IDPs shows that faster convergence rate can be achieved, demonstrating the great potential of MSES for atomistic simulations of larger and more complex IDPs
Statistical modeling of RNA structure profiling experiments enables parsimonious reconstruction of structure landscapes.
RNA plays key regulatory roles in diverse cellular processes, where its functionality often derives from folding into and converting between structures. Many RNAs further rely on co-existence of alternative structures, which govern their response to cellular signals. However, characterizing heterogeneous landscapes is difficult, both experimentally and computationally. Recently, structure profiling experiments have emerged as powerful and affordable structure characterization methods, which improve computational structure prediction. To date, efforts have centered on predicting one optimal structure, with much less progress made on multiple-structure prediction. Here, we report a probabilistic modeling approach that predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data. We demonstrate robust landscape reconstruction and quantitative insights into structural dynamics by analyzing numerous data sets. This work establishes a framework for data-directed characterization of structure landscapes to aid experimentalists in performing structure-function studies
- …