411 research outputs found

    Ubiquitous nucleosome unwrapping in the yeast genome

    Get PDF
    Nucleosome core particle is a dynamic structure -- DNA may transiently peel off the histone octamer surface due to thermal fluctuations or the action of chromatin remodeling enzymes. Partial DNA unwrapping enables easier access of DNA-binding factors to their target sites and thus may provide a dominant pathway for effecting rapid and robust access to DNA packaged into chromatin. Indeed, a recent high-resolution map of distances between neighboring nucleosome dyads in \emph{S.cerevisiae} shows that at least 38.7\% of all nucleosomes are partially unwrapped. The extent of unwrapping follows a stereotypical pattern in the vicinity of genes, reminiscent of the canonical pattern of nucleosome occupancy in which nucleosomes are depleted over promoters and well-positioned over coding regions. To explain these observations, we developed a biophysical model which employs a 10-11 base pair periodic nucleosome energy profile. The profile, based on the pattern of histone-DNA contacts in nucleosome crystal structures and the idea of linker length discretization, accounts for both nucleosome unwrapping and higher-order chromatin structure. Our model reproduces the observed genome-wide distribution of inter-dyad distances, and accounts for patterns of nucleosome occupancy and unwrapping around coding regions. At the same time, our approach explains \emph{in vitro} measurements of accessibility of nucleosome-covered binding sites, and of nucleosome-induced cooperativity between DNA-binding factors. We are able to rule out several alternative scenarios of nucleosome unwrapping as inconsistent with the genomic data.Comment: 49 pages; 15 figure

    Statistical Physics of Evolutionary Trajectories on Fitness Landscapes

    Full text link
    Random walks on multidimensional nonlinear landscapes are of interest in many areas of science and engineering. In particular, properties of adaptive trajectories on fitness landscapes determine population fates and thus play a central role in evolutionary theory. The topography of fitness landscapes and its effect on evolutionary dynamics have been extensively studied in the literature. We will survey the current research knowledge in this field, focusing on a recently developed systematic approach to characterizing path lengths, mean first-passage times, and other statistics of the path ensemble. This approach, based on general techniques from statistical physics, is applicable to landscapes of arbitrary complexity and structure. It is especially well-suited to quantifying the diversity of stochastic trajectories and repeatability of evolutionary events. We demonstrate this methodology using a biophysical model of protein evolution that describes how proteins maintain stability while evolving new functions

    Biophysical Fitness Landscapes for Transcription Factor Binding Sites

    Full text link
    Evolutionary trajectories and phenotypic states available to cell populations are ultimately dictated by intermolecular interactions between DNA, RNA, proteins, and other molecular species. Here we study how evolution of gene regulation in a single-cell eukaryote S. cerevisiae is affected by the interactions between transcription factors (TFs) and their cognate genomic sites. Our study is informed by high-throughput in vitro measurements of TF-DNA binding interactions and by a comprehensive collection of genomic binding sites. Using an evolutionary model for monomorphic populations evolving on a fitness landscape, we infer fitness as a function of TF-DNA binding energy for a collection of 12 yeast TFs, and show that the shape of the predicted fitness functions is in broad agreement with a simple thermodynamic model of two-state TF-DNA binding. However, the effective temperature of the model is not always equal to the physical temperature, indicating selection pressures in addition to biophysical constraints caused by TF-DNA interactions. We find little statistical support for the fitness landscape in which each position in the binding site evolves independently, showing that epistasis is common in evolution of gene regulation. Finally, by correlating TF-DNA binding energies with biological properties of the sites or the genes they regulate, we are able to rule out several scenarios of site-specific selection, under which binding sites of the same TF would experience a spectrum of selection pressures depending on their position in the genome. These findings argue for the existence of universal fitness landscapes which shape evolution of all sites for a given TF, and whose properties are determined in part by the physics of protein-DNA interactions

    Single temperature for Monte Carlo optimization on complex landscapes

    Full text link
    We propose a new strategy for Monte Carlo (MC) optimization on rugged multidimensional landscapes. The strategy is based on querying the statistical properties of the landscape in order to find the temperature at which the mean first passage time across the current region of the landscape is minimized. Thus, in contrast to other algorithms such as simulated annealing (SA), we explicitly match the temperature schedule to the statistics of landscape irregularities. In cases where this statistics is approximately the same over the entire landscape, or where non-local moves couple distant parts of the landscape, single-temperature MC will outperform any other MC algorithm with the same move set. We also find that in strongly anisotropic Coulomb spin glass and traveling salesman problems, the only relevant statistics (which we use to assign a single MC temperature) is that of irregularities in low-energy funnels. Our results may explain why protein folding in nature is efficient at room temperatures.Comment: 5 pages, 3 figure

    Pairwise and higher-order correlations among drug-resistance mutations in HIV-1 subtype B protease

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The reaction of HIV protease to inhibitor therapy is characterized by the emergence of complex mutational patterns which confer drug resistance. The response of HIV protease to drugs often involves both primary mutations that directly inhibit the action of the drug, and a host of accessory resistance mutations that may occur far from the active site but may contribute to restoring the fitness or stability of the enzyme. Here we develop a probabilistic approach based on connected information that allows us to study residue, pair level and higher-order correlations within the same framework.</p> <p>Results</p> <p>We apply our methodology to a database of approximately 13,000 sequences which have been annotated by the treatment history of the patients from which the samples were obtained. We show that including pair interactions is essential for agreement with the mutational data, since neglect of these interactions results in order-of-magnitude errors in the probabilities of the simultaneous occurence of many mutations. The magnitude of these pair correlations changes dramatically between sequences obtained from patients that were or were not exposed to drugs. Higher-order effects make a contribution of as much as 10% for residues taken three at a time, but increase to more than twice that for 10 to 15-residue groups. The sequence data is insufficient to determine the higher-order effects for larger groups. We find that higher-order interactions have a significant effect on the predicted frequencies of sequences with large numbers of mutations. While relatively rare, such sequences are more prevalent after multi-drug therapy. The relative importance of these higher-order interactions increases with the number of drugs the patient had been exposed to.</p> <p>Conclusion</p> <p>Correlations are critical for the understanding of mutation patterns in HIV protease. Pair interactions have substantial qualitative effects, while higher-order interactions are individually smaller but may have a collective effect. Together they lead to correlations which could have an important impact on the dynamics of the evolution of cross-resistance, by allowing the virus to pass through otherwise unlikely mutational states. These findings also indicate that pairwise and possibly higher-order effects should be included in the models of protein evolution, instead of assuming that all residues mutate independently of one another.</p
    • …
    corecore