Statistical-mechanical lattice models for protein-DNA binding in chromatin
Statistical-mechanical lattice models for protein-DNA binding are well
established as a method to describe complex ligand binding equilibria
measured in vitro with purified DNA and protein components. Recently, a new
field of applications has opened up for this approach since it has become
possible to experimentally quantify genome-wide protein occupancies in relation
to the DNA sequence. In particular, the organization of the eukaryotic genome
by histone proteins into a nucleoprotein complex termed chromatin has been
recognized as a key parameter that controls the access of transcription factors
to the DNA sequence. New approaches have to be developed to derive statistical
mechanical lattice descriptions of chromatin-associated protein-DNA
interactions. Here, we present the theoretical framework for lattice models of
histone-DNA interactions in chromatin and investigate the (competitive) DNA
binding of other chromosomal proteins and transcription factors. The results
have a number of applications for quantitative models for the regulation of
gene expression.
Comment: 19 pages, 7 figures, accepted author manuscript, to appear in J. Phys.: Cond. Mat
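As an illustration of what a lattice description of binding looks like in practice, the following is a minimal sketch of a generic one-dimensional lattice model, not the framework of the paper itself. It assumes a single ligand type that covers `m` consecutive base pairs, no cooperativity, and a statistical weight `weights[i]` (binding constant times free concentration) for a ligand whose left edge sits at site `i`. The function names and this parameterization are hypothetical.

```python
def binding_probabilities(weights, m):
    """Partition function and start-site probabilities on a 1D lattice.

    weights[i] is the assumed statistical weight (K_i * c) of a ligand
    bound with its left edge at site i, covering sites i .. i+m-1.
    Ligands may not overlap; there are no other interactions.
    """
    n = len(weights)
    # forward[i]: partition function of the first i lattice sites
    forward = [1.0] * (n + 1)
    for i in range(1, n + 1):
        forward[i] = forward[i - 1]          # site i-1 is left free
        if i >= m:                           # a ligand ends at site i-1
            forward[i] += forward[i - m] * weights[i - m]
    # backward[i]: partition function of sites i .. n-1
    backward = [1.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        backward[i] = backward[i + 1]        # site i is left free
        if i + m <= n:                       # a ligand starts at site i
            backward[i] += weights[i] * backward[i + m]
    z = forward[n]
    p_start = [
        forward[i] * weights[i] * backward[i + m] / z if i + m <= n else 0.0
        for i in range(n)
    ]
    return z, p_start


def site_occupancy(p_start, m):
    """Probability that each site is covered by any bound ligand."""
    n = len(p_start)
    return [sum(p_start[max(0, i - m + 1): i + 1]) for i in range(n)]
```

For example, `binding_probabilities([1.0, 1.0, 1.0], 2)` enumerates three states (empty lattice, ligand at site 0, ligand at site 1). Competitive binding of several protein types would need one weight array and one footprint per type; that extension is omitted here.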
NucPosDB: a database of nucleosome positioning in vivo and nucleosomics of cell-free DNA
Nucleosome positioning is involved in many gene regulatory processes happening in the cell, and it may change as cells differentiate or respond to the changing microenvironment in a healthy or diseased organism. One important implication of nucleosome positioning in clinical epigenetics is its use in the “nucleosomics” analysis of cell-free DNA (cfDNA) for the purpose of patient diagnostics in liquid biopsies. The rationale for this is that the apoptotic nucleases that digest chromatin of the dying cells mostly cut DNA between nucleosomes. Thus, the short pieces of DNA in body fluids reflect the positions of nucleosomes in the cells of origin. Here, we report a systematic nucleosomics database, NucPosDB, curating published nucleosome positioning datasets in vivo as well as datasets of sequenced cfDNA that reflect nucleosome positioning in situ in the cells of origin. Users can select subsets of the database by a number of criteria and then obtain raw or processed data. NucPosDB also reports the originally determined regions with stable nucleosome occupancy across several individuals with a given condition. An additional section provides a catalogue of computational tools for the analysis of nucleosome positioning or cfDNA experiments and theoretical algorithms for the prediction of nucleosome positioning preferences from DNA sequence. We provide an overview of the field, describe the structure of the database in this context, and demonstrate data variability using examples of different medical conditions. NucPosDB is useful both for the analysis of fundamental gene regulation processes and the training of computational models for patient diagnostics based on cfDNA. The database currently curates ~400 publications on nucleosome positioning in cell lines and in situ as well as cfDNA from >10,000 patients and healthy volunteers. For open-access cfDNA datasets as well as key MNase-seq datasets in human cells, NucPosDB allows downloading processed mapped data in addition to the regions with stable nucleosome occupancy. NucPosDB is available at https://generegulation.org/nucposdb/
Nucleosomes in gene regulation: theoretical approaches
This work reviews current theoretical approaches in biophysics and
bioinformatics for describing nucleosome arrangements in chromatin and
transcription factor binding to nucleosome-organized DNA. The role of
nucleosomes in gene regulation is discussed from the molecular-mechanistic and
biological points of view. In addition to the classical problems of this field,
current questions of epigenetic regulation are discussed. The authors selected
for discussion what seem to be the most interesting concepts and hypotheses.
Mathematical approaches are described in simplified language to draw
attention to the most important directions of this field
NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data
Background: Biomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them. Results: Here we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs). Conclusions: The typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features such as binding of the transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations, we discuss how inter-replicate variability of MNase-seq experiments can be addressed
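The idea of a replicate-averaged, continuous occupancy profile can be sketched as follows. This is not NucTools code (NucTools is a suite of Perl scripts); it is a hypothetical Python illustration that assumes each replicate is already a per-base-pair occupancy list over the same region, and that scaling each replicate to a mean of 1 is an acceptable normalization before averaging.

```python
def average_occupancy(replicates):
    """Average per-position occupancy over replicates.

    replicates: list of equal-length lists of non-negative occupancy
    values, one list per replicate (assumed input format).
    Each replicate is divided by its own mean so that differences in
    sequencing depth do not dominate the average.
    """
    if not replicates:
        raise ValueError("need at least one replicate")
    length = len(replicates[0])
    if any(len(rep) != length for rep in replicates):
        raise ValueError("replicates must cover the same positions")

    normalized = []
    for rep in replicates:
        mean = sum(rep) / length
        if mean == 0:
            raise ValueError("replicate has zero total occupancy")
        normalized.append([value / mean for value in rep])

    # Column-wise mean across replicates at every position
    return [sum(column) / len(column) for column in zip(*normalized)]
```

With this normalization, two replicates that differ only by a constant depth factor yield the same averaged profile as either one alone. Other normalization choices are possible and may suit real MNase-seq data better.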
Chromatin and epigenetics: current biophysical views
Recent advances in high-throughput sequencing experiments and their theoretical descriptions have driven the rapid development of the "chromatin and epigenetics" field, with new concepts appearing at a high rate. This field includes but is not limited to the study of DNA-protein-RNA interactions, chromatin packing properties at different scales, regulation of gene expression and protein trafficking in the cell nucleus, binding site search in the crowded chromatin environment and modulation of physical interactions by covalent chemical modifications of the binding partners. The current special issue does not claim full coverage of the field; rather, it aims to capture its development and provide a snapshot of the most recent concepts and approaches. The eighteen open-access articles comprising this issue provide a delicate balance between current theoretical and experimental biophysical approaches to uncovering chromatin structure and understanding epigenetic regulation, allowing a free flow of new ideas and preliminary results
Mammalian transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes and are predicted to act as transcriptional activator hubs
BACKGROUND: Transcriptional hotspots are defined as genomic regions bound by multiple factors. They have been identified recently as cell type specific enhancers regulating developmentally essential genes in many species such as worm, fly and humans. The in-depth analysis of hotspots across multiple cell types in the same species still remains to be explored and can bring new biological insights. RESULTS: We therefore collected 108 transcription-related factor (TF) ChIP sequencing data sets in ten murine cell types and classified the peaks in each cell type in three groups according to binding occupancy as singletons (low-occupancy), combinatorials (mid-occupancy) and hotspots (high-occupancy). The peaks in the three groups clustered largely according to the occupancy, suggesting priming of genomic loci for mid occupancy irrespective of cell type. We then characterized hotspots for diverse structural and functional properties. The genes neighbouring hotspots had a small overlap with hotspot genes in other cell types and were highly enriched for cell type specific function. Hotspots were enriched for sequence motifs of key TFs in that cell type and more than 90% of hotspots were occupied by pioneering factors. Though we did not find any sequence signature in the three groups, the H3K4me1 binding profile had bimodal peaks at hotspots, distinguishing hotspots from mono-modal H3K4me1 singletons. In ES cells, differentially expressed genes after perturbation of activators were enriched for hotspot genes, suggesting hotspots primarily act as transcriptional activator hubs. Finally, we proposed that ES hotspots might be under control of SetDB1 and not DNMT for silencing. CONCLUSION: Transcriptional hotspots are enriched for tissue specific enhancers near cell type specific highly expressed genes. In ES cells, they are predicted to act as transcriptional activator hubs and might be under SetDB1 control for silencing.
ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0412-0) contains supplementary material, which is available to authorized users
Nucleosome mediated crosstalk between transcription factors at eukaryotic enhancers
A recent study of transcription regulation in Drosophila embryonic development revealed a complex non-monotonic dependence of gene expression on the distance between binding sites of repressor and activator proteins at the corresponding enhancer cis-regulatory modules (Fakhouri et al 2010 Mol. Syst. Biol. 6 341). The repressor efficiency was high at small separations, low around 30 bp, reached a maximum at 50-60 bp, and decreased at larger distances to the activator binding sites. Here, we propose a straightforward explanation for the distance dependence of repressor activity by considering the effect of the presence of a nucleosome. Using a method that considers partial unwrapping of nucleosomal DNA from the histone octamer core, we calculated the dependence of activator binding on the repressor-activator distance and found a quantitative agreement with the distance dependence reported for the Drosophila enhancer element. In addition, the proposed model offers explanations for other distance-dependent effects at eukaryotic enhancers. © 2011 IOP Publishing Ltd
Structure-driven homology pairing of chromatin fibers: the role of electrostatics and protein-induced bridging
Chromatin domains formed in vivo are characterized by different types of 3D organization of interconnected nucleosomes and architectural proteins. Here, we quantitatively test a hypothesis that the similarities in the structure of chromatin fibers (which we call “structural homology”) can affect their mutual electrostatic and protein-mediated bridging interactions. For example, highly repetitive DNA sequences in heterochromatic regions can position nucleosomes so that preferred inter-nucleosomal distances are preserved on the surfaces of neighboring fibers. On the contrary, the segments of chromatin fiber formed on unrelated DNA sequences have different geometrical parameters and lack structural complementarity pivotal for stable association and cohesion. Furthermore, specific functional elements such as insulator regions, transcription start and termination sites, and replication origins are characterized by strong nucleosome ordering that might induce structure-driven iterations of chromatin fibers. We propose that shape-specific protein-bridging interactions facilitate long-range pairing of chromatin fragments, while for closely-juxtaposed fibers electrostatic forces can in addition yield fine-tuned structure-specific recognition and pairing. These pairing effects can account for some features observed for mitotic and inter-phase chromatins
Electrostatic effect of H1-histone protein binding on nucleosome repeat length
Within a simple biophysical model we describe the effect of electrostatic binding of H1 histone proteins on the nucleosome repeat length in chromatin. The length of wrapped DNA optimizes its binding energy to the histone core and the elastic energy penalty of DNA wrapping. The magnitude of the effect predicted from our model is in agreement with the systematic experimental data on the linear variation of nucleosome repeat lengths with H1/nucleosome ratio (Woodcock C L et al 2006 Chromos. Res. 14 17-25). We compare our model to the data for different cell types and organisms, with a widely varying ratio of bound H1 histones per nucleosome. We underline the importance of this non-specific histone-DNA charge-balance mechanism in regulating the positioning of nucleosomes and the degree of compaction of chromatin fibers in eukaryotic cells. © 2014 IOP Publishing Ltd