13,580 research outputs found
Sequence-based Multiscale Model (SeqMM) for High-throughput chromosome conformation capture (Hi-C) data analysis
In this paper, I introduce a Sequence-based Multiscale Model (SeqMM) for the
biomolecular data analysis. With the combination of spectral graph method, I
reveal the essential difference between the global scale models and local scale
ones in structure clustering, i.e., different optimization on Euclidean (or
spatial) distances and sequential (or genomic) distances. More specifically,
clusters from global scale models optimize Euclidean distance relations. Local
scale models, on the other hand, result in clusters that optimize the genomic
distance relations. For a biomolecular data, Euclidean distances and sequential
distances are two independent variables, which can never be optimized
simultaneously in data clustering. However, sequence scale in my SeqMM can work
as a tuning parameter that balances these two variables and deliver different
clusterings based on my purposes. Further, my SeqMM is used to explore the
hierarchical structures of chromosomes. I find that in global scale, the
Fiedler vector from my SeqMM bears a great similarity with the principal vector
from principal component analysis, and can be used to study genomic
compartments. In TAD analysis, I find that TADs evaluated from different scales
are not consistent and vary a lot. Particularly when the sequence scale is
small, the calculated TAD boundaries are dramatically different. Even for
regions with high contact frequencies, TAD regions show no obvious consistence.
However, when the scale value increases further, although TADs are still quite
different, TAD boundaries in these high contact frequency regions become more
and more consistent. Finally, I find that for a fixed local scale, my method
can deliver very robust TAD boundaries in different cluster numbers.Comment: 22 PAGES, 13 FIGURE
Pericentromeric heterochromatin is hierarchically organized and spatially contacts H3K9me2 islands in euchromatin.
Membraneless pericentromeric heterochromatin (PCH) domains play vital roles in chromosome dynamics and genome stability. However, our current understanding of 3D genome organization does not include PCH domains because of technical challenges associated with repetitive sequences enriched in PCH genomic regions. We investigated the 3D architecture of Drosophila melanogaster PCH domains and their spatial associations with the euchromatic genome by developing a novel analysis method that incorporates genome-wide Hi-C reads originating from PCH DNA. Combined with cytogenetic analysis, we reveal a hierarchical organization of the PCH domains into distinct territories. Strikingly, H3K9me2-enriched regions embedded in the euchromatic genome show prevalent 3D interactions with the PCH domain. These spatial contacts require H3K9me2 enrichment, are likely mediated by liquid-liquid phase separation, and may influence organismal fitness. Our findings have important implications for how PCH architecture influences the function and evolution of both repetitive heterochromatin and the gene-rich euchromatin
Stage-specific histone modification profiles reveal global transitions in the Xenopus embryonic epigenome.
Vertebrate embryos are derived from a transitory pool of pluripotent cells. By the process of embryonic induction, these precursor cells are assigned to specific fates and differentiation programs. Histone post-translational modifications are thought to play a key role in the establishment and maintenance of stable gene expression patterns underlying these processes. While on gene level histone modifications are known to change during differentiation, very little is known about the quantitative fluctuations in bulk histone modifications during development. To investigate this issue we analysed histones isolated from four different developmental stages of Xenopus laevis by mass spectrometry. In toto, we quantified 59 modification states on core histones H3 and H4 from blastula to tadpole stages. During this developmental period, we observed in general an increase in the unmodified states, and a shift from histone modifications associated with transcriptional activity to transcriptionally repressive histone marks. We also compared these naturally occurring patterns with the histone modifications of murine ES cells, detecting large differences in the methylation patterns of histone H3 lysines 27 and 36 between pluripotent ES cells and pluripotent cells from Xenopus blastulae. By combining all detected modification transitions we could cluster their patterns according to their embryonic origin, defining specific histone modification profiles (HMPs) for each developmental stage. To our knowledge, this data set represents the first compendium of covalent histone modifications and their quantitative flux during normogenesis in a vertebrate model organism. The HMPs indicate a stepwise maturation of the embryonic epigenome, which may be causal to the progressing restriction of cellular potency during development
Recommended from our members
EpiAlign: an alignment-based bioinformatic tool for comparing chromatin state sequences.
The availability of genome-wide epigenomic datasets enables in-depth studies of epigenetic modifications and their relationships with chromatin structures and gene expression. Various alignment tools have been developed to align nucleotide or protein sequences in order to identify structurally similar regions. However, there are currently no alignment methods specifically designed for comparing multi-track epigenomic signals and detecting common patterns that may explain functional or evolutionary similarities. We propose a new local alignment algorithm, EpiAlign, designed to compare chromatin state sequences learned from multi-track epigenomic signals and to identify locally aligned chromatin regions. EpiAlign is a dynamic programming algorithm that novelly incorporates varying lengths and frequencies of chromatin states. We demonstrate the efficacy of EpiAlign through extensive simulations and studies on the real data from the NIH Roadmap Epigenomics project. EpiAlign is able to extract recurrent chromatin state patterns along a single epigenome, and many of these patterns carry cell-type-specific characteristics. EpiAlign can also detect common chromatin state patterns across multiple epigenomes, and it will serve as a useful tool to group and distinguish epigenomic samples based on genome-wide or local chromatin state patterns
Recommended from our members
Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells.
Chromatin architecture has been implicated in cell type-specific gene regulatory programs, yet how chromatin remodels during development remains to be fully elucidated. Here, by interrogating chromatin reorganization during human pluripotent stem cell (hPSC) differentiation, we discover a role for the primate-specific endogenous retrotransposon human endogenous retrovirus subfamily H (HERV-H) in creating topologically associating domains (TADs) in hPSCs. Deleting these HERV-H elements eliminates their corresponding TAD boundaries and reduces the transcription of upstream genes, while de novo insertion of HERV-H elements can introduce new TAD boundaries. The ability of HERV-H to create TAD boundaries depends on high transcription, as transcriptional repression of HERV-H elements prevents the formation of boundaries. This ability is not limited to hPSCs, as these actively transcribed HERV-H elements and their corresponding TAD boundaries also appear in pluripotent stem cells from other hominids but not in more distantly related species lacking HERV-H elements. Overall, our results provide direct evidence for retrotransposons in actively shaping cell type- and species-specific chromatin architecture
ModuLand plug-in for Cytoscape: determination of hierarchical layers of overlapping network modules and community centrality
Summary: The ModuLand plug-in provides Cytoscape users an algorithm for
determining extensively overlapping network modules. Moreover, it identifies
several hierarchical layers of modules, where meta-nodes of the higher
hierarchical layer represent modules of the lower layer. The tool assigns
module cores, which predict the function of the whole module, and determines
key nodes bridging two or multiple modules. The plug-in has a detailed
JAVA-based graphical interface with various colouring options. The ModuLand
tool can run on Windows, Linux, or Mac OS. We demonstrate its use on protein
structure and metabolic networks. Availability: The plug-in and its user guide
can be downloaded freely from: http://www.linkgroup.hu/modules.php. Contact:
[email protected] Supplementary information: Supplementary
information is available at Bioinformatics online.Comment: 39 pages, 1 figure and a Supplement with 9 figures and 10 table
Control of VEGF-A transcriptional programs by pausing and genomic compartmentalization.
Vascular endothelial growth factor A (VEGF-A) is a master regulator of angiogenesis, vascular development and function. In this study we investigated the transcriptional regulation of VEGF-A-responsive genes in primary human aortic endothelial cells (HAECs) and human umbilical vein endothelial cells (HUVECs) using genome-wide global run-on sequencing (GRO-Seq). We demonstrate that half of VEGF-A-regulated gene promoters are characterized by a transcriptionally competent paused RNA polymerase II (Pol II). We show that transition into productive elongation is a major mechanism of gene activation of virtually all VEGF-regulated genes, whereas only ∼40% of the genes are induced at the level of initiation. In addition, we report a comprehensive chromatin interaction map generated in HUVECs using tethered conformation capture (TCC) and characterize chromatin interactions in relation to transcriptional activity. We demonstrate that sites of active transcription are more likely to engage in chromatin looping and cell type-specific transcriptional activity reflects the boundaries of chromatin interactions. Furthermore, we identify large chromatin compartments with a tendency to be coordinately transcribed upon VEGF-A stimulation. We provide evidence that these compartments are enriched for clusters of regulatory regions such as super-enhancers and for disease-associated single nucleotide polymorphisms (SNPs). Collectively, these findings provide new insights into mechanisms behind VEGF-A-regulated transcriptional programs in endothelial cells
- …