
    Clustering of genes into regulons using integrated modeling-COGRIM

    We present a Bayesian hierarchical model and Gibbs sampling implementation that integrates gene expression, ChIP binding, and transcription factor motif data in a principled and robust fashion. COGRIM was applied to both unicellular and mammalian organisms under different scenarios of available data. In these applications, we demonstrate the ability to predict gene-transcription factor interactions with reduced numbers of false-positive findings and to make predictions beyond what is obtained when single types of data are considered.

    Histone crosstalk directed by H2B ubiquitination is required for chromatin boundary integrity

    Genomic maps of chromatin modifications have provided evidence for the partitioning of genomes into domains of distinct chromatin states, which assist coordinated gene regulation. The maintenance of chromatin domain integrity can require the setting of boundaries. The HS4 insulator element marks the 3′ boundary of a heterochromatin region located upstream of the chicken β-globin gene cluster. Here we show that HS4 recruits the E3 ligase RNF20/BRE1A to mediate H2B mono-ubiquitination (H2Bub1) at this insulator. Knockdown experiments show that RNF20 is required for H2Bub1 and processive H3K4 methylation. Depletion of RNF20 results in a collapse of the active histone modification signature at the HS4 chromatin boundary, where H2Bub1, H3K4 methylation, and hyperacetylation of H3, H4, and H2A.Z are rapidly lost. A remarkably similar set of events occurs at the HSA/HSB regulatory elements of the FOLR1 gene, which mark the 5′ boundary of the same heterochromatin region. We find that persistent H2Bub1 at the HSA/HSB and HS4 elements is required for chromatin boundary integrity. The loss of boundary function leads to the sequential spreading of H3K9me2, H3K9me3, and H4K20me3 over the entire 50 kb FOLR1 and β-globin region and silencing of FOLR1 expression. These findings show that the HSA/HSB and HS4 boundary elements direct a cascade of active histone modifications that defend the FOLR1 and β-globin gene loci from the pervasive encroachment of an adjacent heterochromatin domain. We propose that many gene loci employ H2Bub1-dependent boundaries to prevent heterochromatin spreading.

    Chromatin recruitment of activated AMPK drives fasting response genes co-controlled by GR and PPARα

    Adaptation to fasting involves both Glucocorticoid Receptor (GRα) and Peroxisome Proliferator-Activated Receptor α (PPARα) activation. Given that both receptors can physically interact, we investigated the possibility of a genome-wide cross-talk between activated GR and PPARα, using ChIP- and RNA-seq in primary hepatocytes. Our data reveal extensive chromatin co-localization of both factors with cooperative induction of genes controlling lipid/glucose metabolism. Key GR/PPAR co-controlled genes switched from transcriptional antagonism to cooperativity when moving from short to prolonged hepatocyte fasting, a phenomenon coinciding with gene promoter recruitment of phosphorylated AMP-activated protein kinase (AMPK) and blocked by its pharmacological inhibition. In vitro interaction studies support trimeric complex formation between GR, PPARα and phospho-AMPK. Long-term fasting in mice showed enhanced phosphorylation of liver AMPK and GRα Ser211. Phospho-AMPK chromatin recruitment at liver target genes, observed upon prolonged fasting in mice, is dampened by refeeding. Taken together, our results identify phospho-AMPK as a molecular switch able to cooperate with nuclear receptors at the chromatin level and reveal a novel adaptation mechanism to prolonged fasting.

    Next-generation sequencing: applications beyond genomes.

    The development of DNA sequencing more than 30 years ago has profoundly impacted biological research. In the last couple of years, remarkable technological innovations have emerged that allow the direct and cost-effective sequencing of complex samples at unprecedented scale and speed. These next-generation technologies make it feasible to sequence not only static genomes, but also entire transcriptomes expressed under different conditions. These and other powerful applications of next-generation sequencing are rapidly revolutionizing the way genomic studies are carried out. Below, we provide a snapshot of these exciting new approaches to understanding the properties and functions of genomes. Given that sequencing-based assays may increasingly supersede microarray-based assays, we also compare and contrast data obtained from these distinct approaches.

    chroGPS, a global chromatin positioning system for the functional analysis and visualization of the epigenome

    Development of tools to jointly visualize the genome and the epigenome remains a challenge. chroGPS is a computational approach that addresses this challenge. chroGPS uses multidimensional scaling techniques to represent similarity between epigenetic factors, or between genetic elements on the basis of their epigenetic state, in 2D/3D reference maps. We emphasize biological interpretability, statistical robustness, integration of genetic and epigenetic data from heterogeneous sources, and computational feasibility. Although chroGPS is a general methodology to create reference maps and study the epigenetic state of any class of genetic element or genomic region, we focus on two specific kinds of maps: chroGPSfactors, which visualizes functional similarities between epigenetic factors, and chroGPSgenes, which describes the epigenetic state of genes and integrates gene expression and other functional data. We use data from the modENCODE project on the genomic distribution of a large collection of epigenetic factors in Drosophila, a model system extensively used to study genome organization and function. Our results show that the maps allow straightforward visualization of relationships between factors and elements, capturing relevant information about their functional properties that helps to interpret epigenetic information in a functional context and derive testable hypotheses.

    Bayesian correlated clustering to integrate multiple datasets

    Motivation: The integration of multiple datasets remains a key challenge in systems biology and genomic medicine. Modern high-throughput technologies generate a broad array of different data types, providing distinct – but often complementary – information. We present a Bayesian method for the unsupervised integrative modelling of multiple datasets, which we refer to as MDI (Multiple Dataset Integration). MDI can integrate information from a wide range of different datasets and data types simultaneously (including the ability to model time series data explicitly using Gaussian processes). Each dataset is modelled using a Dirichlet-multinomial allocation (DMA) mixture model, with dependencies between these models captured via parameters that describe the agreement among the datasets. Results: Using a set of 6 artificially constructed time series datasets, we show that MDI is able to integrate a significant number of datasets simultaneously, and that it successfully captures the underlying structural similarity between the datasets. We also analyse a variety of real S. cerevisiae datasets. In the 2-dataset case, we show that MDI’s performance is comparable to the present state of the art. We then move beyond the capabilities of current approaches and integrate gene expression, ChIP-chip and protein-protein interaction data, to identify a set of protein complexes for which genes are co-regulated during the cell cycle. Comparisons to other unsupervised data integration techniques – as well as to non-integrative approaches – demonstrate that MDI is very competitive, while also providing information that would be difficult or impossible to extract using other methods.

    Spatio-temporal expression patterns of Arabidopsis thaliana and Medicago truncatula defensin-like genes

    Plant genomes contain several hundred defensin-like (DEFL) genes that encode short cysteine-rich proteins resembling defensins, which are well-known antimicrobial polypeptides. Little is known about the expression patterns or functions of many DEFLs because most were discovered recently and hence are not well represented on standard microarrays. We designed a custom Affymetrix chip consisting of probe sets for 317 and 684 DEFLs from Arabidopsis thaliana and Medicago truncatula, respectively, to catalog DEFL expression in a variety of plant organs at different developmental stages and during symbiotic and pathogenic associations. The microarray analysis provided evidence for the transcription of 71% and 90% of the DEFLs identified in Arabidopsis and Medicago, respectively, including many of the recently annotated DEFL genes that previously lacked expression information. Both model plants contain a subset of DEFLs specifically expressed in seeds or fruits. A few DEFLs, including some plant defensins, were significantly up-regulated in Arabidopsis leaves inoculated with Alternaria brassicicola or Pseudomonas syringae pathogens. Among these, some were dependent on jasmonic acid signaling or were associated with specific types of immune responses. There were notable differences in DEFL gene expression patterns between Arabidopsis and Medicago, as the majority of Arabidopsis DEFLs were expressed in inflorescences, while only a few exhibited root-enhanced expression. By contrast, Medicago DEFLs were most prominently expressed in nitrogen-fixing root nodules. Thus, our data document salient differences in DEFL temporal and spatial expression between Arabidopsis and Medicago, suggesting distinct signaling routes and distinct roles for these proteins in the two plant species.