97 research outputs found

    Stochastic EM-based TFBS motif discovery with MITSU

    Get PDF
    Motivation: The Expectation–Maximization (EM) algorithm has been successfully applied to the problem of transcription factor binding site (TFBS) motif discovery and underlies the most widely used motif discovery algorithms. In the wider field of probabilistic modelling, the stochastic EM (sEM) algorithm has been used to overcome some of the limitations of the EM algorithm; however, the application of sEM to motif discovery has not been fully explored. Results: We present MITSU (Motif discovery by ITerative Sampling and Updating), a novel algorithm for motif discovery, which combines sEM with an improved approximation to the likelihood function, which is unconstrained with regard to the distribution of motif occurrences within the input dataset. The algorithm is evaluated quantitatively on realistic synthetic data and several collections of characterized prokaryotic TFBS motifs and shown to outperform EM and an alternative sEM-based algorithm, particularly in terms of site-level positive predictive value. Availability and implementation: Java executable available for download at http://www.sourceforge.net/p/mitsu-motif/, supported on Linux/OS X. Contact: [email protected]

    Gene Properties and Chromatin State Influence the Accumulation of Transposable Elements in Genes

    Get PDF
    Transposable elements (TEs) are mobile DNA sequences found in the genomes of almost all species. By measuring the normalized coverage of TE sequences within genes, we identified sets of genes with conserved extremes of high/low TE density in the genomes of human, mouse and cow and denoted them as ‘shared upper/lower outliers (SUOs/SLOs)’. By comparing these outlier genes to the genomic background, we show that a large proportion of SUOs are involved in metabolic pathways and tend to be mammal-specific, whereas many SLOs are related to developmental processes and have more ancient origins. Furthermore, the proportions of different types of TEs within human and mouse orthologous SUOs showed high similarity, even though most detectable TEs in these two genomes inserted after their divergence. Interestingly, our computational analysis of polymerase-II (Pol-II) occupancy at gene promoters in different mouse tissues showed that 60% of tissue-specific SUOs show strong Pol-II binding only in embryonic stem cells (ESCs), a proportion significantly higher than the genomic background (37%). In addition, our analysis of histone marks such as H3K4me3 and H3K27me3 in mouse ESCs also suggest a strong association between TE-rich genes and open-chromatin at promoters. Finally, two independent whole-transcriptome datasets show a positive association between TE density and gene expression level in ESCs. While this study focuses on genes with extreme TE densities, the above results clearly show that the probability of TE accumulation/fixation in mammalian genes is not random and is likely associated with different factors/gene properties and, most importantly, an association between the TE insertion/fixation rate and gene activity status in ES cells

    Lineage-specific dynamic and pre-established enhancer–promoter contacts cooperate in terminal differentiation

    Get PDF
    Chromosome conformation is an important feature of metazoan gene regulation; however, enhancer–promoter contact remodeling during cellular differentiation remains poorly understood. To address this, genome-wide promoter capture Hi-C (CHi-C) was performed during epidermal differentiation. Two classes of enhancer–promoter contacts associated with differentiation-induced genes were identified. The first class ('gained') increased in contact strength during differentiation in concert with enhancer acquisition of the H3K27ac activation mark. The second class ('stable') were pre-established in undifferentiated cells, with enhancers constitutively marked by H3K27ac. The stable class was associated with the canonical conformation regulator cohesin, whereas the gained class was not, implying distinct mechanisms of contact formation and regulation. Analysis of stable enhancers identified a new, essential role for a constitutively expressed, lineage-restricted ETS-family transcription factor, EHF, in epidermal differentiation. Furthermore, neither class of contacts was observed in pluripotent cells, suggesting that lineage-specific chromatin structure is established in tissue progenitor cells and is further remodeled in terminal differentiation

    Chromosome contacts in activated T cells identify autoimmune disease candidate genes

    Get PDF
    BACKGROUND: Autoimmune disease-associated variants are preferentially found in regulatory regions in immune cells, particularly CD4+ T cells. Linking such regulatory regions to gene promoters in disease-relevant cell contexts facilitates identification of candidate disease genes. RESULTS: Within four hours, activation of CD4+ T cells invokes changes in histone modifications and enhancer RNA transcription that correspond to altered expression of the interacting genes identified by promoter capture Hi-C (PCHi-C). By integrating PCHi-C data with genetic associations for five autoimmune diseases we prioritised 245 candidate genes with a median distance from peak signal to prioritised gene of 153 kb. Just under half (108/245) prioritised genes related to activation-sensitive interactions. This included IL2RA, where allele-specific expression analyses were consistent with its interaction-mediated regulation, illustrating the utility of the approach. CONCLUSIONS: Our systematic experimental framework offers an alternative approach to candidate causal gene identification for variants with cell state-specific functional effects, with achievable sample sizes.This work was funded by the JDRF (9-2011-253), the Wellcome Trust (089989, 091157, 107881), the UK Medical Research Council (MR/L007150/1, MC_UP_1302/5), the UK Biotechnology and Biological Sciences Research Council (BB/J004480/1) and the National Institute for Health Research (NIHR) Cambridge Biomedical Research Centre. The research leading to these results has received funding from the European Union’s 7th Framework Programme (FP7/2007-2013) under grant agreement no. 241447 (NAIMIT). The Cambridge Institute for Medical Research (CIMR) is in receipt of a Wellcome Trust Strategic Award (100140)

    NF-Y Dependent Epigenetic Modifications Discriminate between Proliferating and Postmitotic Tissue

    Get PDF
    The regulation of gene transcription requires posttranslational modifications of histones that, in concert with chromatin remodeling factors, shape the structure of chromatin. It is currently under intense investigation how this structure is modulated, in particular in the context of proliferation and differentiation. Compelling evidence suggests that the transcription factor NF-Y acts as a master regulator of cell cycle progression, activating the transcription of many cell cycle regulatory genes. However, the underlying molecular mechanisms are not yet completely understood. Here we show that NF-Y exerts its effect on transcription through the modulation of the histone “code”. NF-Y colocalizes with nascent RNA, while RNA polymerase II is I phosphorylated on serine 2 of the YSPTSPS repeats within its carboxyterminal domain and histones are carrying modifications that represent activation signals of gene expression (H3K9ac and PAN-H4ac). Comparing postmitotic muscle tissue from normal mice and proliferating muscles from mdx mice, we demonstrate by chromatin immunoprecipitation (ChIP) that NF-Y DNA binding activity correlates with the accumulation of acetylated histones H3 and H4 on promoters of key cell cycle regulatory genes, and with their active transcription. Accordingly, p300 is recruited onto the chromatin of NF-Y target genes in a NF-Y-dependent manner, as demonstrated by Re-ChIP. Conversely, the loss of NF-Y binding correlates with a decrease of acetylated histones, the recruitment of HDAC1, and a repressed heterochromatic state with enrichment of histones carrying modifications known to mediate silencing of gene expression (H3K9me3, H3K27me2 and H4K20me3). As a consequence, NF-Y target genes are downregulated in this context. In conclusion, our data indicate a role of NF-Y in modulating the structure and transcriptional competence of chromatin in vivo and support a model in which NF-Y-dependent histone “code” changes contribute to the proper discrimination between proliferating and postmitotic cells in vivo and in vitro

    DNA Methyltransferase Controls Stem Cell Aging by Regulating BMI1 and EZH2 through MicroRNAs

    Get PDF
    Epigenetic regulation of gene expression is well known mechanism that regulates cellular senescence of cancer cells. Here we show that inhibition of DNA methyltransferases (DNMTs) with 5-azacytidine (5-AzaC) or with specific small interfering RNA (siRNA) against DNMT1 and 3b induced the cellular senescence of human umbilical cord blood-derived multipotent stem cells (hUCB-MSCs) and increased p16INK4A and p21CIP1/WAF1 expression. DNMT inhibition changed histone marks into the active forms and decreased the methylation of CpG islands in the p16INK4A and p21CIP1/WAF1 promoter regions. Enrichment of EZH2, the key factor that methylates histone H3 lysine 9 and 27 residues, was decreased on the p16INK4A and p21CIP1/WAF1 promoter regions. We found that DNMT inhibition decreased expression levels of Polycomb-group (PcG) proteins and increased expression of microRNAs (miRNAs), which target PcG proteins. Decreased CpG island methylation and increased levels of active histone marks at genomic regions encoding miRNAs were observed after 5-AzaC treatment. Taken together, DNMTs have a critical role in regulating the cellular senescence of hUCB-MSCs through controlling not only the DNA methylation status but also active/inactive histone marks at genomic regions of PcG-targeting miRNAs and p16INK4A and p21CIP1/WAF1 promoter regions

    MicroRNA-145 Regulates Human Corneal Epithelial Differentiation

    Get PDF
    Epigenetic factors, such as microRNAs, are important regulators in the self-renewal and differentiation of stem cells and progenies. Here we investigated the microRNAs expressed in human limbal-peripheral corneal (LPC) epithelia containing corneal epithelial progenitor cells (CEPCs) and early transit amplifying cells, and their role in corneal epithelium.Human LPC epithelia was extracted for small RNAs or dissociated for CEPC culture. By Agilent Human microRNA Microarray V2 platform and GeneSpring GX11.0 analysis, we found differential expression of 18 microRNAs against central corneal (CC) epithelia, which were devoid of CEPCs. Among them, miR-184 was up-regulated in CC epithelia, similar to reported finding. Cluster miR-143/145 was expressed strongly in LPC but weakly in CC epithelia (P = 0.0004, Mann-Whitney U-test). This was validated by quantitative polymerase chain reaction (qPCR). Locked nucleic acid-based in situ hybridization on corneal rim cryosections showed miR-143/145 presence localized to the parabasal cells of limbal epithelium but negligible in basal and superficial epithelia. With holoclone forming ability, CEPCs transfected with lentiviral plasmid containing mature miR-145 sequence gave rise to defective epithelium in organotypic culture and had increased cytokeratin-3/12 and connexin-43 expressions and decreased ABCG2 and p63 compared with cells transfected with scrambled sequences. Global gene expression was analyzed using Agilent Whole Human Genome Oligo Microarray and GeneSpring GX11.0. With a 5-fold difference compared to cells with scrambled sequences, miR-145 up-regulated 324 genes (containing genes for immune response) and down-regulated 277 genes (containing genes for epithelial development and stem cell maintenance). As validated by qPCR and luciferase reporter assay, our results showed miR-145 suppressed integrin β8 (ITGB8) expression in both human corneal epithelial cells and primary CEPCs.We found expression of miR-143/145 cluster in human corneal epithelium. Our results also showed that miR-145 regulated the corneal epithelium formation and maintenance of epithelial integrity, via ITGB8 targeting
    corecore