156 research outputs found

    CHiCP: a web-based tool for the integrative and interactive visualization of promoter capture Hi-C datasets.

    Get PDF
    UNLABELLED: Promoter capture Hi-C (PCHi-C) allows the genome-wide interrogation of physical interactions between distal DNA regulatory elements and gene promoters in multiple tissue contexts. Visual integration of the resultant chromosome interaction maps with other sources of genomic annotations can provide insight into underlying regulatory mechanisms. We have developed Capture HiC Plotter (CHiCP), a web-based tool that allows interactive exploration of PCHi-C interaction maps and integration with both public and user-defined genomic datasets. AVAILABILITY AND IMPLEMENTATION: CHiCP is freely accessible from www.chicp.org and supports most major HTML5 compliant web browsers. Full source code and installation instructions are available from http://github.com/D-I-L/django-chicp CONTACT: [email protected] is the published version. It first appeared at http://bioinformatics.oxfordjournals.org/content/early/2016/04/26/bioinformatics.btw173

    Stochastic EM-based TFBS motif discovery with MITSU

    Get PDF
    Motivation: The Expectation–Maximization (EM) algorithm has been successfully applied to the problem of transcription factor binding site (TFBS) motif discovery and underlies the most widely used motif discovery algorithms. In the wider field of probabilistic modelling, the stochastic EM (sEM) algorithm has been used to overcome some of the limitations of the EM algorithm; however, the application of sEM to motif discovery has not been fully explored. Results: We present MITSU (Motif discovery by ITerative Sampling and Updating), a novel algorithm for motif discovery, which combines sEM with an improved approximation to the likelihood function, which is unconstrained with regard to the distribution of motif occurrences within the input dataset. The algorithm is evaluated quantitatively on realistic synthetic data and several collections of characterized prokaryotic TFBS motifs and shown to outperform EM and an alternative sEM-based algorithm, particularly in terms of site-level positive predictive value. Availability and implementation: Java executable available for download at http://www.sourceforge.net/p/mitsu-motif/, supported on Linux/OS X. Contact: [email protected]

    Gene Properties and Chromatin State Influence the Accumulation of Transposable Elements in Genes

    Get PDF
    Transposable elements (TEs) are mobile DNA sequences found in the genomes of almost all species. By measuring the normalized coverage of TE sequences within genes, we identified sets of genes with conserved extremes of high/low TE density in the genomes of human, mouse and cow and denoted them as ‘shared upper/lower outliers (SUOs/SLOs)’. By comparing these outlier genes to the genomic background, we show that a large proportion of SUOs are involved in metabolic pathways and tend to be mammal-specific, whereas many SLOs are related to developmental processes and have more ancient origins. Furthermore, the proportions of different types of TEs within human and mouse orthologous SUOs showed high similarity, even though most detectable TEs in these two genomes inserted after their divergence. Interestingly, our computational analysis of polymerase-II (Pol-II) occupancy at gene promoters in different mouse tissues showed that 60% of tissue-specific SUOs show strong Pol-II binding only in embryonic stem cells (ESCs), a proportion significantly higher than the genomic background (37%). In addition, our analysis of histone marks such as H3K4me3 and H3K27me3 in mouse ESCs also suggest a strong association between TE-rich genes and open-chromatin at promoters. Finally, two independent whole-transcriptome datasets show a positive association between TE density and gene expression level in ESCs. While this study focuses on genes with extreme TE densities, the above results clearly show that the probability of TE accumulation/fixation in mammalian genes is not random and is likely associated with different factors/gene properties and, most importantly, an association between the TE insertion/fixation rate and gene activity status in ES cells

    Global Mapping of DNA Methylation in Mouse Promoters Reveals Epigenetic Reprogramming of Pluripotency Genes

    Get PDF
    DNA methylation patterns are reprogrammed in primordial germ cells and in preimplantation embryos by demethylation and subsequent de novo methylation. It has been suggested that epigenetic reprogramming may be necessary for the embryonic genome to return to a pluripotent state. We have carried out a genome-wide promoter analysis of DNA methylation in mouse embryonic stem (ES) cells, embryonic germ (EG) cells, sperm, trophoblast stem (TS) cells, and primary embryonic fibroblasts (pMEFs). Global clustering analysis shows that methylation patterns of ES cells, EG cells, and sperm are surprisingly similar, suggesting that while the sperm is a highly specialized cell type, its promoter epigenome is already largely reprogrammed and resembles a pluripotent state. Comparisons between pluripotent tissues and pMEFs reveal that a number of pluripotency related genes, including Nanog, Lefty1 and Tdgf1, as well as the nucleosome remodeller Smarcd1, are hypomethylated in stem cells and hypermethylated in differentiated cells. Differences in promoter methylation are associated with significant differences in transcription levels in more than 60% of genes analysed. Our comparative approach to promoter methylation thus identifies gene candidates for the regulation of pluripotency and epigenetic reprogramming. While the sperm genome is, overall, similarly methylated to that of ES and EG cells, there are some key exceptions, including Nanog and Lefty1, that are highly methylated in sperm. Nanog promoter methylation is erased by active and passive demethylation after fertilisation before expression commences in the morula. In ES cells the normally active Nanog promoter is silenced when targeted by de novo methylation. Our study suggests that reprogramming of promoter methylation is one of the key determinants of the epigenetic regulation of pluripotency genes. Epigenetic reprogramming in the germline prior to fertilisation and the reprogramming of key pluripotency genes in the early embryo is thus crucial for transmission of pluripotency

    A systematic, large-scale comparison of transcription factor binding site models

    Get PDF
    Background The modelling of gene regulation is a major challenge in biomedical research. This process is dominated by transcription factors (TFs) and mutations in their binding sites (TFBSs) may cause the misregulation of genes, eventually leading to disease. The consequences of DNA variants on TF binding are modelled in silico using binding matrices, but it remains unclear whether these are capable of accurately representing in vivo binding. In this study, we present a systematic comparison of binding models for 82 human TFs from three freely available sources: JASPAR matrices, HT-SELEX-generated models and matrices derived from protein binding microarrays (PBMs). We determined their ability to detect experimentally verified “real” in vivo TFBSs derived from ENCODE ChIP-seq data. As negative controls we chose random downstream exonic sequences, which are unlikely to harbour TFBS. All models were assessed by receiver operating characteristics (ROC) analysis. Results While the area- under-curve was low for most of the tested models with only 47 % reaching a score of 0.7 or higher, we noticed strong differences between the various position-specific scoring matrices with JASPAR and HT-SELEX models showing higher success rates than PBM-derived models. In addition, we found that while TFBS sequences showed a higher degree of conservation than randomly chosen sequences, there was a high variability between individual TFBSs. Conclusions Our results show that only few of the matrix-based models used to predict potential TFBS are able to reliably detect experimentally confirmed TFBS. We compiled our findings in a freely accessible web application called ePOSSUM (http:/mutationtaster.charite.de/ePOSSUM/) which uses a Bayes classifier to assess the impact of genetic alterations on TF binding in user-defined sequences. Additionally, ePOSSUM provides information on the reliability of the prediction using our test set of experimentally confirmed binding sites

    DNA Methyltransferase Controls Stem Cell Aging by Regulating BMI1 and EZH2 through MicroRNAs

    Get PDF
    Epigenetic regulation of gene expression is well known mechanism that regulates cellular senescence of cancer cells. Here we show that inhibition of DNA methyltransferases (DNMTs) with 5-azacytidine (5-AzaC) or with specific small interfering RNA (siRNA) against DNMT1 and 3b induced the cellular senescence of human umbilical cord blood-derived multipotent stem cells (hUCB-MSCs) and increased p16INK4A and p21CIP1/WAF1 expression. DNMT inhibition changed histone marks into the active forms and decreased the methylation of CpG islands in the p16INK4A and p21CIP1/WAF1 promoter regions. Enrichment of EZH2, the key factor that methylates histone H3 lysine 9 and 27 residues, was decreased on the p16INK4A and p21CIP1/WAF1 promoter regions. We found that DNMT inhibition decreased expression levels of Polycomb-group (PcG) proteins and increased expression of microRNAs (miRNAs), which target PcG proteins. Decreased CpG island methylation and increased levels of active histone marks at genomic regions encoding miRNAs were observed after 5-AzaC treatment. Taken together, DNMTs have a critical role in regulating the cellular senescence of hUCB-MSCs through controlling not only the DNA methylation status but also active/inactive histone marks at genomic regions of PcG-targeting miRNAs and p16INK4A and p21CIP1/WAF1 promoter regions

    Chromosome contacts in activated T cells identify autoimmune disease candidate genes

    Get PDF
    BACKGROUND: Autoimmune disease-associated variants are preferentially found in regulatory regions in immune cells, particularly CD4+ T cells. Linking such regulatory regions to gene promoters in disease-relevant cell contexts facilitates identification of candidate disease genes. RESULTS: Within four hours, activation of CD4+ T cells invokes changes in histone modifications and enhancer RNA transcription that correspond to altered expression of the interacting genes identified by promoter capture Hi-C (PCHi-C). By integrating PCHi-C data with genetic associations for five autoimmune diseases we prioritised 245 candidate genes with a median distance from peak signal to prioritised gene of 153 kb. Just under half (108/245) prioritised genes related to activation-sensitive interactions. This included IL2RA, where allele-specific expression analyses were consistent with its interaction-mediated regulation, illustrating the utility of the approach. CONCLUSIONS: Our systematic experimental framework offers an alternative approach to candidate causal gene identification for variants with cell state-specific functional effects, with achievable sample sizes.This work was funded by the JDRF (9-2011-253), the Wellcome Trust (089989, 091157, 107881), the UK Medical Research Council (MR/L007150/1, MC_UP_1302/5), the UK Biotechnology and Biological Sciences Research Council (BB/J004480/1) and the National Institute for Health Research (NIHR) Cambridge Biomedical Research Centre. The research leading to these results has received funding from the European Union’s 7th Framework Programme (FP7/2007-2013) under grant agreement no. 241447 (NAIMIT). The Cambridge Institute for Medical Research (CIMR) is in receipt of a Wellcome Trust Strategic Award (100140)

    MicroRNA-145 Regulates Human Corneal Epithelial Differentiation

    Get PDF
    Epigenetic factors, such as microRNAs, are important regulators in the self-renewal and differentiation of stem cells and progenies. Here we investigated the microRNAs expressed in human limbal-peripheral corneal (LPC) epithelia containing corneal epithelial progenitor cells (CEPCs) and early transit amplifying cells, and their role in corneal epithelium.Human LPC epithelia was extracted for small RNAs or dissociated for CEPC culture. By Agilent Human microRNA Microarray V2 platform and GeneSpring GX11.0 analysis, we found differential expression of 18 microRNAs against central corneal (CC) epithelia, which were devoid of CEPCs. Among them, miR-184 was up-regulated in CC epithelia, similar to reported finding. Cluster miR-143/145 was expressed strongly in LPC but weakly in CC epithelia (P = 0.0004, Mann-Whitney U-test). This was validated by quantitative polymerase chain reaction (qPCR). Locked nucleic acid-based in situ hybridization on corneal rim cryosections showed miR-143/145 presence localized to the parabasal cells of limbal epithelium but negligible in basal and superficial epithelia. With holoclone forming ability, CEPCs transfected with lentiviral plasmid containing mature miR-145 sequence gave rise to defective epithelium in organotypic culture and had increased cytokeratin-3/12 and connexin-43 expressions and decreased ABCG2 and p63 compared with cells transfected with scrambled sequences. Global gene expression was analyzed using Agilent Whole Human Genome Oligo Microarray and GeneSpring GX11.0. With a 5-fold difference compared to cells with scrambled sequences, miR-145 up-regulated 324 genes (containing genes for immune response) and down-regulated 277 genes (containing genes for epithelial development and stem cell maintenance). As validated by qPCR and luciferase reporter assay, our results showed miR-145 suppressed integrin β8 (ITGB8) expression in both human corneal epithelial cells and primary CEPCs.We found expression of miR-143/145 cluster in human corneal epithelium. Our results also showed that miR-145 regulated the corneal epithelium formation and maintenance of epithelial integrity, via ITGB8 targeting
    corecore