8 research outputs found
Nuclear transcriptome profiling of induced pluripotent stem cells and embryonic stem cells identify non-coding loci resistant to reprogramming
<p>Identification of functionally relevant differences between induced pluripotent stem cells (iPSC) and reference embryonic stem cells (ESC) remains a central question for therapeutic applications. Differences in gene expression between iPSC and ESC have been examined by microarray and more recently with RNA-SEQ technologies. We here report an in depth analyses of nuclear and cytoplasmic transcriptomes, using the CAGE (cap analysis of gene expression) technology, for 5 iPSC clones derived from mouse lymphocytes B and 3 ESC lines. This approach reveals nuclear transcriptomes significantly more complex in ESC than in iPSC. Hundreds of yet not annotated putative non-coding RNAs and enhancer-associated transcripts specifically transcribed in ESC have been detected and supported with epigenetic and chromatin-chromatin interactions data. We identified super-enhancers transcriptionally active specifically in ESC and associated with genes implicated in the maintenance of pluripotency. Similarly, we detected non-coding transcripts of yet unknown function being regulated by ESC specific super-enhancers. Taken together, these results demonstrate that current protocols of iPSC reprogramming do not trigger activation of numerous <i>cis</i>-regulatory regions. It thus reinforces the need for already suggested deeper monitoring of the non-coding transcriptome when characterizing iPSC clones. Such differences in regulatory transcript expression may indeed impact their potential for clinical applications.</p
List of Sim2 target genes identified using the ChIA-PET data.
<p>List of Sim2 target genes identified using the ChIA-PET data.</p
Identification and characterization of the SIM2 DNA binding sites by ChIP-seq.
<p><b>a.</b> Venn diagram of the number of SIM2 binding sites identified by ChIP-seq in each SIM2 clones (A6, B8, C4) and EB3 line. The sum of the bold numbers is equal to the 1229 SIM2 DNA binding sites found in at least 2 SIM2 clones. <b>b.</b> The pie chart shows the genomic distribution of these 1229 sites. <b>c.</b> Distribution of the distances between the SIM2 DNA binding sites and the closest transcription start site (TSS). <b>d.</b> Selection of gene ontology terms significantly over-represented in the list of genes associated to a SIM2 DNA binding site.</p
Cellular model.
<p><b>a.</b> Schematic representation of the inducible ROSA-TET system. The mES SIM2 clones A6, B8 and C4 contain a Flag-tagged version of the mouse <i>Sim2</i> gene under the control of a modified human CMV promoter (hCMV*-1). In presence of tetracycline in the culture media (+Tet), the tetracycline-regulatable transactivator (tTA) is trapped and cannot bind the hCMV*-1 promoter. Upon removal of tetracycline (-Tet), the tTA binds the hCMV*-1 promoter inducing the expression of the <i>Sim2</i>-Flag-IRES-Venus construct. The mES EB3 parental line does not contain the <i>Sim2</i>-Flag transgene. The puromycin-resistant (Puro<sup>R</sup>) and hygromycin-resistant (Hygro<sup>R</sup>) cassettes are used for the clone selection process. SA: Splice Acceptor; pA: poly-adenylation site; IRES: Internal Ribosome Entry Site; orange triangles represent loxP sites. Modified from [<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0126475#pone.0126475.ref025" target="_blank">25</a>] <b>b.</b> Fluorescence-activated cell sorting (FACS) analysis of <i>Sim2</i> expressing and non-expressing clones. Cells were grown in the presence (+Tet) or absence (-Tet) of Tetracycline during 26 hours. The y-axis represents the number of cells and the x-axis the fluorescence intensity. <b>c and d.</b> Agarose gel electrophoresis results of reverse-transcription PCR assay (RT). Total RNA from +Tet cells was reverse transcribed and amplified using primers specific to <i>Sim2</i> (c) or <i>Arnt</i> (d) in presence (RT+) or absence (RT-) of reverse transcriptase. L: loading marker; H<sub>2</sub>O: PCR negative control.</p
The Mediator complex colocalizes with the SIM2 DNA binding sites.
<p>Frequency distribution of MED1 (<b>a</b>) and MED12 (<b>b</b>) DNA binding sites in a 40kb window centered to the SIM2 peaks. Pie charts show the proportion of SIM2 DNA binding sites overlapping MED1 or MED12 DNA binding sites (in grey) (100bp window). p = Fisher’s exact test p-value; F score: measure of the significance of the association (1 = perfect match).</p
Motifs enriched in Sim2 DNA binding sites.
<p>Motifs enriched in Sim2 DNA binding sites.</p
mRNA-sequencing and ChIA-PET analyses.
<p><i>Sim2</i> (<b>a</b>) and <i>Arnt</i> (<b>b</b>) mRNA levels (RPKM) in A6, B8, C4 SIM2 clones and three EB3 replicates. <b>c.</b> Comparison of the gene expression level (mean log2 RPKM between the 3 replicates) between <i>Sim2</i> expressing cells (y-axis) and EB3 control cells (x-axis). Each blue dot represents a gene; differentially expressed genes (EdgeR FDR<0.05) are shown in red. The diagonal line represents the expected distribution of genes equally expressed between <i>Sim2</i> expressing and non-expressing cells. <b>d.</b> Enrichment of Sim2 targets among the genes upregulated in <i>Sim2</i> expressing clones as revealed by the GSEA analysis. Genes were sorted according to their expression fold change between <i>Sim2</i> expressing and non-expressing cells (x-axis, 0 showing the most upregulated gene). Black vertical bars show the position of the SIM2 targets in the ranked list. The enrichment score (ES in green) significantly deviates from zero at the beginning of the distribution showing that the SIM2 targets are not randomly distributed in the ranked list but enriched among the most upregulated genes. p: FDR corrected p-value <b>e.</b> ChIA-PET interactions occurring between SIM2 DNA binding sites and promoters of genes differentially expressed in <i>Sim2</i> expressing cells compared to EB3 cells (FDR<0.05). Blue lines show inter-chromosomal interactions and red lines intra-chromosomal interactions.</p
Overlapping SIM2 occupancy with master transcription factor binding sites.
<p><b>a.</b> Frequency distribution of OCT4, SOX2 and NANOG DNA binding sites in a 40kb window centered to the newly identified SIM2 DNA binding sites. Plots show a significant enrichment for the OSN binding sites at the SIM2 peak localization in SIM2 A6 expressing cells. Pie charts show the proportion of SIM2 DNA binding sites overlapping with the OCT4, SOX2 or NANOG binding sites (in grey) (100bp window). p = Fisher’s exact test p-value; F score: measure of the significance of the association (1 = perfect match). <b>b.</b> Protein co-immunoprecipitation experiments of SIM2-FLAG with endogenous OCT4, SOX2, KLF4 (left panel) and NANOG (right panel). Cellular protein extracts from <i>Sim2</i> expressing cells (A6) or EB3 cells were immunoprecipitated by using antibodies directed against each of the pluripotency factors (N-terminal and C-terminal part of NANOG) or IgG as a negative control for co-immunoprecipitation. Associated proteins were immunoblotted using an anti-FLAG antibody. Red star shows the SIM2-FLAG protein, blue star the signal given by the recognition of the IgG heavy chains. Ø: Beads only; kDa: kilodaltons; protein lysat: protein lysat was loaded as an input control for the immunoblot.</p