29 research outputs found

    Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells

    Get PDF
    Naive embryonic stem cells hold great promise for research and therapeutics as they have broad and robust developmental potential. While such cells are readily derived from mouse blastocysts it has not been possible to isolate human equivalents easily, although human naive-like cells have been artificially generated (rather than extracted) by coercion of human primed embryonic stem cells by modifying culture conditions or through transgenic modification. Here we show that a sub-population within cultures of human embryonic stem cells (hESCs) and induced pluripotent stem cells (hiPSCs) manifests key properties of naive state cells. These naive-like cells can be genetically tagged, and are associated with elevated transcription of HERVH, a primate-specific endogenous retrovirus. HERVH elements provide functional binding sites for a combination of naive pluripotency transcription factors, including LBP9, recently recognized as relevant to naivety in mice. LBP9-HERVH drives hESC-specific alternative and chimaeric transcripts, including pluripotency-modulating long non-coding RNAs. Disruption of LBP9, HERVH and HERVH-derived transcripts compromises self-renewal. These observations define HERVH expression as a hallmark of naive-like hESCs, and establish novel primate-specific transcriptional circuitry regulating pluripotency

    The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

    Get PDF
    X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X's gene content, gene expression, and evolution

    Correction to: RNA G-quadruplexes at upstream open reading frames cause DHX36- and DHX9-dependent translation of human mRNAs

    Get PDF
    Following publication of the original article [1], the authors reported the following error in the name of the fourth author

    SUPPLEMENTARY TABLES

    No full text
    Data in Table S3 underlies Figure 4. Data in Table S7 partially underlies Fig 1. Data in Tables S4 underlies Fig 3. Data in Tables S10-12 underlies Fig S1

    Data from: The constrained maximal expression level owing to haploidy shapes gene content on the mammalian X chromosome

    No full text
    X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X’s gene content, gene expression, and evolution

    A correlation between tissue-specific maximal expression (TSME) and binary enrichment on the X chromosome.

    No full text
    <p>This figure shows a scatterplot where each data point is a FANTOM5 library (points are colored-coded to highlight brain tissues, sex-specific tissues, and the placenta). <i>X</i>-axis corresponds to the average maximal expression of given tissue’s top 0.1% preferentially expressed genes (i.e., TSME, using a logarithmic scale). <i>Y</i>-axis corresponds to binary enrichment. The strength of the Spearman correlation and <i>p-</i>values are annotated with text above the figure panel. Data points that have standardized residuals more than 1.96 standard deviations (highlighted as grey area) from the linear regression line (which is plotted in black) have their names annotated with text. The underlying data can be found at <a href="http://fantom.gsc.riken.jp/5/data/" target="_blank">http://fantom.gsc.riken.jp/5/data/</a> and in Dryad Digital Repository (doi:<a href="http://dx.doi.org/10.5061/dryad.p4s57" target="_blank">10.5061/dryad.p4s57</a>).</p

    The comparison of change in gene expression (Z) since the human-Chimpanzee common ancestor for five somatic tissues.

    No full text
    <p>Genes are divided into X-linked (yellow) and autosomal (green). In turn, they are split into a half with low expression in the ancestor (<i>low</i>) and a half with high expression (<i>high</i>). Genes with no expression in the ancestor are excluded from this analysis (but this makes no qualitative difference). In all instances, the high-expression X-linked genes have a lower median Z-score than high-expression autosomal genes, this being significant in three instances using a Mann Whitney U test (shown as *). The combined <i>p-</i>value is highly significant (see main text). There is no consistent trend for the low-expression genes. The underlying data can be found at <a href="http://fantom.gsc.riken.jp/5/data/" target="_blank">http://fantom.gsc.riken.jp/5/data/</a> and in Dryad Digital Repository (doi:<a href="http://dx.doi.org/10.5061/dryad.p4s57" target="_blank">10.5061/dryad.p4s57</a>).</p

    The number of transcription factor binding sites per proximal promoter is higher on autosomes than on sex chromosomes.

    No full text
    <p>This figure consists of two parts identified as <i>a</i> and <i>b</i>. In part <i>a</i>, the average number of transcription factor binding sites per promoter in symmetrical windows around transcriptional start sites (TSSes) is shown. The plots have a characteristic shape of the V-sign. On the <i>x</i>-axis of panel <i>a</i>, values from negative 3 kbps to zero signify positions upstream TSSes (negative values signify positions downstream the TSS). In part <i>b</i>, V-sign-shaped curves are plotted separately for each chromosome (and the <i>x</i>-axis corresponds to the order of chromosomes from 1 to 22 plus the X and Y). The curves are similar between autosomes, but TfbsNo is lower for sex chromosomes. The underlying data can be found at <a href="http://fantom.gsc.riken.jp/5/data/" target="_blank">http://fantom.gsc.riken.jp/5/data/</a> and in Dryad Digital Repository (doi:<a href="http://dx.doi.org/10.5061/dryad.p4s57" target="_blank">10.5061/dryad.p4s57</a>).</p
    corecore