104 research outputs found

    The Effect of Transposable Element Insertions on Gene Expression Evolution in Rodents

    Get PDF
    Background:Many genomes contain a substantial number of transposable elements (TEs), a few of which are known to be involved in regulating gene expression. However, recent observations suggest that TEs may have played a very important role in the evolution of gene expression because many conserved non-genic sequences, some of which are know to be involved in gene regulation, resemble TEs. Results:Here we investigate whether new TE insertions affect gene expression profiles by testing whether gene expression divergence between mouse and rat is correlated to the numbers of new transposable elements inserted near genes. We show that expression divergence is significantly correlated to the number of new LTR and SINE elements, but not to the numbers of LINEs. We also show that expression divergence is not significantly correlated to the numbers of ancestral TEs in most cases, which suggests that the correlations between expression divergence and the numbers of new TEs are causal in nature. We quantify the effect and estimate that TE insertion has accounted for ~20% (95% confidence interval: 12% to 26%) of all expression profile divergence in rodents. Conclusions:We conclude that TE insertions may have had a major impact on the evolution of gene expression levels in rodents

    Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes?

    Get PDF
    BACKGROUND: Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. METHODOLOGY/PRINCIPAL FINDINGS: Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs) to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs) are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs). This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. CONCLUSIONS/SIGNIFICANCE: Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome

    Enrichment analysis of Alu elements with different spatial chromatin proximity in the human genome

    Get PDF
    Transposable elements (TEs) have no longer been totally considered as “junk DNA” for quite a time since the continual discoveries of their multifunctional roles in eukaryote genomes. As one of the most important and abundant TEs that still active in human genome, Alu, a SINE family, has demonstrated its indispensable regulatory functions at sequence level, but its spatial roles are still unclear. Technologies based on 3C(chromosomeconformation capture) have revealed the mysterious three-dimensional structure of chromatin, and make it possible to study the distal chromatin interaction in the genome. To find the role TE playing in distal regulation in human genome, we compiled the new released Hi-C data, TE annotation, histone marker annotations, and the genome-wide methylation data to operate correlation analysis, and found that the density of Alu elements showed a strong positive correlation with the level of chromatin interactions (hESC: r=0.9, P<2.2×1016; IMR90 fibroblasts: r = 0.94, P < 2.2 × 1016) and also have a significant positive correlation withsomeremote functional DNA elements like enhancers and promoters (Enhancer: hESC: r=0.997, P=2.3×10−4; IMR90: r=0.934, P=2×10−2; Promoter: hESC: r = 0.995, P = 3.8 × 10−4; IMR90: r = 0.996, P = 3.2 × 10−4). Further investigation involving GC content and methylation status showed the GC content of Alu covered sequences shared a similar pattern with that of the overall sequence, suggesting that Alu elements also function as the GC nucleotide and CpG site provider. In all, our results suggest that the Alu elements may act as an alternative parameter to evaluate the Hi-C data, which is confirmed by the correlation analysis of Alu elements and histone markers. Moreover, the GC-rich Alu sequence can bring high GC content and methylation flexibility to the regions with more distal chromatin contact, regulating the transcription of tissue-specific genes

    HIV-2 as a model to identify a functional HIV cure

    Get PDF
    Two HIV virus types exist: HIV-1 is pandemic and aggressive, whereas HIV-2 is confined mainly to West Africa and less pathogenic. Despite the fact that it has been almost 40 years since the discovery of AIDS, there is still no cure or vaccine against HIV. Consequently, the concepts of functional vaccines and cures that aim to limit HIV disease progression and spread by persistent control of viral replication without life-long treatment have been suggested as more feasible options to control the HIV pandemic. To identify virus-host mechanisms that could be targeted for functional cure development, researchers have focused on a small fraction of HIV-1 infected individuals that control their infection spontaneously, so-called elite controllers. However, these efforts have not been able to unravel the key mechanisms of the infection control. This is partly due to lack in statistical power since only 0.15% of HIV-1 infected individuals are natural elite controllers. The proportion of long-term viral control is larger in HIV-2 infection compared with HIV-1 infection. We therefore present the idea of using HIV-2 as a model for finding a functional cure against HIV. Understanding the key differences between HIV-1 and HIV-2 infections, and the cross-reactive effects in HIV-1/HIV-2 dual-infection could provide novel insights in developing functional HIV cures and vaccines

    The overmethylated genes in Helicobacter pylori-infected gastric mucosa are demethylated in gastric cancers

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The transitional-CpG sites between weakly methylated genes and densely methylated retroelements are overmethylated in the gastric mucosa infected with <it>Helicobacter pylori </it>(<it>H. pylori</it>) and they are undermethylated in the gastric cancers depending on the level of loss of heterozygosity (LOH) events. This study delineated the transitional-CpG methylation patterns of CpG-island-containing and -lacking genes in view of the retroelements.</p> <p>Methods</p> <p>The transitional-CpG sites of eight CpG-island-containing genes and six CpG-island-lacking genes were semi-quantitatively examined by performing radioisotope-labelling methylation-specific PCR under stringent conditions. The level of LOH in the gastric cancers was estimated using the 40 microsatellite markers on eight cancer-associated chromosomes. Each gene was scored as overmethylated or undermethylated based on an intermediate level of transitional-CpG methylation common in the <it>H. pylori</it>-negative gastric mucosa.</p> <p>Results</p> <p>The eight CpG-island genes examined were overmethylated depending on the proximity to the nearest retroelement in the <it>H. pylori</it>-positive gastric mucosa. The six CpG-island-lacking genes were similarly methylated in the <it>H. pylori</it>-positive and -negative gastric mucosa. In the gastric cancers, long transitional-CpG segments of the CpG-island genes distant from the retroelements remained overmethylated, whereas the overmethylation of short transitional-CpG segments close to the retroelements was not significant. Both the CpG-island-containing and -lacking genes tended to be decreasingly methylated in a LOH-level-dependent manner.</p> <p>Conclusions</p> <p>The overmethylated genes under the influence of retroelement methylation in the <it>H. pylori</it>-infected stomach are demethylated in the gastric cancers influenced by LOH.</p

    Gene Properties and Chromatin State Influence the Accumulation of Transposable Elements in Genes

    Get PDF
    Transposable elements (TEs) are mobile DNA sequences found in the genomes of almost all species. By measuring the normalized coverage of TE sequences within genes, we identified sets of genes with conserved extremes of high/low TE density in the genomes of human, mouse and cow and denoted them as ‘shared upper/lower outliers (SUOs/SLOs)’. By comparing these outlier genes to the genomic background, we show that a large proportion of SUOs are involved in metabolic pathways and tend to be mammal-specific, whereas many SLOs are related to developmental processes and have more ancient origins. Furthermore, the proportions of different types of TEs within human and mouse orthologous SUOs showed high similarity, even though most detectable TEs in these two genomes inserted after their divergence. Interestingly, our computational analysis of polymerase-II (Pol-II) occupancy at gene promoters in different mouse tissues showed that 60% of tissue-specific SUOs show strong Pol-II binding only in embryonic stem cells (ESCs), a proportion significantly higher than the genomic background (37%). In addition, our analysis of histone marks such as H3K4me3 and H3K27me3 in mouse ESCs also suggest a strong association between TE-rich genes and open-chromatin at promoters. Finally, two independent whole-transcriptome datasets show a positive association between TE density and gene expression level in ESCs. While this study focuses on genes with extreme TE densities, the above results clearly show that the probability of TE accumulation/fixation in mammalian genes is not random and is likely associated with different factors/gene properties and, most importantly, an association between the TE insertion/fixation rate and gene activity status in ES cells

    A Novel Protein Isoform of the Multicopy Human NAIP Gene Derives from Intragenic Alu SINE Promoters

    Get PDF
    The human neuronal apoptosis inhibitory protein (NAIP) gene is no longer principally considered a member of the Inhibitor of Apoptosis Protein (IAP) family, as its domain structure and functions in innate immunity also warrant inclusion in the Nod-Like Receptor (NLR) superfamily. NAIP is located in a region of copy number variation, with one full length and four partly deleted copies in the reference human genome. We demonstrate that several of the NAIP paralogues are expressed, and that novel transcripts arise from both internal and upstream transcription start sites. Remarkably, two internal start sites initiate within Alu short interspersed element (SINE) retrotransposons, and a third novel transcription start site exists within the final intron of the GUSBP1 gene, upstream of only two NAIP copies. One Alu functions alone as a promoter in transient assays, while the other likely combines with upstream L1 sequences to form a composite promoter. The novel transcripts encode shortened open reading frames and we show that corresponding proteins are translated in a number of cell lines and primary tissues, in some cases above the level of full length NAIP. Interestingly, some NAIP isoforms lack their caspase-sequestering motifs, suggesting that they have novel functions. Moreover, given that human and mouse NAIP have previously been shown to employ endogenous retroviral long terminal repeats as promoters, exaptation of Alu repeats as additional promoters provides a fascinating illustration of regulatory innovations adopted by a single gene

    Effects of L1-ORF2 fragments on green fluorescent protein gene expression

    Get PDF
    The retrotransposon known as long interspersed nuclear element-1 (L1) is 6 kb long, although most L1s in mammalian and other eukaryotic cells are truncated. L1 contains two open reading frames, ORF1 and ORF2, that code for an RNA-binding protein and a protein with endonuclease and reverse transcriptase activities, respectively. In this work, we examined the effects of full length L1-ORF2 and ORF2 fragments on green fluorescent protein gene (GFP) expression when inserted into the pEGFP-C1 vector downstream of GFP. All of the ORF2 fragments in sense orientation inhibited GFP expression more than when in antisense orientation, which suggests that small ORF2 fragments contribute to the distinct inhibitory effects of this ORF on gene expression. These results provide the first evidence that different 280-bp fragments have distinct effects on the termination of gene transcription, and that when inserted in the antisense direction, fragment 280-9 (the 3' end fragment of ORF2) induces premature termination of transcription that is consistent with the effect of ORF2

    Genome-Wide Assessments Reveal Extremely High Levels of Polymorphism of Two Active Families of Mouse Endogenous Retroviral Elements

    Get PDF
    Endogenous retroviral elements (ERVs) in mice are significant genomic mutagens, causing ∼10% of all reported spontaneous germ line mutations in laboratory strains. The majority of these mutations are due to insertions of two high copy ERV families, the IAP and ETn/MusD elements. This significant level of ongoing retrotranspositional activity suggests that inbred mice are highly variable in content of these two ERV groups. However, no comprehensive genome-wide studies have been performed to assess their level of polymorphism. Here we compared three test strains, for which sufficient genomic sequence is available, to each other and to the reference C57BL/6J genome and detected very high levels of insertional polymorphism for both ERV families, with an estimated false discovery rate of only 0.4%. Specifically, we found that at least 60% of IAP and 25% of ETn/MusD elements detected in any strain are absent in one or more of the other three strains. The polymorphic nature of a set of 40 ETn/MusD elements found within gene introns was confirmed using genomic PCR on DNA from a panel of mouse strains. For some cases, we detected gene-splicing abnormalities involving the ERV and obtained additional evidence for decreased gene expression in strains carrying the insertion. In total, we identified nearly 700 polymorphic IAP or ETn/MusD ERVs or solitary LTRs that reside in gene introns, providing potential candidates that may contribute to gene expression differences among strains. These extreme levels of polymorphism suggest that ERV insertions play a significant role in genetic drift of mouse lines
    corecore