6 research outputs found

Spliced integrated retrotransposed element (SpIRE) formation in the human genome

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)
Publication venue
Publication date: 01/03/2018
Field of study

Human Long interspersed element-1 (L1) retrotransposons contain an internal RNA polymerase II promoter within their 5′ untranslated region (UTR) and encode two proteins (ORF1p and ORF2p) required for their mobilization (i.e., retrotransposition). The evolutionary success of L1 relies on the continuous retrotransposition of full-length L1 mRNAs. Previous studies identified functional splice donor (SD), splice acceptor (SA), and polyadenylation sequences in L1 mRNA and provided evidence that a small number of spliced L1 mRNAs retrotransposed in the human genome. Here, we demonstrate that the retrotransposition of intra-5′UTR or 5′UTR/ORF1 spliced L1 mRNAs leads to the generation of spliced integrated retrotransposed elements (SpIREs). We identified a new intra-5′UTR SpIRE that is ten times more abundant than previously identified SpIREs. Functional analyses demonstrated that both intra-5′UTR and 5′UTR/ORF1 SpIREs lack cis-acting transcription factor binding sites and exhibit reduced promoter activity. The 5′UTR/ORF1 SpIREs also produce nonfunctional ORF1p variants. Finally, we demonstrate that sequence changes within the L1 5′UTR over evolutionary time, which permitted L1 to evade the repressive effects of a host protein, can lead to the generation of new L1 splicing events, which, upon retrotransposition, generate a new SpIRE subfamily. We conclude that splicing inhibits L1 retrotransposition, that SpIREs generally represent evolutionary “dead-ends” in the L1 retrotransposition process, that mutations within the L1 5′UTR alter L1 splicing dynamics, and that retrotransposition of the resultant spliced transcripts can generate interindividual genomic variation.

Directory of Open Access Journals

FigShare

A working model for the generation of SpIREs.

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)

(A) Canonical L1 retrotransposition. An L1 is transcribed from a genomic location (red chromosome). Translation of the mRNA (multicolored wavy line) occurs in the cytoplasm, and ORF1p (yellow circles) and ORF2p (blue oval) bind back onto their respective mRNA (cis-preference) to form an RNP. The L1 RNP then enters the nucleus and a de novo L1 insertion occurs at a new genomic location (green chromosome) by TPRT. This insertion, if full length, could act as a source element, giving rise to new insertions (green arrow) at a new genomic location (gray chromosome). (B) Retrotransposition of intra-5′UTR spliced L1 isoform. A full-length L1 element is transcribed from its genomic location (red chromosome) and undergoes intra-5′UTR splicing. Translation of the mRNA (multicolored wavy line) occurs in the cytoplasm, and ORF1p (yellow circles) and ORF2p (blue oval) bind back onto their respective mRNA (cis-preference) to form an RNP. The L1 RNP then enters the nucleus and L1 mRNAs subject to intra-5′UTR splicing can undergo a single round of retrotransposition (green chromosome) by TPRT. However, because the intra-5′UTR splicing event deletes sequences required for L1 promoter activity, the resultant insertion is unlikely to undergo subsequent rounds of retrotransposition in future generations (dashed green arrow). (C) Retrotransposition of 5′UTR/ORF1 spliced L1 isoform. An L1 is transcribed from its genomic location (red chromosome) and is subject to 5′UTR/ORF1 splicing. Translation of the mRNA (multicolored wavy line) occurs in the cytoplasm; however, because translation occurs at downstream AUG codons, ORF1p (yellow circles) is truncated and nonfunctional, so the 5′UTR/ORF1 spliced L1 mRNA relies on a wild-type source of ORF1p to be supplied from another L1 in trans. In the rare instance that trans-complementation occurs (dotted arrow), it is highly unlikely that the resultant SpIRE will generate RNAs that can undergo retrotransposition in future generations (dashed thin green arrow). L1, Long interspersed element-1; ORF, open reading frame; RNP, ribonucleoprotein particle; SpIRE, spliced integrated retrotransposed element; TPRT, target-site primed reverse transcription; UTR, untranslated region.

FigShare

Intra-5′UTR and 5′UTR/ORF1 SpIREs are retrotransposition-defective.

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)

(A) Results from the SpIRE97/622 retrotransposition assay. The x-axis indicates the construct names. The y-axis indicates the relative retrotransposition efficiency (%). The CMV promoter either augments L1 expression (+CMV, black bars) or is absent (ΔCMV, gray bars) from the L1 expression construct. The relative retrotransposition efficiencies are normalized to pJM101/L1.3 (set at 100%). The pJM105/L1.3 plasmid served as a negative control. The images and data are from one representative experiment (S3 Table). Error bars represent the standard deviation of technical triplicates for the depicted assay. Each assay was repeated three times, yielding similar results. (B) Results from the SpIRE97/976 retrotransposition assay. The x-axis indicates the construct names. The y-axis indicates the relative retrotransposition efficiency (%). A CMV promoter augments L1 expression (+CMV, black bars). The relative retrotransposition efficiencies are normalized to pJM101/L1.3 (set at 100%). The pJM105/L1.3 plasmid served as a negative control. The images and data are from one representative experiment (S4 Table). Error bars represent the standard deviation of technical triplicates for the depicted assay. Each assay was repeated three times, yielding similar results. (C) Results from the SpIRE97/976 trans-complementation assay. The x-axis indicates the “reporter” (top text) and the “driver” (bottom text) construct names. The y-axis indicates the relative trans-complementation efficiency (%). The results of each assay were normalized to the pPL97/976/L1.3 “reporter” plasmid + pJBM561 “driver plasmid” co-transfection experiment, which was set at 100%. The image at the bottom right-hand side of the figure represents the efficiency of pJM101/L1.3 retrotransposition in cis. The pPL97-976/L1.3 “reporter” plasmid + pCEP4 “driver plasmid” co-transfection experiment served as a negative control. The images and data are from one representative experiment (S5 Table). Error bars represent standard deviations of technical triplicates for the depicted experiment. Each assay was repeated four times, yielding similar results. CMV, cytomegalovirus; L1, Long interspersed element-1; ORF, open reading frame; SpIRE, spliced integrated retrotransposed element; UTR, untranslated region.
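The normalization described in this caption (a reference construct set at 100%, error bars from technical triplicates) can be illustrated with a minimal sketch. The function name, the choice of per-well counts as the readout, the use of the sample standard deviation, and all numbers below are assumptions for illustration; they are not taken from S3 to S5 Tables.

```python
from statistics import mean, stdev

def relative_efficiency(counts, reference):
    """Express each construct's replicate counts as a percentage of the
    reference construct's mean; return (mean %, sample standard deviation %)."""
    ref_mean = mean(counts[reference])
    if ref_mean == 0:
        raise ValueError("reference construct has a mean of zero")
    summary = {}
    for name, wells in counts.items():
        percents = [100.0 * w / ref_mean for w in wells]
        summary[name] = (mean(percents), stdev(percents))
    return summary

# Hypothetical technical triplicates; NOT values from the paper.
example_counts = {
    "pJM101/L1.3": [200, 210, 190],  # reference construct, defined as 100%
    "pJM105/L1.3": [0, 1, 0],        # negative control
}
summary = relative_efficiency(example_counts, "pJM101/L1.3")
```

By construction the reference construct's mean is exactly 100%, so only its spread carries information; the other constructs are read relative to it.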

FigShare

Sequence changes within the 5′UTR affect intra-5′UTR splice site choice.

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)

(A) Schematic of the L1PA1 and L1PA3 5′UTRs. Top schematic: the relative positions of the SD (red lettering), SA (green lettering), and putative branch point sequence (ACCTCAC, black lettering) in the L1PA1 5′UTR that led to the formation of SpIRE97/790 are indicated. Superscript numbers indicate the first and last nucleotide of the indicated sequence. Note that nucleotide positions are indicated in the context of L1.3 (accession #L19088). Numbers below the branch point (underlined A; 95.75) and above the SA A788G789 (84.95) indicate the predicted scores of those sequences for utilization in a splicing reaction, as determined using Human Splicing Finder v.3.0 (http://www.umd.be/HSF3/) [105]. Note that predicted scores above 80 are considered “strong” [105]. Bottom schematic: the relative positions of the SD (red lettering), SAs A851G852 (purple lettering) and A916G917 (green lettering), and putative branch point sequence (TCCAGAG, black lettering) in the L1PA3 5′UTR are indicated. Superscript numbers indicate the first and last nucleotide of the indicated sequence. Numbers below the branch point (underlined A; 75.73) and SAs A851G852 (83.75) and A916G917 (79.66) indicate the predicted strength of those sequences for utilization in a splicing reaction, as determined using Human Splicing Finder v.3.0 (http://www.umd.be/HSF3/) [105]. The L1PA3 5′UTR contains a 129-bp sequence (gray triangle) containing the SA A851G852 that was lost in the transition from the L1PA3 to L1PA2/L1PA1 subfamilies. The 129-bp deletion repositions the SA A916G917 of L1PA3 closer to a putative branch point in the 5′UTR of the L1PA2/L1PA1 subfamilies (now noted as A788G789 in the top schematic), leading to a higher predicted score (84.95 in PA1 compared to 79.66 in PA3). (B) Schematic of luciferase constructs and results from luciferase assays. Top panel: the L1RP 5′UTR (gray rectangle) was used to drive the transcription of the firefly luciferase reporter gene (green rectangle) present in plasmid pGL4.11. The following plasmids were created: pJBMWTLUC contains the full-length L1RP 5′UTR; pJBMWT129PA4LUC contains the 129-bp (black box in 5′UTR) sequence derived from L1PA4 within the L1RP 5′UTR; pJBMWT129SCRLUC contains a scrambled version of the 129-bp sequence (black and white striped box) within the 5′UTR. Bottom panel: luciferase assay. The x-axis indicates the name of the luciferase expression plasmid. The y-axis indicates the relative firefly luciferase units normalized to a co-transfected Renilla luciferase internal control. These data represent the averages of three biological replicates (S6 Table). Each biological replicate contained six technical replicates. Error bars indicate the standard deviation between three biological replicates. P-values were determined using a one-tailed Student's t test and “n.s.” indicates that there was no statistical difference. (C) Results from the EGFP retrotransposition assay: the x-axis indicates the construct names. The y-axis indicates the relative retrotransposition efficiency (%). The relative retrotransposition efficiencies are normalized to pL1RP-EGFP (set at 100%). The data are from one representative experiment (S7 Table). Error bars represent the standard deviation of technical triplicates for the depicted assay. Each assay was repeated four times, yielding similar results. (D) Results from RT-PCR assays: a 2.0% agarose gel depicting the results from a representative qualitative RT-PCR experiment. DNA size markers (1 kb Plus DNA Ladder) are shown at the left of the gel. Plasmid names are indicated above the gel; UTF = untransfected HeLa-JVM cells, H2O = water control for PCR reactions. The right half of the agarose gel (“NO RT”) indicates the results from a representative experiment conducted without the addition of reverse transcriptase. The inset to the right of the gel indicates the major (*, **, and ***) and minor (+, @, and $) cDNA products detected in the experiments. The assay was repeated four times, yielding similar results. H2O, water control for PCR reactions; L1, Long interspersed element-1; RT-PCR, reverse transcription PCR; SA, splice acceptor; SD, splice donor; SpIRE, spliced integrated retrotransposed element; UTF, untransfected HeLa-JVM cells; UTR, untranslated region.
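Panel B describes firefly signal normalized to a co-transfected Renilla control, averaged over biological replicates that each contain several technical replicates. The sketch below shows one way such a summary could be computed. The function name, the order of averaging (technical ratios first, then biological means), and the readings are assumptions for illustration, not values from S6 Table, and the t test mentioned in the caption is not reproduced here.

```python
from statistics import mean, stdev

def normalized_luciferase(biological_replicates):
    """Each biological replicate is a list of (firefly, renilla) readings.
    Returns the mean and sample standard deviation across the
    per-replicate mean firefly/Renilla ratios."""
    per_replicate = []
    for technical in biological_replicates:
        ratios = [firefly / renilla for firefly, renilla in technical]
        per_replicate.append(mean(ratios))
    return mean(per_replicate), stdev(per_replicate)

# Hypothetical readings (two technical replicates shown per biological
# replicate for brevity; the caption reports six). NOT data from the paper.
example = [
    [(1000.0, 100.0), (1100.0, 100.0)],
    [(1200.0, 100.0), (1200.0, 100.0)],
    [(900.0, 100.0), (900.0, 100.0)],
]
avg, sd = normalized_luciferase(example)
```

Dividing each firefly reading by its paired Renilla reading before averaging corrects for well-to-well differences in transfection efficiency, which is the usual purpose of an internal control.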

FigShare

L1 mRNA contains potential SD and SA sites.

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)

(A) Schematic of a full-length retrotransposition-competent genomic L1. Top: the 5′ and 3′ UTRs (gray rectangles), ORF1 (yellow rectangle), and ORF2 (blue rectangle) are indicated in the schematic. The approximate positions of sense transcription initiation and antisense transcription initiation are indicated with black arrows on the top and bottom of the 5′UTR, respectively. The approximate positions of the coiled-coil (CC), RNA recognition motif (RRM), and C-terminal domain (CTD) are indicated in black lettering in ORF1. The endonuclease (EN), reverse transcriptase (RT), and cysteine-rich (C) domains are indicated in white lettering in ORF2. The 3′UTR ends in an AN. The L1 is flanked by target-site duplications (black arrowheads) in genomic DNA (black helical lines). Bottom: a magnified schematic of the 5′UTR and 5′ end of ORF1. The black arrow indicates the relative position of sense transcription initiation. The SD (red) and SA (green) sequences used to generate SpIREs are indicated above the 5′UTR (gray rectangle) and ORF1 (yellow rectangle). The positions of the SD and SA sequences relative to L1.3 are indicated with superscript numbers. The relative positions of cis-acting transcription factor binding sequences are indicated in the 5′UTR. (B–D) Schematics of the splicing events generating SpIRE97/622, SpIRE97/790, and SpIRE97/976. The SD (red underlined GU nucleotides) and SA (green underlined AG nucleotides) demarcate the intron boundaries used to generate each class of SpIRE. The left half of the figure depicts the L1 mRNA sequence before splicing and the right half of the figure depicts L1 mRNA after splicing. AN, poly(A) tract; C, cysteine-rich; CC, coiled-coil; CTD, C-terminal domain; EN, endonuclease; L1, Long interspersed element-1; ORF, open reading frame; poly(A), polyadenosine; RRM, RNA recognition motif; RT, reverse transcriptase; RUNX3, runt related transcription factor 3; SA, splice acceptor; SD, splice donor; SpIRE, spliced integrated retrotransposed element; SP1, specificity protein 1; SRY, sex determining region Y; UTR, untranslated region; YY1, yin and yang 1.
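The before/after splicing schematics in panels B to D amount to removing an intron bounded by a GU donor and an AG acceptor. The sketch below illustrates that operation on a toy DNA sequence (GT in place of GU). The function name and the coordinate convention (last nucleotide kept upstream, first nucleotide kept downstream, both 1-based) are assumptions for illustration, and the toy sequence is not L1.3.

```python
def splice(seq, last_upstream_nt, first_downstream_nt):
    """Remove the intron lying between two 1-based exon boundary positions.

    last_upstream_nt: last nucleotide retained before the intron.
    first_downstream_nt: first nucleotide retained after the intron.
    Raises ValueError if the removed segment lacks GT...AG boundaries."""
    if last_upstream_nt < 1 or first_downstream_nt > len(seq):
        raise ValueError("coordinates fall outside the sequence")
    intron = seq[last_upstream_nt:first_downstream_nt - 1]
    if len(intron) < 4:
        raise ValueError("intron too short to hold donor and acceptor")
    if not (intron.startswith("GT") and intron.endswith("AG")):
        raise ValueError("intron does not have GT...AG boundaries")
    return seq[:last_upstream_nt] + seq[first_downstream_nt - 1:]

# Toy sequence: 4-nt upstream exon, 16-nt intron, 4-nt downstream exon.
toy = "ATGC" + "GTAAGTCTAACTTCAG" + "GGCC"
spliced = splice(toy, 4, 21)
```

Checking the donor and acceptor dinucleotides before joining guards against off-by-one coordinate errors, which are easy to make when mixing 1-based positions with 0-based slicing.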

FigShare

ORF1p expression from intra-5′UTR and 5′UTR/ORF1 SpIREs.

Author: Christine R. Beck (699745)
Jeffrey M. Kidd (145815)
John B. Moldovan (240328)
John V. Moran (240345)
Naveen Jasti (4911400)
Peter A. Larson (4512199)

(A) Schematics of the engineered L1 constructs. The L1 5′UTR (gray rectangle), ORF1 (yellow rectangle), and ORF2 (blue rectangle) are indicated in the constructs. Relative positions of the SpIRE97/622 and SpIRE97/976 deletions (red triangles) are indicated on the bottom two constructs, respectively. The CMV promoter (white arrowhead) and the mneoI retrotransposition indicator cassette (green rectangle = neo gene sequence; black “v” line = intron interrupting neo coding sequence, SD = splice donor site, SA = splice acceptor site) are indicated at the 5′ and 3′ ends of the constructs, respectively. The black lollipop at the 3′ end on top of the constructs indicates the sense SV40 polyadenylation signal. The black arrow and gray lollipop on the bottom of the constructs are embedded within the mneoI retrotransposition indicator cassette and indicate an SV40 early promoter and herpes simplex virus thymidine kinase polyadenylation signal, respectively, in the antisense orientation. (B) Representative ORF1p western blot from WCLs. Molecular weight standards (kDa) are indicated to the left of the image. The black arrowhead indicates the predicted size of full-length ORF1p (about 40 kDa). Construct names are indicated above the image; pCEP/GFP = negative control. The antibody used in the western blot experiment is indicated to the right of the gel (α-N-ORF1p). The eIF3 protein (110 kDa) served as a loading control. Western blots were performed three times, yielding similar results. (C) Schematic of ORF1 and relative location of antibody binding. Top: The relative positions in ORF1 (yellow rectangle) of the SA sequence at nucleotides 974–975 (green), the canonical ORF1 initiator methionine (AUG, black, 40 kDa), the two putative initiator methionine codons (AUG, orange, 33 kDa; AUG, blue, 27 kDa), and the N- and C-terminal epitopes recognized by the ORF1p Ab (red and purple stars, respectively) are indicated in the figure. 
(D) Representative western blots from WCLs: molecular weight standards (kDa) are indicated to the left of the gels. The predicted sizes of full-length ORF1p (black arrowhead) and the N-terminal truncated ORF1p variants (orange and blue arrows, respectively) are highlighted on the gel. Construct names are indicated above the image; pCEP/GFP = negative control. The antibodies used in the western blot experiments are indicated to the left (α-N-ORF1p) and right (α-C-ORF1p) of the gel images, respectively. The eIF3 protein (110 kDa) served as a loading control. The unlabeled band at about 25 kDa in the α-C-ORF1p experiment is an unknown cross-reacting product that was not detected in RNPs or with an antibody to a C-terminal ORF1p T7-gene10 epitope tag (S4A Fig and S4B Fig). Western blots were performed three times, yielding similar results. α-C-ORF1p, C-terminal ORF1p antibody; α-eIF3, eukaryotic initiation factor 3 antibody; α-N-ORF1p, N-terminal ORF1p antibody; Ab, antibody; AUG, translation initiation codon; CMV, cytomegalovirus; kDa, kilodalton; L1, Long interspersed element-1; ORF, open reading frame; SA, splice acceptor; SD, splice donor; SpIRE, spliced integrated retrotransposed element; UTR, untranslated region; WCL, whole cell lysate.

FigShare