5 research outputs found

    A translational reporter system identified leaderless and leadered initiation codon preferences.

    No full text
    <p>(A) Libraries of leader sequences were generated using two overlapping oligonucleotides, each with a single randomized codon positioned either at the leaderless position (+1) or the leadered position (+30), in-frame with the zeocin-resistance (<i>zeo</i><sup>r</sup>) gene. Self-primed heterodimers were inserted between the promoter and the <i>zeo</i><sup>r</sup> gene and transformed into <i>E</i>. <i>coli</i>. The library was electroporated into <i>M</i>. <i>smegmatis</i>. Hygromycin selection allowed maintenance of the complete library, while zeocin selection required translation initiation at either one of the randomized codon sites. Following selection in zeocin, plasmids were recovered and the leader regions amplified for Ion-Torrent sequencing. Deep sequencing of amplicon libraries allowed the unbiased identification and estimation of relative efficiency of initiation codons. (B) A Shine-Dalgarno site was omitted to facilitate direct comparison between leaderless and leadered architectures. Read counts were compiled for each of the 64 possible codons at the leaderless position (columns) and leadered position (rows). Heat map indicates read counts of each combinatorial leaderless/leadered codon pair, from 10<sup>0</sup> (blue) through 10<sup>4</sup> (yellow). Only ATG or GTG at the leaderless position were capable of initiating translation of <i>zeo</i><sup>r</sup>. At the leadered codon position, no enrichment indicated that translation initiation did not occur at any of the possible codons. A further reduction of the expected stop codons suggested that they prevented read through of leaderless ribosomes into the <i>zeo</i><sup><i>r</i></sup> ORF. (C) A Shine-Dalgarno sequence enables efficient use of diverse leadered initiation codons. A consensus Shine-Dalgarno (SD) element was placed upstream of the randomized leadered codon position. Zeocin-resistant pools showed a complex pattern of active translation initiation codons at both the leaderless and leadered positions. The presence of a Shine-Dalgarno supported translation initiation activity of ATG and GTG triplets in the leadered position, as well as the less common TTG and ATT triplets.</p

    Small protein ORFs are frequently coupled to the ORF downstream.

    No full text
    <p>(A) <i>M</i>. <i>tuberculosis</i> leaderless transcripts initiate unannotated small protein ORFs that terminate at the start of the annotated gene downstream more often than expected. All small protein ORF stop codons within 100 nucleotides of an annotated gene start are shown relative to that start codon (0 = coupled RTGA overlap). Three structural classes are identified: uORFs (the small ORF terminates upstream of the annotated start), coupled ORFs (linked by an RTGA tetramer), and overlapping ORFs. The y-axis shows the fraction of small ORFs that terminate a specified distance (x-axis) from the annotated start codon of the downstream gene. (B) One example of a coupled small protein in <i>M</i>. <i>tuberculosis</i> and <i>M</i>. <i>smegmatis</i>, upstream of orthologous genes. The primary sequence of the encoded small protein is not conserved, but the leaderless initiation and coupled linkage is maintained.</p

    Leaderless gene architectures bring promoters and ORFs together.

    No full text
    <p>(A) Logo plot of TSS and proximal promoter region of traditional leadered genes. A purine (A or G) is favored at the +1 nucleotide, and an AT rich -10 element appears upstream. The 5’ UTR downstream of the transcription start site shows no sequence constraints or enrichment. (B) A Logo plot of the 5’ UTR from the translation initiation codon shows a Shine-Dalgarno-like AGGAGG sequence enrichment, centered 9–10 nt upstream (positions 10–11). From the initiation codon, the coding sequence downstream shows the wobble bias of the G-C rich mycobacterial genome. (C) The proximal promoter regions of leaderless genes have a -10 sequence of similar composition and spacing to that of leadered genes (compare to 2A). The TSS is also the first nucleotide of the translation initiation codon. There is no evidence of Shine-Dalgarno sequence enrichment upstream. The ORF initiated by leaderless codons shows the same wobble bias as seen in leadered ORFs.</p

    Definition of <i>cis</i> elements that support translation initiation in mycobacteria.

    No full text
    <p>(A) Zeo-seq viability reporter libraries were generated to determine the sequence context preferences for a SD upstream of a leadered initiation codon. Randomized nucleotides were positioned upstream of a leadered initiation codon, and zeocin selection enriched for Shine-Dalgarno-like sequences, indicating that mycobacteria adhere to this canonical translation criterion. (B) Leaderless translation initiation exhibits no sequence preference in the adjacent mRNA. A block of 6 nt was randomized immediately downstream of a leaderless initiation zeocin reporter construct. Sequences in the recovered pools of zeocin-resistant <i>M</i>. <i>smegmatis</i> were not enriched in composition or motifs in this region. The absence of any detectable enrichment in the randomized region for the leaderless pool indicates that there are no nucleotide preferences for efficient leaderless initiation in mycobacteria downstream of the RTG codon.</p

    Leaderless and leadered genes produce distinct RNA-seq and ribosome profiling 5’ boundaries.

    No full text
    <p>(A) The transcription start site (TSS) and translation initiation site are the same in leaderless genes. No 5’ UTR and no Shine-Dalgarno (SD) sequence imply that an assembled 70S ribosome engages the 5’ terminal initiation codon directly, followed by elongation to translate the ORF. Individual sequence reads from RNA-seq (green) and ribosome profiling (Ribo-seq, orange) analyses were mapped to the genome, and the abundance of the individual reads is indicated by the height of the peaks. In leaderless translation RNA-seq and ribosome profiling have coincident 5’ boundaries. The 5’ triplet is nearly always ATG or GTG and, in the typical example shown, corresponds to the predicted N-terminus of the annotated ORF. (B) Traditional gene structures generate nested ribosome profiling profiles, with a 5’ UTR that includes an SD ribosome-binding site upstream of the initiating methionine codon (ATG). The 30S and 50S ribosomal subunits assemble at the SD to form a complete 70S ribosome that begins translation at the adjacent AUG with an N-terminal formylated methionine (fM) amino acid residue. RNA-seq reads (green) indicate positive-strand transcription at this locus, and upstream of an annotated ORF. Mapped ribosome profiling reads (orange) begin downstream of the onset of RNA-seq reads, and ~17–35 nt upstream of the initiation codon of the annotated ORF. In both examples, JCVI correctly predicted the respective ORFs (black).</p
    corecore