88 research outputs found

    Machine learning: Lighting up protein design

    Get PDF
    Using a neural network to predict how green fluorescent proteins respond to genetic mutations illuminates properties that could help design new proteins

    Brr2p-mediated conformational rearrangements in the spliceosome during activation and substrate repositioning

    Get PDF
    Brr2p is one of eight RNA helicases involved in pre-mRNA splicing. Detailed understanding of the functions of Brr2p and other spliceosomal helicases has been limited by lack of knowledge of their in vivo substrates. To address this, sites of direct Brr2p–RNA interaction were identified by in vivo UV cross-linking in budding yeast. Cross-links identified in the U4 and U6 small nuclear RNAs (snRNAs) suggest U4/U6 stem I as a Brr2p substrate during spliceosome activation. Further Brr2p cross-links were identified in loop 1 of the U5 snRNA and near splice sites and 3′ ends of introns, suggesting the possibility of a previously uncharacterized function for Brr2p in the catalytic center of the spliceosome. Consistent with this, mutant brr2-G858R reduced second-step splicing efficiency and enhanced cross-linking to 3′ ends of introns. Furthermore, RNA sequencing indicated preferential inhibition of splicing of introns with structured 3′ ends. The Brr2-G858Rp cross-linking pattern in U6 was consistent with an open conformation for the catalytic center of the spliceosome during first-to-second-step transition. We propose a previously unsuspected function for Brr2p in driving conformational rearrangements that lead to competence for the second step of splicing

    Transcriptome-wide analysis of exosome targets

    Get PDF
    The exosome plays major roles in RNA processing and surveillance but the in vivo target range and substrate acquisition mechanisms remain unclear. Here we apply in vivo RNA crosslinking (CRAC) to the nucleases (Rrp44, Rrp6), two structural subunits (Rrp41, Csl4) and a cofactor (Trf4) of the yeast exosome. Analysis of wild-type Rrp44 and catalytic mutants showed that both the CUT and SUT classes of non-coding RNA, snoRNAs and, most prominently, pre-tRNAs and other Pol III transcripts are targeted for oligoadenylation and exosome degradation. Unspliced pre-mRNAs were also identified as targets for Rrp44 and Rrp6. CRAC performed using cleavable proteins (split-CRAC) revealed that Rrp44 endonuclease and exonuclease activities cooperate on most substrates. Mapping oligoadenylated reads suggests that the endonuclease activity may release stalled exosome substrates. Rrp6 was preferentially associated with structured targets, which frequently did not associate with the core exosome indicating that substrates follow multiple pathways to the nucleases

    Evidence in disease and non-disease contexts that nonsense mutations cause altered splicing via motif disruption

    Get PDF
    Transcripts containing premature termination codons (PTCs) can be subject to nonsense-associated alternative splicing (NAS). Two models have been evoked to explain this, scanning and splice motif disruption. The latter postulates that exonic cis motifs, such as exonic splice enhancers (ESEs), are disrupted by nonsense mutations. We employ genome-wide transcriptomic and k-mer enrichment methods to scrutinize this model. First, we show that ESEs are prone to disruptive nonsense mutations owing to their purine richness and paucity of TGA, TAA and TAG. The motif model correctly predicts that NAS rates should be low (we estimate 5–30%) and approximately in line with estimates for the rate at which random point mutations disrupt splicing (8–20%). Further, we find that, as expected, NAS-associated PTCs are predictable from nucleotide-based machine learning approaches to predict splice disruption and, at least for pathogenic variants, are enriched in ESEs. Finally, we find that both in and out of frame mutations to TAA, TGA or TAG are associated with exon skipping. While a higher relative frequency of such skip-inducing mutations in-frame than out of frame lends some credence to the scanning model, these results reinforce the importance of considering splice motif modulation to understand the etiology of PTC-associated disease

    PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast

    Get PDF
    Background Nrd1 and Nab3 are essential sequence-specific yeast RNA binding proteins that function as a heterodimer in the processing and degradation of diverse classes of RNAs. These proteins also regulate several mRNA coding genes; however, it remains unclear exactly what percentage of the mRNA component of the transcriptome these proteins control. To address this question, we used the pyCRAC software package developed in our laboratory to analyze CRAC and PAR-CLIP data for Nrd1-Nab3-RNA interactions. Results We generated high-resolution maps of Nrd1-Nab3-RNA interactions, from which we have uncovered hundreds of new Nrd1-Nab3 mRNA targets, representing between 20 and 30% of protein-coding transcripts. Although Nrd1 and Nab3 showed a preference for binding near 5′ ends of relatively short transcripts, they bound transcripts throughout coding sequences and 3′ UTRs. Moreover, our data for Nrd1-Nab3 binding to 3′ UTRs was consistent with a role for these proteins in the termination of transcription. Our data also support a tight integration of Nrd1-Nab3 with the nutrient response pathway. Finally, we provide experimental evidence for some of our predictions, using northern blot and RT-PCR assays. Conclusions Collectively, our data support the notion that Nrd1 and Nab3 function is tightly integrated with the nutrient response and indicate a role for these proteins in the regulation of many mRNA coding genes. Further, we provide evidence to support the hypothesis that Nrd1-Nab3 represents a failsafe termination mechanism in instances of readthrough transcription.</p

    Protein Evolution via Amino Acid and Codon Elimination

    Get PDF
    BACKGROUND: Global residue-specific amino acid mutagenesis can provide important biological insight and generate proteins with altered properties, but at the risk of protein misfolding. Further, targeted libraries are usually restricted to a handful of amino acids because there is an exponential correlation between the number of residues randomized and the size of the resulting ensemble. Using GFP as the model protein, we present a strategy, termed protein evolution via amino acid and codon elimination, through which simplified, native-like polypeptides encoded by a reduced genetic code were obtained via screening of reduced-size ensembles. METHODOLOGY/PRINCIPAL FINDINGS: The strategy involves combining a sequential mutagenesis scheme to reduce library size with structurally stabilizing mutations, chaperone complementation, and reduced temperature of gene expression. In six steps, we eliminated a common buried residue, Phe, from the green fluorescent protein (GFP), while retaining activity. A GFP variant containing 11 Phe residues was used as starting scaffold to generate 10 separate variants in which each Phe was replaced individually (in one construct two adjacent Phe residues were changed simultaneously), while retaining varying levels of activity. Combination of these substitutions to generate a Phe-free variant of GFP abolished fluorescence. Combinatorial re-introduction of five Phe residues, based on the activities of the respective single amino acid replacements, was sufficient to restore GFP activity. Successive rounds of mutagenesis generated active GFP variants containing, three, two, and zero Phe residues. These GFPs all displayed progenitor-like fluorescence spectra, temperature-sensitive folding, a reduced structural stability and, for the least stable variants, a reduced steady state abundance. CONCLUSIONS/SIGNIFICANCE: The results provide strategies for the design of novel GFP reporters. The described approach offers a means to enable engineering of active proteins that lack certain amino acids, a key step towards expanding the functional repertoire of uniquely labeled proteins in synthetic biology

    Global mapping of RNA homodimers in living cells

    Get PDF
    RNA homodimerization is important for various physiological processes, including the assembly of membraneless organelles, RNA subcellular localization, and packaging of viral genomes. However, understanding RNA dimerization has been hampered by the lack of systematic in vivo detection methods. Here, we show that CLASH, PARIS, and other RNA proximity ligation methods detect RNA homodimers transcriptome-wide as “overlapping” chimeric reads that contain more than one copy of the same sequence. Analyzing published proximity ligation data sets, we show that RNA:RNA homodimers mediated by direct base-pairing are rare across the human transcriptome, but highly enriched in specific transcripts, including U8 snoRNA, U2 snRNA, and a subset of tRNAs. Mutations in the homodimerization domain of U8 snoRNA impede dimerization in vitro and disrupt zebrafish development in vivo, suggesting an evolutionarily conserved role of this domain. Analysis of virus-infected cells reveals homodimerization of SARS-CoV-2 and Zika genomes, mediated by specific palindromic sequences located within protein-coding regions of N gene in SARS-CoV-2 and NS2A gene in Zika. We speculate that regions of viral genomes involved in homodimerization may constitute effective targets for antiviral therapies

    MicroRNA-145 Targets YES and STAT1 in Colon Cancer Cells

    Get PDF
    BACKGROUND: MicroRNAs (miRNAs) have emerged as important gene regulators and are recognized as key players in tumorigenesis. miR-145 is reported to be down-regulated in several cancers, but knowledge of its targets in colon cancer remains limited. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the role of miR-145 in colon cancer, we have employed a microarray based approach to identify miR-145 targets. Based on seed site enrichment analyses and unbiased word analyses, we found a significant enrichment of miRNA binding sites in the 3'-untranslated regions (UTRs) of transcripts down-regulated upon miRNA overexpression. Gene Ontology analysis showed an overrepresentation of genes involved in cell death, cellular growth and proliferation, cell cycle, gene expression and cancer. A number of the identified miRNA targets have previously been implicated in cancer, including YES, FSCN1, ADAM17, BIRC2, VANGL1 as well as the transcription factor STAT1. Both YES and STAT1 were verified as direct miR-145 targets. CONCLUSIONS/SIGNIFICANCE: The study identifies and validates new cancer-relevant direct targets of miR-145 in colon cancer cells and hereby adds important mechanistic understanding of the tumor-suppressive functions of miR-145

    Box C/D snoRNP catalysed methylation is aided by additional pre-rRNA base-pairing

    Get PDF
    Box C/D small nucleolar RNPs catalyse 2′-O-methylation of eukaryotic ribosomal RNA. A large-scale analysis of yeast box C/D snoRNAs reveals conserved ‘extra base-pairing' between snoRNAs and regions adjacent to their rRNA methylation site and points to a role for the non-catalytic protein subunits Nop56 and Nop58 in rRNA binding
    corecore