61,318 research outputs found

    Definition of a high-affinity Gag recognition structure mediating packaging of a retroviral RNA genome

    Get PDF
    All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs

    Towards 3D structure prediction of large RNA molecules: an integer programming framework to insert local 3D motifs in RNA secondary structure

    Get PDF
    Motivation: The prediction of RNA 3D structures from its sequence only is a milestone to RNA function analysis and prediction. In recent years, many methods addressed this challenge, ranging from cycle decomposition and fragment assembly to molecular dynamics simulations. However, their predictions remain fragile and limited to small RNAs. To expand the range and accuracy of these techniques, we need to develop algorithms that will enable to use all the structural information available. In particular, the energetic contribution of secondary structure interactions is now well documented, but the quantification of non-canonical interactions—those shaping the tertiary structure—is poorly understood. Nonetheless, even if a complete RNA tertiary structure energy model is currently unavailable, we now have catalogues of local 3D structural motifs including non-canonical base pairings. A practical objective is thus to develop techniques enabling us to use this knowledge for robust RNA tertiary structure predictors

    A new procedure to analyze RNA non-branching structures

    Get PDF
    RNA structure prediction and structural motifs analysis are challenging tasks in the investigation of RNA function. We propose a novel procedure to detect structural motifs shared between two RNAs (a reference and a target). In particular, we developed two core modules: (i) nbRSSP_extractor, to assign a unique structure to the reference RNA encoded by a set of non-branching structures; (ii) SSD_finder, to detect structural motifs that the target RNA shares with the reference, by means of a new score function that rewards the relative distance of the target non-branching structures compared to the reference ones. We integrated these algorithms with already existing software to reach a coherent pipeline able to perform the following two main tasks: prediction of RNA structures (integration of RNALfold and nbRSSP_extractor) and search for chains of matches (integration of Structator and SSD_finder)

    Flexible RNA design under structure and sequence constraints using formal languages

    Get PDF
    The problem of RNA secondary structure design (also called inverse folding) is the following: given a target secondary structure, one aims to create a sequence that folds into, or is compatible with, a given structure. In several practical applications in biology, additional constraints must be taken into account, such as the presence/absence of regulatory motifs, either at a specific location or anywhere in the sequence. In this study, we investigate the design of RNA sequences from their targeted secondary structure, given these additional sequence constraints. To this purpose, we develop a general framework based on concepts of language theory, namely context-free grammars and finite automata. We efficiently combine a comprehensive set of constraints into a unifying context-free grammar of moderate size. From there, we use generic generic algorithms to perform a (weighted) random generation, or an exhaustive enumeration, of candidate sequences. The resulting method, whose complexity scales linearly with the length of the RNA, was implemented as a standalone program. The resulting software was embedded into a publicly available dedicated web server. The applicability demonstrated of the method on a concrete case study dedicated to Exon Splicing Enhancers, in which our approach was successfully used in the design of \emph{in vitro} experiments.Comment: ACM BCB 2013 - ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics (2013

    Systematic discovery of structural elements governing stability of mammalian messenger RNAs.

    Get PDF
    Decoding post-transcriptional regulatory programs in RNA is a critical step towards the larger goal of developing predictive dynamical models of cellular behaviour. Despite recent efforts, the vast landscape of RNA regulatory elements remains largely uncharacterized. A long-standing obstacle is the contribution of local RNA secondary structure to the definition of interaction partners in a variety of regulatory contexts, including--but not limited to--transcript stability, alternative splicing and localization. There are many documented instances where the presence of a structural regulatory element dictates alternative splicing patterns (for example, human cardiac troponin T) or affects other aspects of RNA biology. Thus, a full characterization of post-transcriptional regulatory programs requires capturing information provided by both local secondary structures and the underlying sequence. Here we present a computational framework based on context-free grammars and mutual information that systematically explores the immense space of small structural elements and reveals motifs that are significantly informative of genome-wide measurements of RNA behaviour. By applying this framework to genome-wide human mRNA stability data, we reveal eight highly significant elements with substantial structural information, for the strongest of which we show a major role in global mRNA regulation. Through biochemistry, mass spectrometry and in vivo binding studies, we identified human HNRPA2B1 (heterogeneous nuclear ribonucleoprotein A2/B1, also known as HNRNPA2B1) as the key regulator that binds this element and stabilizes a large number of its target genes. We created a global post-transcriptional regulatory map based on the identity of the discovered linear and structural cis-regulatory elements, their regulatory interactions and their target pathways. This approach could also be used to reveal the structural elements that modulate other aspects of RNA behaviour

    eIF4A1-dependent mRNAs employ purine-rich 5’UTR sequences to activate localised eIF4A1-unwinding through eIF4A1-multimerisation to facilitate translation

    Get PDF
    Altered eIF4A1 activity promotes translation of highly structured, eIF4A1-dependent oncogene mRNAs at root of oncogenic translational programmes. It remains unclear how these mRNAs recruit and activate eIF4A1 unwinding specifically to facilitate their preferential translation. Here, we show that single-stranded RNA sequence motifs specifically activate eIF4A1 unwinding allowing local RNA structural rearrangement and translation of eIF4A1-dependent mRNAs in cells. Our data demonstrate that eIF4A1-dependent mRNAs contain AG-rich motifs within their 5’UTR which specifically activate eIF4A1 unwinding of local RNA structure to facilitate translation. This mode of eIF4A1 regulation is used by mRNAs encoding components of mTORC-signalling and cell cycle progression, and renders these mRNAs particularly sensitive to eIF4A1-inhibition. Mechanistically, we show that binding of eIF4A1 to AG-rich sequences leads to multimerization of eIF4A1 with eIF4A1 subunits performing distinct enzymatic activities. Our structural data suggest that RNA-binding of multimeric eIF4A1 induces conformational changes in the RNA resulting in an optimal positioning of eIF4A1 proximal to the RNA duplex enabling efficient unwinding. Our data proposes a model in which AG-motifs in the 5’UTR of eIF4A1-dependent mRNAs specifically activate eIF4A1, enabling assembly of the helicase-competent multimeric eIF4A1 complex, and positioning these complexes proximal to stable localised RNA structure allowing ribosomal subunit scanning

    Local Absence of Secondary Structure Permits Translation of mRNAs that Lack Ribosome-Binding Sites

    Get PDF
    The initiation of translation is a fundamental and highly regulated process in gene expression. Translation initiation in prokaryotic systems usually requires interaction between the ribosome and an mRNA sequence upstream of the initiation codon, the so-called ribosome-binding site (Shine-Dalgarno sequence). However, a large number of genes do not possess Shine-Dalgarno sequences, and it is unknown how start codon recognition occurs in these mRNAs. We have performed genome-wide searches in various groups of prokaryotes in order to identify sequence elements and/or RNA secondary structural motifs that could mediate translation initiation in mRNAs lacking Shine-Dalgarno sequences. We find that mRNAs without a Shine-Dalgarno sequence are generally less structured in their translation initiation region and show a minimum of mRNA folding at the start codon. Using reporter gene constructs in bacteria, we also provide experimental support for local RNA unfoldedness determining start codon recognition in Shine-Dalgarno–independent translation. Consistent with this, we show that AUG start codons reside in single-stranded regions, whereas internal AUG codons are usually in structured regions of the mRNA. Taken together, our bioinformatics analyses and experimental data suggest that local absence of RNA secondary structure is necessary and sufficient to initiate Shine-Dalgarno–independent translation. Thus, our results provide a plausible mechanism for how the correct translation initiation site is recognized in the absence of a ribosome-binding site

    Monovalent ions modulate the flux through multiple folding pathways of an RNA pseudoknot

    Get PDF
    The functions of RNA pseudoknots (PKs), which are minimal tertiary structural motifs and an integral part of several ribozymes and ribonucleoprotein complexes, are determined by their structure, stability and dynamics. Therefore, it is important to elucidate the general principles governing their thermodynamics/folding mechanisms. Here, we combine experiments and simulations to examine the folding/unfolding pathways of the VPK pseudoknot, a variant of the Mouse Mammary Tumor Virus (MMTV) PK involved in ribosomal frameshifting. Fluorescent nucleotide analogs (2-aminopurine and pyrrolocytidine) placed at different stem/loop positions in the PK, and laser temperature-jump approaches serve as local probes allowing us to monitor the order of assembly of VPK with two helices with different intrinsic stabilities. The experiments and molecular simulations show that at 50 mM KCl the dominant folding pathway populates only the more stable partially folded hairpin. As the salt concentration is increased a parallel folding pathway emerges, involving the less stable hairpin structure as an alternate intermediate. Notably, the flux between the pathways is modulated by the ionic strength. The findings support the principle that the order of PK structure formation is determined by the relative stabilities of the hairpins, which can be altered by sequence variations or salt concentrations. Our study not only unambiguously demonstrates that PK folds by parallel pathways, but also establishes that quantitative description of RNA self-assembly requires a synergistic combination of experiments and simulations.Comment: Supporting Information include
    corecore