1,831 research outputs found

    Bridge helix bending promotes RNA polymerase II backtracking through a critical and conserved threonine residue.

    Get PDF
    The dynamics of the RNA polymerase II (Pol II) backtracking process is poorly understood. We built a Markov State Model from extensive molecular dynamics simulations to identify metastable intermediate states and the dynamics of backtracking at atomistic detail. Our results reveal that Pol II backtracking occurs in a stepwise mode where two intermediate states are involved. We find that the continuous bending motion of the Bridge helix (BH) serves as a critical checkpoint, using the highly conserved BH residue T831 as a sensing probe for the 3'-terminal base paring of RNA:DNA hybrid. If the base pair is mismatched, BH bending can promote the RNA 3'-end nucleotide into a frayed state that further leads to the backtracked state. These computational observations are validated by site-directed mutagenesis and transcript cleavage assays, and provide insights into the key factors that regulate the preferences of the backward translocation

    Probing Local Atomic Environments to Model RNA Energetics and Structure

    Full text link
    Ribonucleic acids (RNA) are critical components of living systems. Understanding RNA structure and its interaction with other molecules is an essential step in understanding RNA-driven processes within the cell. Experimental techniques like X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and chemical probing methods have provided insights into RNA structures on the atomic scale. To effectively exploit experimental data and characterize features of an RNA structure, quantitative descriptors of local atomic environments are required. Here, I investigated different ways to describe RNA local atomic environments. First, I investigated the solvent-accessible surface area (SASA) as a probe of RNA local atomic environment. SASA contains information on the level of exposure of an RNA atom to solvents and, in some cases, is highly correlated to reactivity profiles derived from chemical probing experiments. Using Bayesian/maximum entropy (BME), I was able to reweight RNA structure models based on the agreement between SASA and chemical reactivities. Next, I developed a numerical descriptor (the atomic fingerprint), that is capable of discriminating different atomic environments. Using atomic fingerprints as features enable the prediction of RNA structure and structure-related properties. Two detailed examples are discussed. Firstly, a classification model was developed to predict Mg2+^{2+} ion binding sites. Results indicate that the model could predict Mg2+^{2+} binding sites with reasonable accuracy, and it appears to outperform existing methods. Secondly, a set of models were developed to identify cavities in RNA that are likely to accommodate small-molecule ligands. The models were also used to identify bound-like conformations from an ensemble of RNA structures. The frameworks presented here provide paths to connect the local atomic environment to RNA structure, and I envision they will provide opportunities to develop novel RNA modeling tools.PHDPhysicsUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/163135/1/jingrux_1.pd

    Methods for Molecular Modelling of Protein Complexes.

    Get PDF
    Biological processes are often mediated by complexes formed between proteins and various biomolecules. The 3D structures of such protein-biomolecule complexes provide insights into the molecular mechanism of their action. The structure of these complexes can be predicted by various computational methods. Choosing an appropriate method for modelling depends on the category of biomolecule that a protein interacts with and the availability of structural information about the protein and its interacting partner. We intend for the contents of this chapter to serve as a guide as to what software would be the most appropriate for the type of data at hand and the kind of 3D complex structure required. Particularly, we have dealt with protein-small molecule ligand, protein-peptide, protein-protein, and protein-nucleic acid interactions.Most, if not all, model building protocols perform some sampling and scoring. Typically, several alternate conformations and configurations of the interactors are sampled. Each such sample is then scored for optimization. To boost the confidence in these predicted models, their assessment using other independent scoring schemes besides the inbuilt/default ones would prove to be helpful. This chapter also lists such software and serves as a guide to gauge the fidelity of modelled structures of biomolecular complexes

    Theoretical and computational modeling of rna-ligand interactions

    Get PDF
    Ribonucleic acid (RNA) is a polymeric nucleic acid that plays a variety of critical roles in gene expression and regulation at the level of transcription and translation. Recently, there has been an enormous interest in the development of therapeutic strategies that target RNA molecules. Instead of modifying the product of gene expression, i.e., proteins, RNAtargeted therapeutics aims to modulate the relevant key RNA elements in the disease-related cellular pathways. Such approaches have two significant advantages. First, diseases with related proteins that are difficult or unable to be drugged become druggable by targeting the corresponding messenger RNAs (mRNAs) that encode the amino acid sequences. Second, besides coding mRNAs, the vast majority of the human genome sequences are transcribed to noncoding RNAs (ncRNAs), which serve as enzymatic, structural, and regulatory elements in cellular pathways of most human diseases. Targeting noncoding RNAs would open up remarkable new opportunities for disease treatment. The first step in modeling the RNA-drug interaction is to understand the 3D structure of the given RNA target. With current theoretical models, accurate prediction of 3D structures for large RNAs from sequence remains computationally infeasible. One of the major challenges comes from the flexibility in the RNA molecule, especially in loop/junction regions, and the resulting rugged energy landscape. However, structure probing techniques, such as the “selective 20-hydroxyl acylation analyzed by primer extension” (SHAPE) experiment, enable the quantitative detection of the relative flexibility and hence structure information of RNA structural elements. Therefore, one may incorporate the SHAPE data into RNA 3D structure prediction. In the first project, we investigate the feasibility of using a machine-learning-based approach to predict the SHAPE reactivity from the 3D RNA structure and compare the machine-learning result to that of a physics-based model. In the second project, in order to provide a user-friendly tool for RNA biologists, we developed a fully automated web interface, “SHAPE predictoR” (SHAPER) for predicting SHAPE profile from any given 3D RNA structure. In a cellular environment, various factors, such as metal ions and small molecules, interact with an RNA molecule to modulate RNA cellular activity. RNA is a highly charged polymer with each backbone phosphate group carrying one unit of negative (electronic) charge. In order to fold into a compact functional tertiary structure, it requires metal ions to reduce Coulombic repulsive electrostatic forces by neutralizing the backbone charges. In particular, Mg2+ ion is essential for the folding and stability of RNA tertiary structures. In the third project, we introduce a machine-learning-based model, the “Magnesium convolutional neural network” (MgNet) model, to predict Mg2+ binding site for a given 3D RNA structure, and show the use of the model in investigating the important coordinating RNA atoms and identifying novel Mg2+ binding motifs. Besides Mg2+ ions, small molecules, such as drug molecules, can also bind to an RNA to modulate its activities. Motivated by the tremendous potential of RNA-targeted drug discovery, in the fourth project, we develop a novel approach to predicting RNA-small molecule binding. Specifically, we develop a statistical potential-based scoring/ranking method (SPRank) to identify the native binding mode of the small molecule from a pool of decoys and estimate the binding affinity for the given RNA-small molecule complex. The results tested on a widely used data set suggest that SPRank can achieve (moderately) better performance than the current state-of-art models

    ModeRNA: a tool for comparative modeling of RNA 3D structure

    Get PDF
    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available

    Discovering new potential inhibitors to SARS-CoV-2 RNA dependent RNA polymerase (RdRp) using high throughput virtual screening and molecular dynamics simulations.

    Get PDF
    RNA dependent RNA polymerase (RdRp), is an essential in the RNA replication within the life cycle of the severely acute respiratory coronavirus-2 (SARS-CoV-2), causing the deadly respiratory induced sickness COVID-19. Remdesivir is a prodrug that has seen some success in inhibiting this enzyme, however there is still the pressing need for effective alternatives. In this study, we present the discovery of four non-nucleoside small molecules that bind favorably to SARS-CoV-2 RdRp over the active form of the popular drug remdesivir (RTP) and adenosine triphosphate (ATP) by utilizing high-throughput virtual screening (HTVS) against the vast ZINC compound database coupled with extensive molecular dynamics (MD) simulations. After post-trajectory analysis, we found that the simulations of complexes containing both ATP and RTP remained stable for the duration of their trajectories. Additionally, it was revealed that the phosphate tail of RTP was stabilized by both the positive amino acid pocket and magnesium ions near the entry channel of RdRp which includes residues K551, R553, R555 and K621. It was also found that residues D623, D760, and N691 further stabilized the ribose portion of RTP with U10 on the template RNA strand forming hydrogen pairs with the adenosine motif. Using these models of RdRp, we employed them to screen the ZINC database of ~ 17 million molecules. Using docking and drug properties scoring, we narrowed down our selection to fourteen candidates. These were subjected to 200 ns simulations each underwent free energy calculations. We identified four hit compounds from the ZINC database that have similar binding poses to RTP while possessing lower overall binding free energies, with ZINC097971592 having a binding free energy two times lower than RTP

    Dynamic Energy Landscapes of Riboswitches Help Interpret Conformational Rearrangements and Function

    Get PDF
    Riboswitches are RNAs that modulate gene expression by ligand-induced conformational changes. However, the way in which sequence dictates alternative folding pathways of gene regulation remains unclear. In this study, we compute energy landscapes, which describe the accessible secondary structures for a range of sequence lengths, to analyze the transcriptional process as a given sequence elongates to full length. In line with experimental evidence, we find that most riboswitch landscapes can be characterized by three broad classes as a function of sequence length in terms of the distribution and barrier type of the conformational clusters: low-barrier landscape with an ensemble of different conformations in equilibrium before encountering a substrate; barrier-free landscape in which a direct, dominant “downhill” pathway to the minimum free energy structure is apparent; and a barrier-dominated landscape with two isolated conformational states, each associated with a different biological function. Sharing concepts with the “new view” of protein folding energy landscapes, we term the three sequence ranges above as the sensing, downhill folding, and functional windows, respectively. We find that these energy landscape patterns are conserved in various riboswitch classes, though the order of the windows may vary. In fact, the order of the three windows suggests either kinetic or thermodynamic control of ligand binding. These findings help understand riboswitch structure/function relationships and open new avenues to riboswitch design

    Structure of the Picornavirus Replication Platform: A Potential Drug Target for Inhibiting Virus Replication

    Get PDF
    Picornaviruses are small, positive-stranded RNA viruses, divided into twelve different genera. Members of the Picornaviridae family cause a wide range of human and animal diseases including the common cold, poliomyelitis, foot and mouth disease, and dilated cardiomyopathy. The picornavirus genome is replicated via a highly conserved mechanism involving a presumed cloverleaf structure located at the 5’ noncoding region of the virus genome. The 5’ cloverleaf consists of three stem loops (B, C, and D) and one stem (A), which interact with a variety of virus and host cell proteins during replication. In this dissertation, human rhinovirus serotype 14 (HRV-14) SLB and the 5’cloverleaf (5’CL) solution structures were determined using nuclear magnetic resonance (NMR) and small-angle X-ray scattering (SAXS). HRV-14 SLB adopts a predominantly A-form helical structure. The stem contains five Watson−Crick base pairs and one wobble base pair and is capped by an eight-nucleotide loop. The wobble base pair introduces perturbations in the helical parameters, but does not appear to introduce flexibility. The helix major groove appears to be more accessible than in typical standard A-form RNA. Flexibility is seen throughout the loop and in the terminal nucleotides. The pyrimidine-rich region of the loop, the apparent recognition site for the poly(C) binding protein, is the most disordered region of the structure. The solution structure of HRV-14 5ʹCL was determined in the absence and presence of magnesium. In the absence of magnesium, the structure adopts an open, somewhat extended conformation. In the presence of magnesium, the structure compacts, bringing SLB and SLD into close contact, a geometry that creates an extensive accessible major groove surface, and permits interaction between the proteins that target each stem loop. A deeper understanding of these structures will offer invaluable information regarding the picornavirus replication mechanism. The results from these studies have the potential to elucidate unique drug targets with broad spectrum efficacy against a range of picornaviruses

    Study of complex RNA function modulated by small molecules: the development of RNA directed small molecule library and probing the S-adenosyl methionine discrimination between on and off conformational states of the SAM-I riboswitch

    Get PDF
    RNA recently remained unexploited and is now drawing interest as a potential drug target. The methodology and available drug libraries for RNA targeting/screening are in rudimentary stages. The interactions made by ligands with RNA can be explored for RNA based drug development. The dissertation is composed of 4 chapters. The first chapter focuses on the structural features of RNA and the attempts made to target RNA previously. The second chapter focuses on the development of a small molecule library enriched with substructures derived from RNA binding ligands. For this study a fragment-based approach (fragment based approach is detailed in chapter 2) is used in order to accommodate the conformational flexibility of RNA. The library molecules are used for screening against suitable RNA targets using NMR. We identified at least 5 ligands out of which 2 are novel ligands binding to the ribosomal 16s rRNA. The third chapter is focused on the role of small molecules in inducing conformational changes in an RNA genetic regulatory element called the S-Adenosyl methionine (SAM) SAM-I riboswitch. The mechanistic features of the SAM-I riboswitch to understand the basis for specificity and discrimination and its gene regulation mechanism are reported. To address the conformational dynamics Bacillus subtilis and Thermoanearobacter tencongenesis SAM-I riboswitches in response to SAM binding several conformer mimics are designed, synthesized and characterized using NMR, equilibrium dialysis, and inline probing. The study shows that apart from the conserved residues of the binding pocket, residues downstream of the binding pocket are involved in detecting SAM and assist the binding of SAM to the riboswitch with weak affinity. Our data highlights the capacity of a so-called antiterminator helix from the expression platform to assist the formation of a partial P1 helix of the aptamer domain. A stable P1 is involved in recognition and tight binding of SAM. Our in vitro experiments suggest that the riboswitch could switch from an unbound conformation to tightly SAM bound structure through weakly binding intermediate structures in the presence of the small molecule SAM. The future directions are included in the fourth chapter along with the conclusions
    corecore