Search CORE

2,606 research outputs found

Ab initio RNA folding

Author: Cragnolini Tristan
Derreumaux Philippe
Pasquali Samuela
Publication venue: 'IOP Publishing'
Publication date: 30/12/2014
Field of study

RNA molecules are essential cellular machines performing a wide variety of functions for which a specific three-dimensional structure is required. Over the last several years, experimental determination of RNA structures through X-ray crystallography and NMR seems to have reached a plateau in the number of structures resolved each year, but as more and more RNA sequences are being discovered, need for structure prediction tools to complement experimental data is strong. Theoretical approaches to RNA folding have been developed since the late nineties when the first algorithms for secondary structure prediction appeared. Over the last 10 years a number of prediction methods for 3D structures have been developed, first based on bioinformatics and data-mining, and more recently based on a coarse-grained physical representation of the systems. In this review we are going to present the challenges of RNA structure prediction and the main ideas behind bioinformatic approaches and physics-based approaches. We will focus on the description of the more recent physics-based phenomenological models and on how they are built to include the specificity of the interactions of RNA bases, whose role is critical in folding. Through examples from different models, we will point out the strengths of physics-based approaches, which are able not only to predict equilibrium structures, but also to investigate dynamical and thermodynamical behavior, and the open challenges to include more key interactions ruling RNA folding.Comment: 28 pages, 18 figure

arXiv.org e-Print Archive

Hal-Diderot

Empirical Potential Function for Simplified Protein Models: Combining Contact and Local Sequence-Structure Descriptors

Author: Adamian
Anfinsen
Avbelj
Bahar
Bastolla
Betancourt
Brooks
Buchete
Cannata
Chiu
Cline
Crasto
Dill
Dill
Dima
Dobson
Fain
Fitzkee
Fletcher
Friedrichs
Gan
Goldstein
Guntert
Hao
Head-Gordon
Hou
Hu
Hunter
Joachims
Kolinski
Kolodny
Kuang
Lazaridis
Levinthal
Levitt
Lezon
Li
Li
Liang
Loose
Lu
Maiorov
McConkey
McGuffin
Mirny
Miyazawa
Murphy
Park
Park
Pearlman
Pei
Przytycka
Riddle
Sagot
Samudrala
Samudrala
Schölkopf
Shortle
Shortle
Simons
Simons
Simons
Thomas
Tobi
Tobi
Tsai
Vendruscolo
Vendruscolo
Vriend
Wang
Wang
Xia
Xia
Zhang
Zhang
Zhang
Zhou
Publication venue: 'Wiley'
Publication date: 01/01/2006
Field of study

An effective potential function is critical for protein structure prediction and folding simulation. Simplified protein models such as those requiring only

C_\alpha

or backbone atoms are attractive because they enable efficient search of the conformational space. We show residue specific reduced discrete state models can represent the backbone conformations of proteins with small RMSD values. However, no potential functions exist that are designed for such simplified protein models. In this study, we develop optimal potential functions by combining contact interaction descriptors and local sequence-structure descriptors. The form of the potential function is a weighted linear sum of all descriptors, and the optimal weight coefficients are obtained through optimization using both native and decoy structures. The performance of the potential function in test of discriminating native protein structures from decoys is evaluated using several benchmark decoy sets. Our potential function requiring only backbone atoms or

C_\alpha

atoms have comparable or better performance than several residue-based potential functions that require additional coordinates of side chain centers or coordinates of all side chain atoms. By reducing the residue alphabets down to size 5 for local structure-sequence relationship, the performance of the potential function can be further improved. Our results also suggest that local sequence-structure correlation may play important role in reducing the entropic cost of protein folding.Comment: 20 pages, 5 figures, 4 tables. In press, Protein

arXiv.org e-Print Archive

Crossref

Draft crystal structure of the vault shell at 9-A resolution.

Author: Anderson Daniel H
Eisenberg David
Kickhoefer Valerie A
Rome Leonard H
Sievers Stuart A
Publication venue: eScholarship, University of California
Publication date: 01/11/2007
Field of study

Vaults are the largest known cytoplasmic ribonucleoprotein structures and may function in innate immunity. The vault shell self-assembles from 96 copies of major vault protein and encapsulates two other proteins and a small RNA. We crystallized rat liver vaults and several recombinant vaults, all among the largest non-icosahedral particles to have been crystallized. The best crystals thus far were formed from empty vaults built from a cysteine-tag construct of major vault protein (termed cpMVP vaults), diffracting to about 9-A resolution. The asymmetric unit contains a half vault of molecular mass 4.65 MDa. X-ray phasing was initiated by molecular replacement, using density from cryo-electron microscopy (cryo-EM). Phases were improved by density modification, including concentric 24- and 48-fold rotational symmetry averaging. From this, the continuous cryo-EM electron density separated into domain-like blocks. A draft atomic model of cpMVP was fit to this improved density from 15 domain models. Three domains were adapted from a nuclear magnetic resonance substructure. Nine domain models originated in ab initio tertiary structure prediction. Three C-terminal domains were built by fitting poly-alanine to the electron density. Locations of loops in this model provide sites to test vault functions and to exploit vaults as nanocapsules

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Automated protein structure modeling in CASP9 by I‐TASSER pipeline combined with QUARK‐based ab initio folding and FG‐MD‐based structure refinement

Author: Roy Ambrish
Xu Dong
Zhang Jian
Zhang Yang
Publication venue: 'Wiley'
Publication date: 01/01/2011
Field of study

I‐TASSER is an automated pipeline for protein tertiary structure prediction using multiple threading alignments and iterative structure assembly simulations. In CASP9 experiments, two new algorithms, QUARK and fragment‐guided molecular dynamics (FG‐MD), were added to the I‐TASSER pipeline for improving the structural modeling accuracy. QUARK is a de novo structure prediction algorithm used for structure modeling of proteins that lack detectable template structures. For distantly homologous targets, QUARK models are found useful as a reference structure for selecting good threading alignments and guiding the I‐TASSER structure assembly simulations. FG‐MD is an atomic‐level structural refinement program that uses structural fragments collected from the PDB structures to guide molecular dynamics simulation and improve the local structure of predicted model, including hydrogen‐bonding networks, torsion angles, and steric clashes. Despite considerable progress in both the template‐based and template‐free structure modeling, significant improvements on protein target classification, domain parsing, model selection, and ab initio folding of β‐proteins are still needed to further improve the I‐TASSER pipeline. Proteins 2011; © 2011 Wiley‐Liss, Inc.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/88077/1/23111_ftp.pd

PubMed Central

Deep Blue Documents at the University of Michigan

Mass & secondary structure propensity of amino acids explain their mutability and evolutionary replacements

Author: Bohórquez Hugo J.
Patarroyo Manuel Elkin
Suárez Carlos F.
Publication venue
Publication date: 01/01/2017
Field of study

Why is an amino acid replacement in a protein accepted during evolution? The answer given by bioinformatics relies on the frequency of change of each amino acid by another one and the propensity of each to remain unchanged. We propose that these replacement rules are recoverable from the secondary structural trends of amino acids. A distance measure between high-resolution Ramachandran distributions reveals that structurally similar residues coincide with those found in substitution matrices such as BLOSUM: Asn Asp, Phe Tyr, Lys Arg, Gln Glu, Ile Val, Met → Leu; with Ala, Cys, His, Gly, Ser, Pro, and Thr, as structurally idiosyncratic residues. We also found a high average correlation (\overline{R} R = 0.85) between thirty amino acid mutability scales and the mutational inertia (I X ), which measures the energetic cost weighted by the number of observations at the most probable amino acid conformation. These results indicate that amino acid substitutions follow two optimally-efficient principles: (a) amino acids interchangeability privileges their secondary structural similarity, and (b) the amino acid mutability depends directly on its biosynthetic energy cost, and inversely with its frequency. These two principles are the underlying rules governing the observed amino acid substitutions. © 2017 The Author(s)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

PROTINFO: new algorithms for enhanced protein structure predictions

Author: Hung Ling-Hong
Liu Tianyun
Ngan Shing-Chung
Samudrala Ram
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

We describe new algorithms and modules for protein structure prediction available as part of the PROTINFO web server. The modules, comparative and de novo modelling, have significantly improved back-end algorithms that were rigorously evaluated at the sixth meeting on the Critical Assessment of Protein Structure Prediction methods. We were one of four server groups invited to make an oral presentation (only the best performing groups are asked to do so). These two modules allow a user to submit a protein sequence and return atomic coordinates representing the tertiary structure of that protein. The PROTINFO server is available at

CiteSeerX

Crossref

PubMed Central

Kinetics and Thermodynamics of Protein Folding

Author: Duan Yong
Lei Hongxing
Publication venue: 'IntechOpen'
Publication date: 22/09/2011
Field of study

IntechOpen

Crossref

Template Based Modeling and Structural Refinement of Protein-Protein Interactions.

Author: Govindarajoo Brandon
Publication venue
Publication date: 01/01/2016
Field of study

Determining protein structures from sequence is a fundamental problem in molecular biology, as protein structure is essential to understanding protein function. In this study, I developed one of the first fully automated pipelines for template based quaternary structure prediction starting from sequence. Two critical steps for template based modeling are identifying the correct homologous structures by threading which generates sequence to structure alignments and refining the initial threading template coordinates closer to the native conformation. I developed SPRING (single-chain-based prediction of interactions and geometries), a monomer threading to dimer template mapping program, which was compared to the dimer co-threading program, COTH, using 1838 non homologous target complex structures. SPRING’s similarity score outperformed COTH in the first place ranking of templates, correctly identifying 798 and 527 interfaces respectively. More importantly the results were found to be complementary and the programs could be combined in a consensus based threading program showing a 5.1% improvement compared to SPRING. Template based modeling requires a structural analog being present in the PDB. A full search of the PDB, using threading and structural alignment, revealed that only 48.7% of the PDB has a suitable template whereas only 39.4% of the PDB has templates that can be identified by threading. In order to circumvent this, I included intramolecular domain-domain interfaces into the PDB library to boost template recognition of protein dimers; the merging of the two classes of interfaces improved recognition of heterodimers by 40% using benchmark settings. Next the template based assembly of protein complexes pipeline, TACOS, was created. The pipeline combines threading templates and domain knowledge from the PDB into a knowledge based energy score. The energy score is integrated into a Monte Carlo sampling simulation that drives the initial template closer to the native topology. The full pipeline was benchmarked using 350 non homologous structures and compared to two state of the art programs for dimeric structure prediction: ZDOCK and MODELLER. On average, TACOS models global and interface structure have a better quality than the models generated by MODELLER and ZDOCK.PHDBioinformaticsUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/135847/1/bgovi_1.pd

Deep Blue Documents at the University of Michigan

Structural Bioinformatics of Proteins: Predicting the Tertiary and Quaternary Structure of Proteins from Sequence

Author: Attila Gursoy
Baldo Oliva
Elisenda Feliu
Jaume Bonet
Joan Planas-Iglesias
Manuel A. Marín-López
Publication venue: 'IntechOpen'
Publication date: 01/01/2012
Field of study

IntechOpen

Copenhagen University Research Information System