Search CORE

5,476 research outputs found

Protein Docking by the Underestimation of Free Energy Funnels in the Space of Encounter Complexes

Author: Paschalidis Ioannis Ch.
Shen Yang
Vajda Sandor
Vakili Pirooz
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2008
Field of study

Similarly to protein folding, the association of two proteins is driven by a free energy funnel, determined by favorable interactions in some neighborhood of the native state. We describe a docking method based on stochastic global minimization of funnel-shaped energy functions in the space of rigid body motions (SE(3)) while accounting for flexibility of the interface side chains. The method, called semi-definite programming-based underestimation (SDU), employs a general quadratic function to underestimate a set of local energy minima and uses the resulting underestimator to bias further sampling. While SDU effectively minimizes functions with funnel-shaped basins, its application to docking in the rotational and translational space SE(3) is not straightforward due to the geometry of that space. We introduce a strategy that uses separate independent variables for side-chain optimization, center-to-center distance of the two proteins, and five angular descriptors of the relative orientations of the molecules. The removal of the center-to-center distance turns out to vastly improve the efficiency of the search, because the five-dimensional space now exhibits a well-behaved energy surface suitable for underestimation. This algorithm explores the free energy surface spanned by encounter complexes that correspond to local free energy minima and shows similarity to the model of macromolecular association that proceeds through a series of collisions. Results for standard protein docking benchmarks establish that in this space the free energy landscape is a funnel in a reasonably broad neighborhood of the native state and that the SDU strategy can generate docking predictions with less than 5 � ligand interface Ca root-mean-square deviation while achieving an approximately 20-fold efficiency gain compared to Monte Carlo methods

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

PubMed Central

Paradigms for computational nucleic acid design

Author: Dirks Robert M.
Lin Milo
Pierce Niles A.
Winfree Erik
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2004
Field of study

The design of DNA and RNA sequences is critical for many endeavors, from DNA nanotechnology, to PCR‐based applications, to DNA hybridization arrays. Results in the literature rely on a wide variety of design criteria adapted to the particular requirements of each application. Using an extensively studied thermodynamic model, we perform a detailed study of several criteria for designing sequences intended to adopt a target secondary structure. We conclude that superior design methods should explicitly implement both a positive design paradigm (optimize affinity for the target structure) and a negative design paradigm (optimize specificity for the target structure). The commonly used approaches of sequence symmetry minimization and minimum free‐energy satisfaction primarily implement negative design and can be strengthened by introducing a positive design component. Surprisingly, our findings hold for a wide range of secondary structures and are robust to modest perturbation of the thermodynamic parameters used for evaluating sequence quality, suggesting the feasibility and ongoing utility of a unified approach to nucleic acid design as parameter sets are refined further. Finally, we observe that designing for thermodynamic stability does not determine folding kinetics, emphasizing the opportunity for extending design criteria to target kinetic features of the energy landscape

CiteSeerX

Caltech Authors

The Energy Landscape, Folding Pathways and the Kinetics of a Knotted Protein

The folding pathway and rate coefficients of the folding of a knotted protein are calculated for a potential energy function with minimal energetic frustration. A kinetic transition network is constructed using the discrete path sampling approach, and the resulting potential energy surface is visualized by constructing disconnectivity graphs. Owing to topological constraints, the low-lying portion of the landscape consists of three distinct regions, corresponding to the native knotted state and to configurations where either the N- or C-terminus is not yet folded into the knot. The fastest folding pathways from denatured states exhibit early formation of the N-terminus portion of the knot and a rate-determining step where the C-terminus is incorporated. The low-lying minima with the N-terminus knotted and the C-terminus free therefore constitute an off-pathway intermediate for this model. The insertion of both the N- and C-termini into the knot occur late in the folding process, creating large energy barriers that are the rate limiting steps in the folding process. When compared to other protein folding proteins of a similar length, this system folds over six orders of magnitude more slowly.Comment: 19 page

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Protein docking refinement by convex underestimation in the low-dimensional subspace of encounter complexes

Author: Kozakov Dmytro
Li Keyong
Moghadasi Mohammad
Nan Feng
Paschalidis Ioannis Ch.
Roshandelpoor Athar
Vajda Sandor
Vakili Pirooz
Zarbafian Shahrooz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/04/2018
Field of study

We propose a novel stochastic global optimization algorithm with applications to the refinement stage of protein docking prediction methods. Our approach can process conformations sampled from multiple clusters, each roughly corresponding to a different binding energy funnel. These clusters are obtained using a density-based clustering method. In each cluster, we identify a smooth “permissive” subspace which avoids high-energy barriers and then underestimate the binding energy function using general convex polynomials in this subspace. We use the underestimator to bias sampling towards its global minimum. Sampling and subspace underestimation are repeated several times and the conformations sampled at the last iteration form a refined ensemble. We report computational results on a comprehensive benchmark of 224 protein complexes, establishing that our refined ensemble significantly improves the quality of the conformations of the original set given to the algorithm. We also devise a method to enhance the ensemble from which near-native models are selected.Published versio

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

Energy Landscape and Global Optimization for a Frustrated Model Protein

Author: Becker O. M.
Berry R. S.
Brown S.
Bryngelson J. D.
Carr J. M.
Cox G. A.
Cvijović D.
David J. Wales
Del Carpio C. A.
Elmaci N.
Evans D. A.
Guo Z.
Guo Z. Y.
Guo Z. Y.
Henkelman G.
Honeycutt J. D.
Honeycutt J. D.
Humphrey W.
Johnston R. L.
Kim J.
Kim J.
Kim J.
Kim S. Y.
Komatsuzaki T.
Kumeda Y.
Larrass S. A.
Li Z.
Mark T. Oakley
Mezey P. G.
Miller M. A.
Munro L. J.
Nymeyer H.
Onuchic J. N.
Pan P. W.
Pedersen J. T.
Rabow A. A.
Roy L. Johnston
Rylance G. J.
Shea J. E.
Shea J.-E.
Stoycheva A. D.
Ueda Y.
Unger R.
Vekhter B.
Vekhter B.
Wales D. J.
Wales D. J.
Wales D. J.
Wales D. J.
Wales D. J.
Wales D. J.
Wales D. J.
Wales D. J.
Publication venue: American Chemical Society
Publication date: 26/08/2011
Field of study

The three-color (BLN) 69-residue model protein was designed to exhibit frustrated folding. We investigate the energy landscape of this protein using disconnectivity graphs and compare it to a Go model, which is designed to reduce the frustration by removing all non-native attractive interactions. Finding the global minimum on a frustrated energy landscape is a good test of global optimization techniques, and we present calculations evaluating the performance of basin-hopping and genetic algorithms for this system.Comparisons are made with the widely studied 46-residue BLN protein.We show that the energy landscape of the 69-residue BLN protein contains several deep funnels, each of which corresponds to a different β-barrel structure

Crossref

University of Birmingham Research Portal

PubMed Central

Introduction to protein folding for physicists

Author: Abagyan R. A.
Arnold V. I.
Calera C. Gómez-Moreno
Chothia C.
Creighton T. E.
Dawkins R.
Dawkins R.
Drenth J.
Dubrovin B. A.
Fersht A.
Gibson G.
Goldstein H.
Greiner W.
Halgren T. A.
Haltia T.
Head-Gordon T.
Lesk A. M.
Levinthal C.
Pablo Echenique
Plotkin S. S.
Smith C.
van Kampen N. G.
Publication venue: 'Informa UK Limited'
Publication date: 01/04/2007
Field of study

The prediction of the three-dimensional native structure of proteins from the knowledge of their amino acid sequence, known as the protein folding problem, is one of the most important yet unsolved issues of modern science. Since the conformational behaviour of flexible molecules is nothing more than a complex physical problem, increasingly more physicists are moving into the study of protein systems, bringing with them powerful mathematical and computational tools, as well as the sharp intuition and deep images inherent to the physics discipline. This work attempts to facilitate the first steps of such a transition. In order to achieve this goal, we provide an exhaustive account of the reasons underlying the protein folding problem enormous relevance and summarize the present-day status of the methods aimed to solving it. We also provide an introduction to the particular structure of these biological heteropolymers, and we physically define the problem stating the assumptions behind this (commonly implicit) definition. Finally, we review the 'special flavor' of statistical mechanics that is typically used to study the astronomically large phase spaces of macromolecules. Throughout the whole work, much material that is found scattered in the literature has been put together here to improve comprehension and to serve as a handy reference.Comment: 53 pages, 18 figures, the figures are at a low resolution due to arXiv restrictions, for high-res figures, go to http://www.pabloechenique.co

arXiv.org e-Print Archive

Crossref

Digital.CSIC

Empirical Potential Function for Simplified Protein Models: Combining Contact and Local Sequence-Structure Descriptors

Author: Adamian
Anfinsen
Avbelj
Bahar
Bastolla
Betancourt
Brooks
Buchete
Cannata
Chiu
Cline
Crasto
Dill
Dill
Dima
Dobson
Fain
Fitzkee
Fletcher
Friedrichs
Gan
Goldstein
Guntert
Hao
Head-Gordon
Hou
Hu
Hunter
Joachims
Kolinski
Kolodny
Kuang
Lazaridis
Levinthal
Levitt
Lezon
Li
Li
Liang
Loose
Lu
Maiorov
McConkey
McGuffin
Mirny
Miyazawa
Murphy
Park
Park
Pearlman
Pei
Przytycka
Riddle
Sagot
Samudrala
Samudrala
Schölkopf
Shortle
Shortle
Simons
Simons
Simons
Thomas
Tobi
Tobi
Tsai
Vendruscolo
Vendruscolo
Vriend
Wang
Wang
Xia
Xia
Zhang
Zhang
Zhang
Zhou
Publication venue: 'Wiley'
Publication date: 01/01/2006
Field of study

An effective potential function is critical for protein structure prediction and folding simulation. Simplified protein models such as those requiring only

C_\alpha

or backbone atoms are attractive because they enable efficient search of the conformational space. We show residue specific reduced discrete state models can represent the backbone conformations of proteins with small RMSD values. However, no potential functions exist that are designed for such simplified protein models. In this study, we develop optimal potential functions by combining contact interaction descriptors and local sequence-structure descriptors. The form of the potential function is a weighted linear sum of all descriptors, and the optimal weight coefficients are obtained through optimization using both native and decoy structures. The performance of the potential function in test of discriminating native protein structures from decoys is evaluated using several benchmark decoy sets. Our potential function requiring only backbone atoms or

C_\alpha

atoms have comparable or better performance than several residue-based potential functions that require additional coordinates of side chain centers or coordinates of all side chain atoms. By reducing the residue alphabets down to size 5 for local structure-sequence relationship, the performance of the potential function can be further improved. Our results also suggest that local sequence-structure correlation may play important role in reducing the entropic cost of protein folding.Comment: 20 pages, 5 figures, 4 tables. In press, Protein

arXiv.org e-Print Archive

Crossref

A multiobjective optimization approach to statistical mechanics

Author: Seoane Luís F.
Solé Ricard V.
Publication venue
Publication date: 15/09/2015
Field of study

Optimization problems have been the subject of statistical physics approximations. A specially relevant and general scenario is provided by optimization methods considering tradeoffs between cost and efficiency, where optimal solutions involve a compromise between both. The theory of Pareto (or multi objective) optimization provides a general framework to explore these problems and find the space of possible solutions compatible with the underlying tradeoffs, known as the {\em Pareto front}. Conflicts between constraints can lead to complex landscapes of Pareto optimal solutions with interesting implications in economy, engineering, or evolutionary biology. Despite their disparate nature, here we show how the structure of the Pareto front uncovers profound universal features that can be understood in the context of thermodynamics. In particular, our study reveals that different fronts are connected to different classes of phase transitions, which we can define robustly, along with critical points and thermodynamic potentials. These equivalences are illustrated with classic thermodynamic examples.Comment: 14 pages, 8 figure

arXiv.org e-Print Archive

CiteSeerX