5,476 research outputs found
Protein Docking by the Underestimation of Free Energy Funnels in the Space of Encounter Complexes
Similarly to protein folding, the association of two proteins is driven
by a free energy funnel, determined by favorable interactions in some neighborhood of the
native state. We describe a docking method based on stochastic global minimization of
funnel-shaped energy functions in the space of rigid body motions (SE(3)) while accounting
for flexibility of the interface side chains. The method, called semi-definite
programming-based underestimation (SDU), employs a general quadratic function to
underestimate a set of local energy minima and uses the resulting underestimator to bias
further sampling. While SDU effectively minimizes functions with funnel-shaped basins, its
application to docking in the rotational and translational space SE(3) is not
straightforward due to the geometry of that space. We introduce a strategy that uses
separate independent variables for side-chain optimization, center-to-center distance of the
two proteins, and five angular descriptors of the relative orientations of the molecules.
The removal of the center-to-center distance turns out to vastly improve the efficiency of
the search, because the five-dimensional space now exhibits a well-behaved energy surface
suitable for underestimation. This algorithm explores the free energy surface spanned by
encounter complexes that correspond to local free energy minima and shows similarity to the
model of macromolecular association that proceeds through a series of collisions. Results
for standard protein docking benchmarks establish that in this space the free energy
landscape is a funnel in a reasonably broad neighborhood of the native state and that the
SDU strategy can generate docking predictions with less than 5 � ligand interface Ca
root-mean-square deviation while achieving an approximately 20-fold efficiency gain compared
to Monte Carlo methods
Paradigms for computational nucleic acid design
The design of DNA and RNA sequences is critical for many endeavors, from DNA nanotechnology, to PCR‐based applications, to DNA hybridization arrays. Results in the literature rely on a wide variety of design criteria adapted to the particular requirements of each application. Using an extensively studied thermodynamic model, we perform a detailed study of several criteria for designing sequences intended to adopt a target secondary structure. We conclude that superior design methods should explicitly implement both a positive design paradigm (optimize affinity for the target structure) and a negative design paradigm (optimize specificity for the target structure). The commonly used approaches of sequence symmetry minimization and minimum free‐energy satisfaction primarily implement negative design and can be strengthened by introducing a positive design component. Surprisingly, our findings hold for a wide range of secondary structures and are robust to modest perturbation of the thermodynamic parameters used for evaluating sequence quality, suggesting the feasibility and ongoing utility of a unified approach to nucleic acid design as parameter sets are refined further. Finally, we observe that designing for thermodynamic stability does not determine folding kinetics, emphasizing the opportunity for extending design criteria to target kinetic features of the energy landscape
The Energy Landscape, Folding Pathways and the Kinetics of a Knotted Protein
The folding pathway and rate coefficients of the folding of a knotted protein
are calculated for a potential energy function with minimal energetic
frustration. A kinetic transition network is constructed using the discrete
path sampling approach, and the resulting potential energy surface is
visualized by constructing disconnectivity graphs. Owing to topological
constraints, the low-lying portion of the landscape consists of three distinct
regions, corresponding to the native knotted state and to configurations where
either the N- or C-terminus is not yet folded into the knot. The fastest
folding pathways from denatured states exhibit early formation of the
N-terminus portion of the knot and a rate-determining step where the C-terminus
is incorporated. The low-lying minima with the N-terminus knotted and the
C-terminus free therefore constitute an off-pathway intermediate for this
model. The insertion of both the N- and C-termini into the knot occur late in
the folding process, creating large energy barriers that are the rate limiting
steps in the folding process. When compared to other protein folding proteins
of a similar length, this system folds over six orders of magnitude more
slowly.Comment: 19 page
Protein docking refinement by convex underestimation in the low-dimensional subspace of encounter complexes
We propose a novel stochastic global optimization algorithm with applications to the refinement stage of protein docking prediction methods. Our approach can process conformations sampled from multiple clusters, each roughly corresponding to a different binding energy funnel. These clusters are obtained using a density-based clustering method. In each cluster, we identify a smooth “permissive” subspace which avoids high-energy barriers and then underestimate the binding energy function using general convex polynomials in this subspace. We use the underestimator to bias sampling towards its global minimum. Sampling and subspace underestimation are repeated several times and the conformations sampled at the last iteration form a refined ensemble. We report computational results on a comprehensive benchmark of 224 protein complexes, establishing that our refined ensemble significantly improves the quality of the conformations of the original set given to the algorithm. We also devise a method to enhance the ensemble from which near-native models are selected.Published versio
Energy Landscape and Global Optimization for a Frustrated Model Protein
The three-color (BLN) 69-residue model protein was designed to exhibit frustrated folding. We investigate the energy landscape of this protein using disconnectivity graphs and compare it to a Go model, which is designed to reduce the frustration by removing all non-native attractive interactions. Finding the global minimum on a frustrated energy landscape is a good test of global optimization techniques, and we present calculations evaluating the performance of basin-hopping and genetic algorithms for this system.Comparisons are made with the widely studied 46-residue BLN protein.We show that the energy landscape of the 69-residue BLN protein contains several deep funnels, each of which corresponds to a different β-barrel structure
Introduction to protein folding for physicists
The prediction of the three-dimensional native structure of proteins from the
knowledge of their amino acid sequence, known as the protein folding problem,
is one of the most important yet unsolved issues of modern science. Since the
conformational behaviour of flexible molecules is nothing more than a complex
physical problem, increasingly more physicists are moving into the study of
protein systems, bringing with them powerful mathematical and computational
tools, as well as the sharp intuition and deep images inherent to the physics
discipline. This work attempts to facilitate the first steps of such a
transition. In order to achieve this goal, we provide an exhaustive account of
the reasons underlying the protein folding problem enormous relevance and
summarize the present-day status of the methods aimed to solving it. We also
provide an introduction to the particular structure of these biological
heteropolymers, and we physically define the problem stating the assumptions
behind this (commonly implicit) definition. Finally, we review the 'special
flavor' of statistical mechanics that is typically used to study the
astronomically large phase spaces of macromolecules. Throughout the whole work,
much material that is found scattered in the literature has been put together
here to improve comprehension and to serve as a handy reference.Comment: 53 pages, 18 figures, the figures are at a low resolution due to
arXiv restrictions, for high-res figures, go to http://www.pabloechenique.co
Empirical Potential Function for Simplified Protein Models: Combining Contact and Local Sequence-Structure Descriptors
An effective potential function is critical for protein structure prediction
and folding simulation. Simplified protein models such as those requiring only
or backbone atoms are attractive because they enable efficient
search of the conformational space. We show residue specific reduced discrete
state models can represent the backbone conformations of proteins with small
RMSD values. However, no potential functions exist that are designed for such
simplified protein models. In this study, we develop optimal potential
functions by combining contact interaction descriptors and local
sequence-structure descriptors. The form of the potential function is a
weighted linear sum of all descriptors, and the optimal weight coefficients are
obtained through optimization using both native and decoy structures. The
performance of the potential function in test of discriminating native protein
structures from decoys is evaluated using several benchmark decoy sets. Our
potential function requiring only backbone atoms or atoms have
comparable or better performance than several residue-based potential functions
that require additional coordinates of side chain centers or coordinates of all
side chain atoms. By reducing the residue alphabets down to size 5 for local
structure-sequence relationship, the performance of the potential function can
be further improved. Our results also suggest that local sequence-structure
correlation may play important role in reducing the entropic cost of protein
folding.Comment: 20 pages, 5 figures, 4 tables. In press, Protein
A multiobjective optimization approach to statistical mechanics
Optimization problems have been the subject of statistical physics
approximations. A specially relevant and general scenario is provided by
optimization methods considering tradeoffs between cost and efficiency, where
optimal solutions involve a compromise between both. The theory of Pareto (or
multi objective) optimization provides a general framework to explore these
problems and find the space of possible solutions compatible with the
underlying tradeoffs, known as the {\em Pareto front}. Conflicts between
constraints can lead to complex landscapes of Pareto optimal solutions with
interesting implications in economy, engineering, or evolutionary biology.
Despite their disparate nature, here we show how the structure of the Pareto
front uncovers profound universal features that can be understood in the
context of thermodynamics. In particular, our study reveals that different
fronts are connected to different classes of phase transitions, which we can
define robustly, along with critical points and thermodynamic potentials. These
equivalences are illustrated with classic thermodynamic examples.Comment: 14 pages, 8 figure
- …