2,602 research outputs found
Encounter complexes and dimensionality reduction in protein-protein association
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition
Protein folding in high-dimensional spaces:hypergutters and the role of non-native interactions
We explore the consequences of very high dimensionality in the dynamical
landscape of protein folding. Consideration of both typical range of
stabilising interactions, and folding rates themselves, leads to a model of the
energy hypersurface that is characterised by the structure of diffusive
"hypergutters" as well as the familiar "funnels". Several general predictions
result: (1) intermediate subspaces of configurations will always be visited;
(2) specific but non-native interactions are important in stabilising these
low-dimensional diffusive searches on the folding pathway; (3) sequential
barriers will commonly be found, even in "two-state"proteins; (4) very early
times will show charactreristic departures from single-exponential kinetics;
(5) contributions of non-native interactions to phi-values are calculable, and
may be significant. The example of a three-helix bundle is treated in more
detail as an illustration. The model also shows that high-dimensional
structures provide conceptual relations between the "folding funnel",
"diffusion-collision", "nucleation-condensation" and "topomer search" models of
protein folding. It suggests that kinetic strategies for fast folding may be
encoded rather generally in non-native, rather than native interactions. The
predictions are related to very recent findings in experiment and simulation.Comment: Submitted to Biophys.
Protein folding tames chaos
Protein folding produces characteristic and functional three-dimensional
structures from unfolded polypeptides or disordered coils. The emergence of
extraordinary complexity in the protein folding process poses astonishing
challenges to theoretical modeling and computer simulations. The present work
introduces molecular nonlinear dynamics (MND), or molecular chaotic dynamics,
as a theoretical framework for describing and analyzing protein folding. We
unveil the existence of intrinsically low dimensional manifolds (ILDMs) in the
chaotic dynamics of folded proteins. Additionally, we reveal that the
transition from disordered to ordered conformations in protein folding
increases the transverse stability of the ILDM. Stated differently, protein
folding reduces the chaoticity of the nonlinear dynamical system, and a folded
protein has the best ability to tame chaos. Additionally, we bring to light the
connection between the ILDM stability and the thermodynamic stability, which
enables us to quantify the disorderliness and relative energies of folded,
misfolded and unfolded protein states. Finally, we exploit chaos for protein
flexibility analysis and develop a robust chaotic algorithm for the prediction
of Debye-Waller factors, or temperature factors, of protein structures
Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening
This work introduces a number of algebraic topology approaches, such as
multicomponent persistent homology, multi-level persistent homology and
electrostatic persistence for the representation, characterization, and
description of small molecules and biomolecular complexes. Multicomponent
persistent homology retains critical chemical and biological information during
the topological simplification of biomolecular geometric complexity.
Multi-level persistent homology enables a tailored topological description of
inter- and/or intra-molecular interactions of interest. Electrostatic
persistence incorporates partial charge information into topological
invariants. These topological methods are paired with Wasserstein distance to
characterize similarities between molecules and are further integrated with a
variety of machine learning algorithms, including k-nearest neighbors, ensemble
of trees, and deep convolutional neural networks, to manifest their descriptive
and predictive powers for chemical and biological problems. Extensive numerical
experiments involving more than 4,000 protein-ligand complexes from the PDBBind
database and near 100,000 ligands and decoys in the DUD database are performed
to test respectively the scoring power and the virtual screening power of the
proposed topological approaches. It is demonstrated that the present approaches
outperform the modern machine learning based methods in protein-ligand binding
affinity predictions and ligand-decoy discrimination
- …