2,602 research outputs found

    Encounter complexes and dimensionality reduction in protein-protein association

    Get PDF
    An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition

    Protein folding in high-dimensional spaces:hypergutters and the role of non-native interactions

    Get PDF
    We explore the consequences of very high dimensionality in the dynamical landscape of protein folding. Consideration of both typical range of stabilising interactions, and folding rates themselves, leads to a model of the energy hypersurface that is characterised by the structure of diffusive "hypergutters" as well as the familiar "funnels". Several general predictions result: (1) intermediate subspaces of configurations will always be visited; (2) specific but non-native interactions are important in stabilising these low-dimensional diffusive searches on the folding pathway; (3) sequential barriers will commonly be found, even in "two-state"proteins; (4) very early times will show charactreristic departures from single-exponential kinetics; (5) contributions of non-native interactions to phi-values are calculable, and may be significant. The example of a three-helix bundle is treated in more detail as an illustration. The model also shows that high-dimensional structures provide conceptual relations between the "folding funnel", "diffusion-collision", "nucleation-condensation" and "topomer search" models of protein folding. It suggests that kinetic strategies for fast folding may be encoded rather generally in non-native, rather than native interactions. The predictions are related to very recent findings in experiment and simulation.Comment: Submitted to Biophys.

    Protein folding tames chaos

    Full text link
    Protein folding produces characteristic and functional three-dimensional structures from unfolded polypeptides or disordered coils. The emergence of extraordinary complexity in the protein folding process poses astonishing challenges to theoretical modeling and computer simulations. The present work introduces molecular nonlinear dynamics (MND), or molecular chaotic dynamics, as a theoretical framework for describing and analyzing protein folding. We unveil the existence of intrinsically low dimensional manifolds (ILDMs) in the chaotic dynamics of folded proteins. Additionally, we reveal that the transition from disordered to ordered conformations in protein folding increases the transverse stability of the ILDM. Stated differently, protein folding reduces the chaoticity of the nonlinear dynamical system, and a folded protein has the best ability to tame chaos. Additionally, we bring to light the connection between the ILDM stability and the thermodynamic stability, which enables us to quantify the disorderliness and relative energies of folded, misfolded and unfolded protein states. Finally, we exploit chaos for protein flexibility analysis and develop a robust chaotic algorithm for the prediction of Debye-Waller factors, or temperature factors, of protein structures

    Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

    Full text link
    This work introduces a number of algebraic topology approaches, such as multicomponent persistent homology, multi-level persistent homology and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. Multicomponent persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for chemical and biological problems. Extensive numerical experiments involving more than 4,000 protein-ligand complexes from the PDBBind database and near 100,000 ligands and decoys in the DUD database are performed to test respectively the scoring power and the virtual screening power of the proposed topological approaches. It is demonstrated that the present approaches outperform the modern machine learning based methods in protein-ligand binding affinity predictions and ligand-decoy discrimination
    • …
    corecore