11 research outputs found

    Utilizing Machine Learning to Greatly Expand the Range and Accuracy of Bottom-Up Coarse-Grained Models Through Virtual Particles

    Full text link
    Coarse-grained (CG) models parameterized using atomistic reference data, i.e., 'bottom up' CG models, have proven useful in the study of biomolecules and other soft matter. However, the construction of highly accurate, low resolution CG models of biomolecules remains challenging. We demonstrate in this work how virtual particles, CG sites with no atomistic correspondence, can be incorporated into CG models within the context of relative entropy minimization (REM) as latent variables. The methodology presented, variational derivative relative entropy minimization (VD-REM), enables optimization of virtual particle interactions through a gradient descent algorithm aided by machine learning. We apply this methodology to the challenging case of a solvent-free CG model of a 1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC) lipid bilayer and demonstrate that introduction of virtual particles captures solvent-mediated behavior and higher-order correlations which REM alone cannot capture in a more standard CG model based only on the mapping of collections of atoms to the CG sites.Comment: 35 pages, 9 figure

    Statistically Optimal Force Aggregation for Coarse-Graining Molecular Dynamics

    Full text link
    Machine-learned coarse-grained (CG) models have the potential for simulating large molecular complexes beyond what is possible with atomistic molecular dynamics. However, training accurate CG models remains a challenge. A widely used methodology for learning CG force-fields maps forces from all-atom molecular dynamics to the CG representation and matches them with a CG force-field on average. We show that there is flexibility in how to map all-atom forces to the CG representation, and that the most commonly used mapping methods are statistically inefficient and potentially even incorrect in the presence of constraints in the all-atom simulation. We define an optimization statement for force mappings and demonstrate that substantially improved CG force-fields can be learned from the same simulation data when using optimized force maps. The method is demonstrated on the miniproteins Chignolin and Tryptophan Cage and published as open-source code.Comment: 44 pages, 19 figure

    OpenMSCG: A Software Tool for Bottom-Up Coarse-Graining

    Get PDF
    The “bottom-up” approach to coarse-graining, for building accurate and efficient computational models to simulate large-scale and complex phenomena and processes, is an important approach in computational chemistry, biophysics, and materials science. As one example, the Multiscale Coarse-Graining (MS-CG) approach to developing CG models can be rigorously derived using statistical mechanics applied to fine-grained, i.e., all-atom simulation data for a given system. Under a number of circumstances, a systematic procedure, such as MS-CG modeling, is particularly valuable. Here, we present the development of the OpenMSCG software, a modularized open-source software that provides a collection of successful and widely applied bottom-up CG methods, including Boltzmann Inversion (BI), Force-Matching (FM), Ultra-Coarse-Graining (UCG), Relative Entropy Minimization (REM), Essential Dynamics Coarse-Graining (EDCG), and Heterogeneous Elastic Network Modeling (HeteroENM). OpenMSCG is a high-performance and comprehensive toolset that can be used to derive CG models from large-scale fine-grained simulation data in file formats from common molecular dynamics (MD) software packages, such as GROMACS, LAMMPS, and NAMD. OpenMSCG is modularized in the Python programming framework, which allows users to create and customize modeling “recipes” for reproducible results, thus greatly improving the reliability, reproducibility, and sharing of bottom-up CG models and their applications

    Navigating protein landscapes with a machine-learned transferable coarse-grained model

    Full text link
    The most popular and universally predictive protein simulation models employ all-atom molecular dynamics (MD), but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep learning methods with a large and diverse training set of all-atom protein simulations, we here develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences not used during model parametrization. We demonstrate that the model successfully predicts folded structures, intermediates, metastable folded and unfolded basins, and the fluctuations of intrinsically disordered proteins while it is several orders of magnitude faster than an all-atom model. This showcases the feasibility of a universal and computationally efficient machine-learned CG model for proteins

    Bottom-up Coarse-Graining: Principles and Perspectives

    No full text
    Large-scale computational molecular models provide scientists a means to investigate the effect of microscopic details on emergent mesoscopic behavior. Elucidating the relationship between variations on the molecular scale and macroscopic observable properties facilitates an understanding of the molecular interactions driving the properties of real world materials and complex systems (e.g., those found in biology, chemistry, and materials science). As a result, discovering an explicit, systematic connection between microscopic nature and emergent mesoscopic behavior is a fundamental goal for this type of investigation. The molecular forces critical to driving the behavior of complex heterogeneous systems are often unclear. More problematically, simulations of representative model systems are often prohibitively expensive from both spatial and temporal perspectives, impeding straightforward investigations over possible hypotheses characterizing molecular behavior. While the reduction in resolution of a study, such as moving from an atomistic simulation to that of the resolution of large coarse-grained (CG) groups of atoms, can partially ameliorate the cost of individual simulations, the relationship between the proposed microscopic details and this intermediate resolution is nontrivial and presents new obstacles to study. Small portions of these complex systems can be realistically simulated. Alone, these smaller simulations likely do not provide insight into collectively emergent behavior. However, by proposing that the driving forces in both smaller and larger systems (containing many related copies of the smaller system) have an explicit connection, systematic bottom-up CG techniques can be used to transfer CG hypotheses discovered using a smaller scale system to a larger system of primary interest. The proposed connection between different CG systems is prescribed by (i) the CG representation (mapping) and (ii) the functional form and parameters used to represent the CG energetics, which approximate potentials of mean force (PMFs). As a result, the design of CG methods that facilitate a variety of physically relevant representations, approximations, and force fields is critical to moving the frontier of systematic CG forward. Crucially, the proposed connection between the system used for parametrization and the system of interest is orthogonal to the optimization used to approximate the potential of mean force present in all systematic CG methods. The empirical efficacy of machine learning techniques on a variety of tasks provides strong motivation to consider these approaches for approximating the PMF and analyzing these approximations

    OpenMSCG: A Software Tool for Bottom-up Coarse-graining

    No full text
    The “bottom-up” approach to coarse-graining – for building accurate and efficient computational models to simulate large-scale and complex phenomena and processes – is an important approach in computational chemistry, biophysics, and materials science. As one example, the multiscale coarse-graining (MS-CG) approach to developing CG models can be rigorously derived using statistical mechanics applied to fine-grained, i.e., all-atom simulation data for a given system. Under a number of circumstances, a systematic procedure such as MS-CG modeling is particularly valuable. Here we present the development of the OpenMSCG software, a modularized open-source software that provides a collection of successful and widely applied bottom-up CG methods, including Boltzmann Inversion (BI), Force-Matching (FM), Ultra-Coarse-Graining (UCG), Relative Entropy Minimization (REM), Essential Dynamics Coarse-Graining (ED-CG), and Heterogeneous Elastic Network Modeling (HeteroENM). OpenMSCG is a high-performance and comprehensive toolset that can be used to derive CG models from large-scale fine-grained simulation data in file formats from common molecular dynamics (MD) software packages, such as GROMACS, LAMMPS and NAMD. OpenMSCG is modulized in the Python programming framework, which allows users to create and customize modeling “recipes” for reproducible results, thus greatly improving the reliability, reproducibility, and sharing of bottom-up CG models and their applications

    Immature HIV-1 lattice assembly dynamics are regulated by scaffolding from nucleic acid and the plasma membrane

    No full text
    The packaging and budding of Gag polyprotein and viral RNA is a critical step in the HIV-1 life cycle. High-resolution structures of the Gag polyprotein have revealed that the capsid (CA) and spacer peptide 1 (SP1) domains contain important interfaces for Gag self-assembly. However, the molecular details of the multimerization process, especially in the presence of RNA and the cell membrane, have remained unclear. In this work, we investigate the mechanisms that work in concert between the polyproteins, RNA, and membrane to promote immature lattice growth. We develop a coarse-grained (CG) computational model that is derived from sub nano-meter resolution structural data. Our simulations recapitulate contiguous and hexameric lattice assembly driven only by weak anisotropic attractions at the helical CA-SP1 junction. Importantly, analysis from CG and single-particle tracking photo-activated localization (spt-PALM) trajectories indicates that viral RNA and the membrane are critical constituents that actively promote Gag multimerization through scaffolding, while over expression of short competitor RNA can suppress assembly. We also find that the CA amino-terminal domain imparts intrinsic curvature to the Gag lattice. As a consequence, immature lattice growth appears to be coupled to the dynamics of spontaneous membrane deformation. Our findings elucidate a simple network of interactions that regulate the early stages of HIV-1 assembly and budding
    corecore