18 research outputs found

    End-to-End Differentiable Molecular Mechanics Force Field Construction

    Full text link
    Molecular mechanics (MM) potentials have long been a workhorse of computational chemistry. Leveraging accuracy and speed, these functional forms find use in a wide variety of applications from rapid virtual screening to detailed free energy calculations. Traditionally, MM potentials have relied on human-curated, inflexible, and poorly extensible discrete chemical perception rules (atom types) for applying parameters to molecules or biopolymers, making them difficult to optimize to fit quantum chemical or physical property data. Here, we propose an alternative approach that uses graph nets to perceive chemical environments, producing continuous atom embeddings from which valence and nonbonded parameters can be predicted using a feed-forward neural network. Since all stages are built using smooth functions, the entire process of chemical perception and parameter assignment is differentiable end-to-end with respect to model parameters, allowing new force fields to be easily constructed, extended, and applied to arbitrary molecules. We show that this approach has the capacity to reproduce legacy atom types and can be fit to MM and QM energies and forces, among other targets

    Quantifying configuration-sampling error in Langevin simulations of complex molecular systems

    Get PDF
    While Langevin integrators are popular in the study of equilibrium properties of complex systems, it is challenging to estimate the timestep-induced discretization error: the degree to which the sampled phase-space or configuration-space probability density departs from the desired target density due to the use of a finite integration timestep. Sivak et al., introduced a convenient approach to approximating a natural measure of error between the sampled density and the target equilibrium density, the Kullback-Leibler (KL) divergence, in phase space, but did not specifically address the issue of configuration-space properties, which are much more commonly of interest in molecular simulations. Here, we introduce a variant of this near-equilibrium estimator capable of measuring the error in the configuration-space marginal density, validating it against a complex but exact nested Monte Carlo estimator to show that it reproduces the KL divergence with high fidelity. To illustrate its utility, we employ this new near-equilibrium estimator to assess a claim that a recently proposed Langevin integrator introduces extremely small configuration-space density errors up to the stability limit at no extra computational expense. Finally, we show how this approach to quantifying sampling bias can be applied to a wide variety of stochastic integrators by following a straightforward procedure to compute the appropriate shadow work, and describe how it can be extended to quantify the error in arbitrary marginal or conditional distributions of interest

    choderalab/espaloma: Version 0.3.2 - Support Latest DGL Version

    No full text
    Note This release updates espaloma to support the newest version (1.1.2) of DGL. There are no API breaking changes or updates to the espaloma model. We have attached espaloma-0.3.2.pt to this release for convince, but this model is identical to espaloma-0.3.1.pt. espaloma-latest.pt is also the same as espaloma-0.3.1.pt. Please see https://github.com/choderalab/espaloma/releases/tag/0.3.1 for more information about the espaloma 0.3.x family of models. What's Changed Support latest (1.1.2) DGL version Bug fixes, packaging, and documentation updates Installation Guide Update by @mikemhenry in https://github.com/choderalab/espaloma/pull/175 Add docker and apptainer image building on demand by @mikemhenry in https://github.com/choderalab/espaloma/pull/176 Update to new dgl version (1.1.1) by @mikemhenry in https://github.com/choderalab/espaloma/pull/179 Add Instructions on loading local model by @ijpulidos in https://github.com/choderalab/espaloma/pull/183 Test & Update to newest DGL Version (1.1.2) by @mikemhenry in https://github.com/choderalab/espaloma/pull/186 Update readme and docs for version 0.3.2 @mikemhenry in https://github.com/choderalab/espaloma/pull/188 Full Changelog: https://github.com/choderalab/espaloma/compare/0.3.1...0.3.

    choderalab/openmmtools: Release 0.8.3 - minor feature update

    No full text
    This release extends some of the systems in openmmtools.testsystems to allow multiple CustomGBForce-based GB models supported by OpenMM's customgbforces.py functionality to be used. TolueneImplicit, HostGuestImplicit, and LysozymeImplicit now accept the implicitSolvent option, allowing GB models from simtk.openmm.app (such as HCT, OBC1, OBC2, GBn, and GBn2) to be optionally specified. OBC1 remains the default. These three classes also support any kwargs supported by AmberPrmtopFile.createSystem(). Any parameters that createSystem supports will be passed along from the constructor. TolueneImplicit and HostGuestImplicit also have variants like TolueneImplicitHCT, TolueneImplicitOBC1, etc. for all OpenMM-supported implicit solvent models

    choderalab/openmmtools: Bugfix release 0.9.1

    No full text
    This release fixes a Python 2 bug caused by how Python 2 and Python 3 handle differently user-defined equality operators. This was affecting the sanity checks in alchemy.AlchemicalState to always fail when checking the compatibility of a System

    Toward Learned Chemical Perception of Force Field Typing Rules

    No full text
    Molecular mechanics force fields define how the energy and forces of a molecular system are computed from its atomic positions, and enable the study of such systems through computational methods like molecular dynamics and Monte Carlo simulations. Despite progress toward automated force field parameterization, considerable human expertise is required to develop or extend force fields. In particular, human input has long been required to define atom types, which encode chemically unique environments that determine which parameters must be assigned. However, relying on humans to establish atom types is suboptimal: the resulting atom types are often unjustified from a statistical perspective, leading to over- or under-fitting; they are difficult to extend in a systematic and consistent manner when new chemistries must be modeled or new data becomes available; and human effort is not scalable when force fields must be generated for new (bio)polymers or materials. We aim to replace human specification of atom types with an automated approach, based on solid statistics and driven by experimental and/or quantum chemical reference data. Here, we describe a novel technology for this purpose, termed SMARTY, which generalizes atom typing by using direct chemical perception with SMARTS strings, and adopting a hierarchical approach to type assignment. The SMARTY technology enables creation of a move set in atom-typing space that can be used in a Monte Carlo optimization approach to atom typing. We demonstrate the power of this approach with a fully automated procedure that is able to re-discover human-defined atom types in the traditional small molecule force field parm99/parm@Frosst. Furthermore, we show how an extension of this approach that makes use of SMIRKS strings to match multiple atoms, which we term SMIRKY, allows us to take full advantage of the advances in direct chemical perception for valence types (bonds, angles, and torsions) afforded by the recently-proposed SMIRNOFF direct chemical perception force field typing language. We assess these approaches using several molecular datasets, including one which covers a diverse molecular subset from DrugBank. </div

    Binding Modes of Ligands Using Enhanced Sampling (BLUES): Rapid Decorrelation of Ligand Binding Modes Using Nonequilibrium Candidate Monte Carlo

    No full text
    Accurately predicting protein-ligand binding is a major goal in computational chemistry, but even the prediction of ligand binding modes in proteins poses major challenges. Here, we focus on solving the binding mode prediction problem for rigid fragments. That is, we focus on computing the dominant placement, conformation, and orientations of a relatively rigid, fragment-like ligand in a receptor, and the populations of the multiple binding modes which may be relevant. This problem is important in its own right, but is even more timely given the recent success of alchemical free energy calculations. Alchemical calculations are increasingly used to predict binding free energies of ligands to receptors. However, the accuracy of these calculations is dependent on proper sampling of the relevant ligand binding modes. Unfortunately, ligand binding modes may often be uncertain, hard to predict, and/or slow to interconvert on simulation timescales, so proper sampling with current techniques can require prohibitively long simulations. We need new methods which dramatically improve sampling of ligand binding modes. Here, we develop and apply a nonequilibrium candidate Monte Carlo (NCMC) method to improve sampling of ligand binding modes.In this technique the ligand is rotated and subsequently allowed to relax in its new position through alchemical perturbation before accepting or rejecting the rotation and relaxation as a nonequilibrium Monte Carlo move. When applied to a T4 lysozyme model binding system, this NCMC method shows over two orders of magnitude improvement in binding mode sampling efficiency compared to a brute force molecular dynamics simulation. This is a first step towards applying this methodology to pharmaceutically relevant binding of fragments and, eventually, drug-like molecules. We are making this approach available via our new Binding Modes of Ligands using Enhanced Sampling (BLUES) package which is freely available on GitHub.</div

    Biomolecular Simulations under Realistic Macroscopic Salt Conditions

    No full text
    Biomolecular simulations are typically performed in an aqueous environment where the number of ions remains fixed for the duration of the simulation, generally with either a minimally neutralizing ion environment or a number of salt pairs intended to match the macroscopic salt concentration. In contrast, real biomolecules experience local ion environments where the salt concentration is dynamic and may differ from bulk. The degree of salt concentration variability and average deviation from the macroscopic concentration remains, as yet, unknown. Here, we describe the theory and implementation of a Monte Carlo <i>osmostat</i> that can be added to explicit solvent molecular dynamics or Monte Carlo simulations to sample from a semigrand canonical ensemble in which the number of salt pairs fluctuates dynamically during the simulation. The osmostat reproduces the correct equilibrium statistics for a simulation volume that can exchange ions with a large reservoir at a defined macroscopic salt concentration. To achieve useful Monte Carlo acceptance rates, the method makes use of nonequilibrium candidate Monte Carlo (NCMC) moves in which monovalent ions and water molecules are alchemically transmuted using short nonequilibrium trajectories, with a modified Metropolis-Hastings criterion ensuring correct equilibrium statistics for an (<i>Δμ</i>, <i>N</i>, <i>p</i>, <i>T</i>) ensemble to achieve a ∼10<sup>46</sup>× boost in acceptance rates. We demonstrate how typical protein (DHFR and the tyrosine kinase Src) and nucleic acid (Drew–Dickerson B-DNA dodecamer) systems exhibit salt concentration distributions that significantly differ from fixed-salt bulk simulations and display fluctuations that are on the same order of magnitude as the average

    choderalab/openmmtools: 0.13.1 - Bugfix release

    No full text
    In this release: Fix pickling of CompoundThermodynamicState (#284). Add missing term to OBC2 GB alchemical Force (#288). Generalize forcefactories.restrain_atoms() to non-protein receptors (#290). Standardize integrator global variables in ContextCache (#291)
    corecore