43 research outputs found
Towards Automated Benchmarking of Atomistic Forcefields: Neat Liquid Densities and Static Dielectric Constants from the ThermoML Data Archive
Atomistic molecular simulations are a powerful way to make quantitative
predictions, but the accuracy of these predictions depends entirely on the
quality of the forcefield employed. While experimental measurements of
fundamental physical properties offer a straightforward approach for evaluating
forcefield quality, the bulk of this information has been tied up in formats
that are not machine-readable. Compiling benchmark datasets of physical
properties from non-machine-readable sources require substantial human effort
and is prone to accumulation of human errors, hindering the development of
reproducible benchmarks of forcefield accuracy. Here, we examine the
feasibility of benchmarking atomistic forcefields against the NIST ThermoML
data archive of physicochemical measurements, which aggregates thousands of
experimental measurements in a portable, machine-readable, self-annotating
format. As a proof of concept, we present a detailed benchmark of the
generalized Amber small molecule forcefield (GAFF) using the AM1-BCC charge
model against measurements (specifically bulk liquid densities and static
dielectric constants at ambient pressure) automatically extracted from the
archive, and discuss the extent of available data. The results of this
benchmark highlight a general problem with fixed-charge forcefields in the
representation low dielectric environments such as those seen in binding
cavities or biological membranes
Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks
Free energy calculations are rapidly becoming indispensable in
structure-enabled drug discovery programs. As new methods, force fields, and
implementations are developed, assessing their expected accuracy on real-world
systems (benchmarking) becomes critical to provide users with an assessment of
the accuracy expected when these methods are applied within their domain of
applicability, and developers with a way to assess the expected impact of new
methodologies. These assessments require construction of a benchmark - a set of
well-prepared, high quality systems with corresponding experimental
measurements designed to ensure the resulting calculations provide a realistic
assessment of expected performance when these methods are deployed within their
domains of applicability. To date, the community has not yet adopted a common
standardized benchmark, and existing benchmark reports suffer from a myriad of
issues, including poor data quality, limited statistical power, and
statistically deficient analyses, all of which can conspire to produce
benchmarks that are poorly predictive of real-world performance. Here, we
address these issues by presenting guidelines for (1) curating experimental
data to develop meaningful benchmark sets, (2) preparing benchmark inputs
according to best practices to facilitate widespread adoption, and (3) analysis
of the resulting predictions to enable statistically meaningful comparisons
among methods and force fields
Assessing the Conformational Equilibrium of Carboxylic Acid via QM and MD Studies on Acetic Acid
Accurate hydrogen placement in molecular modeling is crucial for studying the interactions and dynamics of biomolecular systems. It is difficult to locate hydrogen atoms from many experimental structural characterization approaches, such as due to the weak scattering of x-ray radiation. Hydrogen atoms are usually added and positioned in silico when preparing experimental structures for modeling and simulation. The carboxyl functional group is a prototypical example of a functional group that requires protonation during structure preparation. To our knowledge, when in their neutral form, carboxylic acids are typically protonated in the syn conformation by default in classical molecular modeling packages, with no consideration of alternative conformations, though we are not aware of any careful examination of this topic. Here, we investigate the general belief that carboxylic acids should always be protonated in the syn conformation. We calculate and compare the relative energetic stabilities of syn and anti acetic acid using ab initio quantum mechanical calculations and atomistic molecular dynamics simulations. We show that while the syn conformation is the preferred state, the anti state may in some cases also be present under normal NPT conditions in solution
Biomolecular Solvation Structure Revealed by Molecular Dynamics Simulations
To compare ordered water positions from experiment
with those from molecular dynamics (MD) simulations, a number of MD models of
water structure in crystalline endoglucanase were calculated. The starting MD
model was derived from a joint X-ray and neutron diffraction crystal structure,
enabling the use of experimentally assigned protonation states. Simulations
were performed in the crystalline state, using a periodic 2x2x2 supercell with
explicit solvent. Water X-ray and neutron scattering density maps were computed
from MD trajectories using standard macromolecular crystallography methods. In
one set of simulations, harmonic restraints were applied to bias the protein
structure toward the crystal structure. For these simulations, the recall of
crystallographic waters using strong peaks in the MD water electron density was
very good, and there also was substantial visual agreement between the
boomerang-like wings of the neutron scattering density and the crystalline
water hydrogen positions. An unrestrained simulation also was performed. For
this simulation, the recall of crystallographic waters was much lower. For both
restrained and unrestrained simulations, the strongest water density peaks were
associated with crystallographic waters. The results demonstrate that it is now
possible to recover crystallographic water structure using restrained MD
simulations, but that it is not yet reasonable to expect unrestrained MD
simulations to do the same. Further development and generalization of MD water
models for force field development, macromolecular crystallography, and
medicinal chemistry applications is now warranted. In particular, the
combination of room-temperature crystallography, neutron diffraction, and
crystalline MD simulations promises to substantially advance modeling of
biomolecular solvation
Recommended from our members
Assessing the Conformational Equilibrium of Carboxylic Acid via Quantum Mechanical and Molecular Dynamics Studies on Acetic Acid.
Accurate hydrogen placement in molecular modeling is crucial for studying the interactions and dynamics of biomolecular systems. The carboxyl functional group is a prototypical example of a functional group that requires protonation during structure preparation. To our knowledge, when in their neutral form, carboxylic acids are typically protonated in the syn conformation by default in classical molecular modeling packages, with no consideration of alternative conformations, though we are not aware of any careful examination of this topic. Here, we investigate the general belief that carboxylic acids should always be protonated in the syn conformation. We calculate and compare the relative energetic stabilities of syn and anti acetic acid using ab initio quantum mechanical calculations and atomistic molecular dynamics simulations. We focus on the carboxyl torsional potential and configurations of microhydrated acetic acid from molecular dynamics simulations, probing the effects of solvent, force field (GAFF vs GAFF2), and partial charge assignment of acetic acid. We show that while the syn conformation is the preferred state, the anti state may in some cases also be present under normal NPT conditions in solution
Benchmark Assessment of Molecular Geometries and Energies from Small Molecule Force Fields
Force fields are used in a wide variety of contexts for classical molecular simulation, including studies on protein-ligand binding, membrane permeation, and thermophysical property prediction. The quality of these studies relies on the quality of the force fields used to represent the systems. Focusing on small molecules of fewer than 50 heavy atoms, our aim in this work is to compare nine force fields: GAFF, GAFF2, MMFF94, MMFF94S, OPLS3e, SMIRNOFF99Frosst, and the Open Force Field Parsley, versions 1.0, 1.1 and 1.2. On a dataset comprising 22,675 molecular structures of 3,271 molecules, we analyzed force field-optimized geometries and conformer energies compared these to reference quantum mechanical (QM) data. We show that while OPLS3e performs best, the latest Open Force Field Parsley release is approaching a comparable level of accuracy in reproducing QM geometries and energetics for this set of molecules. Meanwhile, the performance of established force fields such as MMFF94s and GAFF2 is generally somewhat worse. We also find that the series of recent Open Force Field versions provide significant increases in accuracy. Our molecule set and results are available for other researchers to use in testing.</div