Data-driven analysis of the number of Lennard–Jones types needed in a force field

Abstract

Force fields used in molecular simulations contain numerical parameters, such as Lennard-Jones (LJ) parameters, which are assigned to the atoms in a molecule based on a classification of their chemical environments. The number of classes, or types, should be no more than needed to maximize agreement with experiment, as parsimony avoids overfitting and simplifies parameter optimization. However, types have historically been crafted based largely on chemical intuition, so current force fields may contain more types than needed. In this study, we seek the minimum number of LJ parameter types needed to represent key properties of organic liquids. We find that highly competitive force field accuracy is obtained with minimalist sets of LJ types; e.g. two H types and one type apiece for C, O, and N atoms. We also find that the fitness surface has multiple minima, which can lead to local trapping of the optimizer

    Similar works