13 research outputs found
Targeting Histidine Side Chains in Molecular Design through Nitrogen–Halogen Bonds
Halogen bonds are directional noncovalent
interactions that can
be used to target electron donors in a protein binding site. In this
study, we employ quantum chemical calculations to explore halogen···nitrogen
contacts involving histidine side chains. We characterize the energetics
on the MP2 level of theory using SCS-MP2 and CCSDÂ(T)/CBS as reference
calculations and elucidate their energy profile in suboptimal geometries.
We derive simple rules allowing medicinal chemists and chemical biologists
to easily determine preferred areas of interaction in a binding site
and exploit them for scaffold decoration and design. Our work shows
that nitrogen–halogen bonds are valuable interactions that
are this far underexploited in patent applications, lead structure,
and clinical candidate selection. We highlight their potential to
increase binding affinities and suggest that they can significantly
contribute to inducing and tuning subtype selectivities
Machine Learning Estimates of Natural Product Conformational Energies
<div><p>Machine learning has been used for estimation of potential energy surfaces to speed up molecular dynamics simulations of small systems. We demonstrate that this approach is feasible for significantly larger, structurally complex molecules, taking the natural product Archazolid A, a potent inhibitor of vacuolar-type ATPase, from the myxobacterium <i>Archangium gephyra</i> as an example. Our model estimates energies of new conformations by exploiting information from previous calculations via Gaussian process regression. Predictive variance is used to assess whether a conformation is in the interpolation region, allowing a controlled trade-off between prediction accuracy and computational speed-up. For energies of relaxed conformations at the density functional level of theory (implicit solvent, DFT/BLYP-disp3/def2-TZVP), mean absolute errors of less than 1 kcal/mol were achieved. The study demonstrates that predictive machine learning models can be developed for structurally complex, pharmaceutically relevant compounds, potentially enabling considerable speed-ups in simulations of larger molecular structures.</p></div
Performance of ML models trained separately on each individual MD run and tested on the other MD runs.
<p>RMSE: root mean square error (kJ/mol), MAE: mean absolute error (kJ/mol), MAE (%): MAE as a percentage of the range of training set energy values, <i>R</i><sup>2</sup>: squared Pearson correlation coefficient.</p
Learning using predictive variance.
<p>Shown is the trade-off between mean absolute error (MAE, solid line, left scale) and number of predicted conformations (<i>m</i>, dashed line, right scale). Results are averaged over all possible orderings of the four MD runs (4! = 24; standard deviations ca. 0.4 kJ/mol and 35 samples). Squared correlation is <i>R</i><sup>2</sup> = 0.99.</p
Configuration of the myxobacterial polyketide Archazolid A, a potent inhibitor of vacuolar-type ATPase (V-ATPase).
<p>Configuration of the myxobacterial polyketide Archazolid A, a potent inhibitor of vacuolar-type ATPase (V-ATPase).</p
Performance of ML models trained on randomized subsets of increasing size of the complete MD data.
<p>See <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1003400#pcbi-1003400-t001" target="_blank">Table 1</a> for abbreviations.</p
Influence of sampling.
<p>Shown are smoothed PCA maps of absolute prediction errors for ML models trained on individual MD data (top row) and ML models trained on randomized subsets of all MD data (bottom row). Color indicates magnitude of error (blue = low, red = high); training samples are shown as black dots.</p
Projection of MD conformations of Archazolid A onto two dimensions (, ) by principal component analysis.
<p>Shown are distribution of individual conformations (left) and smoothed energy landscape generated by LiSARD <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1003400#pcbi.1003400-Reutlinger1" target="_blank">[52]</a> (right). Labels indicate reported NMR-motivated structures (A = <i>c5a</i>, B = <i>c5b</i>, P = <i>nmr</i>) and lowest-energy MD conformations (8, 595, 40). Color coding is from lowest (blue) to highest (red) relative energy.</p
Predicted vs calculated ΔE values of Archazolid A conformations.
<p>All predictions were obtained by stratified 10-fold cross-validation of the complete MD data. NMR-based conformations <i>c5a</i>, <i>c5b</i>, <i>nmr</i> are marked by red circles (external test data).</p
Reported conformations <i>5a</i>, <i>5b</i> (grey), and <i>nmr</i> (black) of Archazolid A derived from NMR studies [9].
<p>Molecules were superimposed by minimizing root mean square deviation in PyMol (<a href="http://www.pymol.org" target="_blank">www.pymol.org</a>).</p