37 research outputs found
Theoretical insights into specific ion effects and strong‐weak acid‐base rules for ions in solution : deriving the law of matching solvent affinities from first principles
We present a detailed study of specific ion effects, volcano plots and the law of matching solvent affinities by means of a conceptual density functional theory (DFT) approach. Our results highlight that specific ion effects and the corresponding implications on the solvation energy are mainly due to differences in the electric chemical potentials and chemical hardnesses of the ions and the solvent. Our approach can be further used to identify reliable criteria for the validity of the law of matching solvent affinities. Basic expressions are derived, which allow us to study the limiting conditions for this empirical observation with regard to matching chemical reactivity indices. Moreover, we show that chaotropic and kosmotropic concepts and their implications for the stability of ion pairs are directly related to a generalized strong and weak acids and bases (SWAB) principle for ions in solution, which is also applicable to rationalize the shape of volcano plots for different solvents. In contrast to previous assumptions, all empirical findings can be explained by the properties of local solvent‐ion complexes which dominate the specific global behavior of ion pairs in solution.University of FloridaProjekt DEA
Insights into Hildebrand solubility parameters : contributions from cohesive energies or electrophilicity densities?
We introduce certain concepts and expressions from conceptual density functional theory (DFT) to study the properties of the Hildebrand solubility parameter. The original form of the Hildebrand solubility parameter is used to qualitatively estimate solubilities for various apolar and aprotic substances and solvents and is based on the square root of the cohesive energy density. Our results show that a revised expression allows the replacement of cohesive energy densities by electrophilicity densities, which are numerically accessible by simple DFT calculations. As an extension, the reformulated expression provides a deeper interpretation of the main contributions and, in particular, emphasizes the importance of charge transfer mechanisms. All calculated values of the Hildebrand parameters for a large number of common solvents are compared with experimental values and show good agreement for non‐ or moderately polar aprotic solvents in agreement with the original formulation of the Hildebrand solubility parameters. The observed deviations for more polar and protic solvents define robust limits from the original formulation which remain valid. Likewise, we show that the use of machine learning methods leads to only slightly better predictability.University of Florid
Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 2: speed, consistency, diversity selection
Despite being a central concept in cheminformatics, molecular similarity has so far been limited to the simultaneous comparison of only two molecules at a time and using one index, generally the Tanimoto coefficent. In a recent contribution we have not only introduced a complete mathematical framework for extended similarity calculations, (i.e. comparisons of more than two molecules at a time) but defined a series of novel idices. Part 1 is a detailed analysis of the effects of various parameters on the similarity values calculated by the extended formulas. Their features were revealed by sum of ranking differences and ANOVA. Here, in addition to characterizing several important aspects of the newly introduced similarity metrics, we will highlight their applicability and utility in real-life scenarios using datasets with popular molecular fingerprints. Remarkably, for large datasets, the use of extended similarity measures provides an unprecedented speed-up over “traditional” pairwise similarity matrix calculations. We also provide illustrative examples of a more direct algorithm based on the extended Tanimoto similarity to select diverse compound sets, resulting in much higher levels of diversity than traditional approaches. We discuss the inner and outer consistency of our indices, which are key in practical applications, showing whether the n-ary and binary indices rank the data in the same way. We demonstrate the use of the new n-ary similarity metrics on t-distributed stochastic neighbor embedding (t-SNE) plots of datasets of varying diversity, or corresponding to ligands of different pharmaceutical targets, which show that our indices provide a better measure of set compactness than standard binary measures. We also present a conceptual example of the applicability of our indices in agglomerative hierarchical algorithms. The Python code for calculating the extended similarity metrics is freely available at: https://github.com/ramirandaq/MultipleComparison
Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 1: Theory and characteristics
Quantification of the similarity of objects is a key concept in many areas of computational science. This includes cheminformatics, where molecular similarity is usually quantified based on binary fingerprints. While there is a wide selection of available molecular representations and similarity metrics, there were no previous efforts to extend the computational framework of similarity calculations to the simultaneous comparison of more than two objects (molecules) at the same time. The present study bridges this gap, by introducing a straightforward computational framework for comparing multiple objects at the same time and providing extended formulas for as many similarity metrics as possible. In the binary case (i.e. when comparing two molecules pairwise) these are naturally reduced to their well-known formulas. We provide a detailed analysis on the effects of various parameters on the similarity values calculated by the extended formulas. The extended similarity indices are entirely general and do not depend on the fingerprints used. Two types of variance analysis (ANOVA) help to understand the main features of the indices: (i) ANOVA of mean similarity indices; (ii) ANOVA of sum of ranking differences (SRD). Practical aspects and applications of the extended similarity indices are detailed in the accompanying paper: Miranda-Quintana et al. J Cheminform. 2021. https://doi.org/10.1186/s13321-021-00504-4. Python code for calculating the extended similarity metrics is freely available at: https://github.com/ramirandaq/MultipleComparisons
Molecular Interactions From the Density Functional Theory for Chemical Reactivity: The Interaction Energy Between Two-Reagents
Reactivity descriptors indicate where a reagent is most reactive and how it is most likely to react. However, a reaction will only occur when the reagent encounters a suitable reaction partner. Determining whether a pair of reagents is well-matched requires developing reactivity rules that depend on both reagents. This can be achieved using the expression for the minimum-interaction-energy obtained from the density functional reactivity theory. Different terms in this expression will be dominant in different circumstances; depending on which terms control the reactivity, different reactivity indicators will be preferred