8 research outputs found

    Investigating the Behavior of Diffusion Models for Accelerating Electronic Structure Calculations

    Full text link
    We present an investigation into diffusion models for molecular generation, with the aim of better understanding how their predictions compare to the results of physics-based calculations. The investigation into these models is driven by their potential to significantly accelerate electronic structure calculations using machine learning, without requiring expensive first-principles datasets for training interatomic potentials. We find that the inference process of a popular diffusion model for de novo molecular generation is divided into an exploration phase, where the model chooses the atomic species, and a relaxation phase, where it adjusts the atomic coordinates to find a low-energy geometry. As training proceeds, we show that the model initially learns about the first-order structure of the potential energy surface, and then later learns about higher-order structure. We also find that the relaxation phase of the diffusion model can be re-purposed to sample the Boltzmann distribution over conformations and to carry out structure relaxations. For structure relaxations, the model finds geometries with ~10x lower energy than those produced by a classical force field for small organic molecules. Initializing a density functional theory (DFT) relaxation at the diffusion-produced structures yields a >2x speedup to the DFT relaxation when compared to initializing at structures relaxed with a classical force field

    Erythrocyte and Porcine Intestinal Glycosphingolipids Recognized by F4 Fimbriae of Enterotoxigenic Escherichia coli

    Get PDF
    Enterotoxigenic F4-fimbriated Escherichia coli is associated with diarrheal disease in neonatal and postweaning pigs. The F4 fimbriae mediate attachment of the bacteria to the pig intestinal epithelium, enabling an efficient delivery of diarrhea-inducing enterotoxins to the target epithelial cells. There are three variants of F4 fimbriae designated F4ab, F4ac and F4ad, respectively, having different antigenic and adhesive properties. In the present study, the binding of isolated F4ab, F4ac and F4ad fimbriae, and F4ab/ac/ad-fimbriated E. coli, to glycosphingolipids from erythrocytes and from porcine small intestinal epithelium was examined, in order to get a comprehensive view of the F4-binding glycosphingolipids involved in F4-mediated hemagglutination and adhesion to the epithelial cells of porcine intestine. Specific interactions between the F4ab, F4ac and F4ad fimbriae and both acid and non-acid glycosphingolipids were obtained, and after isolation of binding-active glycosphingolipids and characterization by mass spectrometry and proton NMR, distinct carbohydrate binding patterns were defined for each fimbrial subtype. Two novel glycosphingolipids were isolated from chicken erythrocytes, and characterized as GalNAcα3GalNAcß3Galß4Glcß1Cer and GalNAcα3GalNAcß3Galß4GlcNAcß3Galß4Glcß1Cer. These two compounds, and lactosylceramide (Galß4Glcß1Cer) with phytosphingosine and hydroxy fatty acid, were recognized by all three variants of F4 fimbriae. No binding of the F4ad fimbriae or F4ad-fimbriated E. coli to the porcine intestinal glycosphingolipids occurred. However, for F4ab and F4ac two distinct binding patterns were observed. The F4ac fimbriae and the F4ac-expressing E. coli selectively bound to galactosylceramide (Galß1Cer) with sphingosine and hydroxy 24:0 fatty acid, while the porcine intestinal glycosphingolipids recognized by F4ab fimbriae and the F4ab-fimbriated bacteria were characterized as galactosylceramide, sulfatide (SO3-3Galß1Cer), sulf-lactosylceramide (SO3-3Galß4Glcß1Cer), and globotriaosylceramide (Galα4Galß4Glcß1Cer) with phytosphingosine and hydroxy 24:0 fatty acid. Finally, the F4ad fimbriae and the F4ad-fimbriated E. coli, but not the F4ab or F4ac subtypes, bound to reference gangliotriaosylceramide (GalNAcß4Galß4Glcß1Cer), gangliotetraosylceramide (Galß3GalNAcß4Galß4Glcß1Cer), isoglobotriaosylceramide (Galα3Galß4Glcß1Cer), and neolactotetraosylceramide (Galß4GlcNAcß3Galß4Glcß1Cer)

    Accelerated Discovery of CH4 Uptake Capacity MOFs using Bayesian Optimization

    No full text
    High-throughput computational studies for discovery of metal-organic frameworks (MOFs) for separations and storage applications are often limited by the costs of computing thermodynamic quantities, with recent studies reliant ab initio results for a narrow selection of MOFs and empirical force-field methods for larger selections. Here, we conduct a proof-of-concept study using Bayesian optimization on CH4 uptake capacity of hypothetical MOFs for an existing dataset (Wilmer et al, Nature Chem. 2012, 4, 83). We show that less than 0.1% of the database needs to be screened with our Bayesian optimization approach to recover the top candidate MOFs. This opens the possibility of efficient screening of MOF databases using accurate ab-initio calculations for future adsorption studies on a minimal subset of MOFs. Furthermore, Bayesian optimization and the surrogate model presented here can offer interpretable material design insights and our framework will be applicable in the context of other target properties

    Temporal Bayesian Networks

    No full text
    Temporal formalisms are useful in several applications such as planning, scheduling and diagnosis. Probabilistic temporal reasoning emerged to deal with the uncertainties usually encountered in such applications. Bayesian networks provide a simple compact graphical representation of a probability distribution by exploiting conditional independencies. This paper presents a simple technique for representing time in Bayesian networks by expressing probabilities as functions of time. Probability transfer functions allow the formalism to deal with causal relations and dependencies between time points. Techniques to represent related time instants are distinct from those used to represent independent time instants but the probabilistic formalism is useful in both cases. The study of the cumulative e ect of repeated events involves various models such as the competing risks model and the additive model. Dynamic Bayesian networks inference mechanisms are adequate for temporal probabilistic reasoning described in this work. Examples from medical diagnosis, circuit diagnosis and common sense reasoning help illustrate the use of these techniques.

    Higher-order equivariant neural networks for charge density prediction in materials

    No full text
    Abstract The calculation of electron density distribution using density functional theory (DFT) in materials and molecules is central to the study of their quantum and macro-scale properties, yet accurate and efficient calculation remains a long-standing challenge. We introduce ChargE3Net, an E(3)-equivariant graph neural network for predicting electron density in atomic systems. ChargE3Net enables the learning of higher-order equivariant features to achieve high predictive accuracy and model expressivity. We show that ChargE3Net exceeds the performance of prior work on diverse sets of molecules and materials. When trained on the massive dataset of over 100K materials in the Materials Project database, our model is able to capture the complexity and variability in the data, leading to a significant 26.7% reduction in self-consistent iterations when used to initialize DFT calculations on unseen materials. Furthermore, we show that non-self-consistent DFT calculations using our predicted charge densities yield near-DFT performance on electronic and thermodynamic property prediction at a fraction of the computational cost. Further analysis attributes the greater predictive accuracy to improved modeling of systems with high angular variations. These results illuminate a pathway towards a machine learning-accelerated ab initio calculations for materials discovery

    Metal-organic frameworks as O2-selective adsorbents for air separations.

    No full text
    Oxygen is a critical gas in numerous industries and is produced globally on a gigatonne scale, primarily through energy-intensive cryogenic distillation of air. The realization of large-scale adsorption-based air separations could enable a significant reduction in associated worldwide energy consumption and would constitute an important component of broader efforts to combat climate change. Certain small-scale air separations are carried out using N2-selective adsorbents, although the low capacities, poor selectivities, and high regeneration energies associated with these materials limit the extent of their usage. In contrast, the realization of O2-selective adsorbents may facilitate more widespread adoption of adsorptive air separations, which could enable the decentralization of O2 production and utilization and advance new uses for O2. Here, we present a detailed evaluation of the potential of metal-organic frameworks (MOFs) to serve as O2-selective adsorbents for air separations. Drawing insights from biological and molecular systems that selectively bind O2, we survey the field of O2-selective MOFs, highlighting progress and identifying promising areas for future exploration. As a guide for further research, the importance of moving beyond the traditional evaluation of O2 adsorption enthalpy, ΔH, is emphasized, and the free energy of O2 adsorption, ΔG, is discussed as the key metric for understanding and predicting MOF performance under practical conditions. Based on a proof-of-concept assessment of O2 binding carried out for eight different MOFs using experimentally derived capacities and thermodynamic parameters, we identify two existing materials and one proposed framework with nearly optimal ΔG values for operation under user-defined conditions. While enhancements are still needed in other material properties, the insights from the assessments herein serve as a guide for future materials design and evaluation. Computational approaches based on density functional theory with periodic boundary conditions are also discussed as complementary to experimental efforts, and new predictions enable identification of additional promising MOF systems for investigation

    Hybrid Approach for Selective Sulfoxidation via Bioelectrochemically Derived Hydrogen Peroxide over a Niobium(V)–Silica Catalyst

    No full text
    In this work, we demonstrate a combined bioelectrochemical and inorganic catalytic system for resource recovery from wastewater. We designed a microbial peroxide producing cell (MPPC) for hydrogen peroxide (H<sub>2</sub>O<sub>2</sub>) production and used this bioelectrochemically derived H<sub>2</sub>O<sub>2</sub> as a green oxidant for sulfoxidation, an industrial reaction used for chemical synthesis and oxidative desulfurization of transportation fuels. We operated an MPPC equipped with a gas diffusion electrode cathode for six months, achieving a peak current density above 1.4 mA cm<sup>–2</sup> with 60% average acetate removal and 61% average anodic Coulombic efficiency. We evaluated several cathode buffers under batch and continuous-flow conditions for solubility and pH compatibility with downstream catalytic systems. During 24-h batch tests, a phosphate-buffered MPPC achieved a maximum H<sub>2</sub>O<sub>2</sub> concentration of 4.6 g L<sup>–1</sup> and a citric acid–phosphate-buffered MPPC obtained a moderate H<sub>2</sub>O<sub>2</sub> concentration (3.1 g L<sup>–1</sup>) at a low energy input (1.6 Wh g<sup>–1</sup> H<sub>2</sub>O<sub>2</sub>) and pH (10). The MPPC-derived H<sub>2</sub>O<sub>2</sub> was used directly as an oxidant for the catalytic sulfoxidation of 4-hydroxythioanisole over a solid niobium­(V)–silica catalyst. We achieved 82% conversion of 50 mM 4-hydroxythioanisole to 4-(methylsulfinyl)­phenol with 99% selectivity with a 0.5 mol % catalyst loading in 100 min in aqueous media. Our results demonstrate a new and versatile approach for valorization of wastewater through continuous production of H<sub>2</sub>O<sub>2</sub> and its subsequent use as a selective green oxidant in aqueous conditions for green chemistry applications
    corecore