163 research outputs found

    Prediction of the Chemical Context for Buchwald-Hartwig Coupling Reactions

    Get PDF
    We present machine learning models for predicting the chemical context for Buchwald-Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in-house electronic lab notebooks, we train two models: one based on single-label data and one based on multi-label data. Both models show excellent top-3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi-label data because the multi-label model shows higher accuracy and better sensitivity for the individual contexts than the single-label model. Although the models are performant, we also show that such models need to be re-trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context-usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re-trained

    Dynamics of Elongation Factor 2 Kinase Regulation in Cortical Neurons in Response to Synaptic Activity

    Get PDF
    The rapid regulation of cell signaling in response to calcium in neurons is essential for real-time processing of large amounts of information in the brain. A vital regulatory component, and one of the most energy-intensive biochemical processes in cells, is the elongation phase of mRNA translation, which is controlled by the Ca(2+)/CaM-dependent elongation factor 2 kinase (eEF2K). However, little is known about the dynamics of eEF2K regulation in neurons despite its established role in learning and synaptic plasticity. To explore eEF2K dynamics in depth, we stimulated synaptic activity in mouse primary cortical neurons. We find that synaptic activity results in a rapid, but transient, increase in eEF2K activity that is regulated by a combination of AMPA and NMDA-type glutamate receptors and the mitogen-activated protein kinase (MEK)/extracellular signal-regulated kinase (ERK) and mammalian target of rapamycin complex 1 (mTORC1) pathways. We then used computational modeling to test the hypothesis that considering Ca(2+)-coordinated MEK/ERK, mTORC1, and eEF2k activation is sufficient to describe the observed eEF2K dynamics. Although such a model could partially fit the empirical findings, it also suggested that a crucial positive regulator of eEF2K was also necessary. Through additional modeling and empirical evidence, we demonstrate that AMP kinase (AMPK) is also an important regulator of synaptic activity-driven eEF2K dynamics in neurons. Our combined modeling and experimental findings provide the first evidence that it is necessary to consider the combined interactions of Ca(2+) with MEK/ERK, mTORC1, and AMPK to adequately explain eEF2K regulation in neurons

    Parameterization of a coarse-grained model of cholesterol with point-dipole electrostatics

    Get PDF
    © 2018, Springer Nature Switzerland AG. We present a new coarse-grained (CG) model of cholesterol (CHOL) for the electrostatic-based ELBA force field. A distinguishing feature of our CHOL model is that the electrostatics is modeled by an explicit point dipole which interacts through an ideal vacuum permittivity. The CHOL model parameters were optimized in a systematic fashion, reproducing the electrostatic and nonpolar partitioning free energies of CHOL in lipid/water mixtures predicted by full-detailed atomistic molecular dynamics simulations. The CHOL model has been validated by comparison to structural, dynamic and thermodynamic properties with experimental and atomistic simulation reference data. The simulation of binary DPPC/cholesterol mixtures covering the relevant biological content of CHOL in mammalian membranes is shown to correctly predict the main lipid behavior as observed experimentally

    Molecular determinants of binding to the Plasmodium subtilisin-like protease 1.

    Get PDF
    PfSUB1, a subtilisin-like protease of the human malaria parasite Plasmodium falciparum, is known to play important roles during the life cycle of the parasite and has emerged as a promising antimalarial drug target. In order to provide a detailed understanding of the origin of binding determinants of PfSUB1 substrates, we performed molecular dynamics simulations in combination with MM-GBSA free energy calculations using a homology model of PfSUB1 in complex with different substrate peptides. Key interactions, as well as residues that potentially make a major contribution to the binding free energy, are identified at the prime and nonprime side of the scissile bond and comprise peptide residues P4 to P2'. This finding stresses the requirement for peptide substrates to interact with both prime and nonprime side residues of the PfSUB1 binding site. Analyzing the energetic contributions of individual amino acids within the peptide-PfSUB1 complexes indicated that van der Waals interactions and the nonpolar part of solvation energy dictate the binding strength of the peptides and that the most favorable interactions are formed by peptide residues P4 and P1. Hot spot residues identified in PfSUB1 are dispersed over the entire binding site, but clustered areas of hot spots also exist and suggest that either the S4-S2 or the S1-S2' binding site should be exploited in efforts to design small molecule inhibitors. The results are discussed with respect to which binding determinants are specific to PfSUB1 and, therefore, might allow binding selectivity to be obtained

    Combining Experiments and Simulations Using the Maximum Entropy Principle

    Get PDF
    A key component of computational biology is to compare the results of computer modelling with experimental measurements. Despite substantial progress in the models and algorithms used in many areas of computational biology, such comparisons sometimes reveal that the computations are not in quantitative agreement with experimental data. The principle of maximum entropy is a general procedure for constructing probability distributions in the light of new data, making it a natural tool in cases when an initial model provides results that are at odds with experiments. The number of maximum entropy applications in our field has grown steadily in recent years, in areas as diverse as sequence analysis, structural modelling, and neurobiology. In this Perspectives article, we give a broad introduction to the method, in an attempt to encourage its further adoption. The general procedure is explained in the context of a simple example, after which we proceed with a real-world application in the field of molecular simulations, where the maximum entropy procedure has recently provided new insight. Given the limited accuracy of force fields, macromolecular simulations sometimes produce results that are at not in complete and quantitative accordance with experiments. A common solution to this problem is to explicitly ensure agreement between the two by perturbing the potential energy function towards the experimental data. So far, a general consensus for how such perturbations should be implemented has been lacking. Three very recent papers have explored this problem using the maximum entropy approach, providing both new theoretical and practical insights to the problem. We highlight each of these contributions in turn and conclude with a discussion on remaining challenges

    The Carbohydrate-Binding Site in Galectin-3 Is Preorganized To Recognize a Sugarlike Framework of Oxygens: Ultra-High-Resolution Structures and Water Dynamics

    Get PDF
    The recognition of carbohydrates by proteins is a fundamental aspect of communication within and between living cells. Understanding the molecular basis of carbohydrate-protein interactions is a prerequisite for the rational design of synthetic ligands. Here we report the high- to ultrahigh-resolution crystal structures of the carbohydrate recognition domain of galectin-3 (Gal3C) in the ligand-free state (1.08 angstrom at 100 K, 1.25 angstrom at 298 K) and in complex with lactose (0.86 angstrom) or glycerol (0.9 angstrom). These structures reveal striking similarities in the positions of water and carbohydrate oxygen atoms in all three states, indicating that the binding site of Gal3C is preorganized to coordinate oxygen atoms in an arrangement that is nearly optimal for the recognition of beta-galactosides. Deuterium nuclear magnetic resonance (NMR) relaxation dispersion experiments and molecular dynamics simulations demonstrate that all water molecules in the lactose-binding site exchange with bulk water on a time scale of nanoseconds or shorter. Nevertheless, molecular dynamics simulations identify transient water binding at sites that agree well with those observed by crystallography, indicating that the energy landscape of the binding site is maintained in solution. All heavy atoms of glycerol are positioned like the corresponding atoms of lactose in the Gal3C complexes. However, binding of glycerol to Gal3C is insignificant in solution at room temperature, as monitored by NMR spectroscopy or isothermal titration calorimetry under conditions where lactose binding is readily detected. These observations make a case for protein cryo-crystallography as a valuable screening method in fragment-based drug discovery and further suggest that identification of water sites might inform inhibitor design
    corecore