2 research outputs found

    Graph Neural Networks Bootstrapped for Synthetic Selection and Validation of Small Molecule Immunomodulators

    No full text
    The Programmed Cell Death Protein 1/Programmed Death-Ligand 1 (PD-1/PD-L1) interaction is an immune checkpoint utilized by cancer cells to enhance immune suppression. There is a huge need to develop small molecule drugs that are fast acting, cost effective, and readily bioavailable compared to antibodies. Unfortunately, synthesizing and validating large libraries of small- molecules to inhibit PD-1/PD-L1 interaction in a blind manner is both time-consuming and expensive. To improve this drug discovery pipeline, we have developed a machine learning methodology trained on patent data to identify, synthesize, and validate PD-1/PD-L1 small molecule inhibitors. Our model incorporates two features: docking scores to represent the energy of binding (E) as a global feature and sub-graph features through a graph neural network (GNN) of molecular topology to represent local features. This interaction energy-based Graph Neural Network (EGNN) model outperforms traditional machine learning methods and a simple GNN with a F1 score of 0.9524 and Cohen’s kappa score of 0.8861 for the hold out test set, suggesting that the topology of the small molecule, the structural interaction in the binding pocket, and chemical diversity of the training data are all important considerations for enhancing model performance. A Bootstrapped EGNN model was used to select compounds for synthesis and experimental validation with predicted high and low potency to inhibit PD-1/PD-L1 interaction. The potent inhibitor, (4-((3-(2,3-dihydrobenzo[b][1,4]dioxin-6-yl)-2- methylbenzyl)oxy)-2,6-dimethoxybenzyl)-D-serine, is a hybrid of two known bioactive scaffolds, with an IC50 of 339.9 nM that is comparatively better than the known bioactive compound. We conclude that our bootstrapped EGNN model will be useful to identify target-specific high potency molecules designed by scaffold hopping, a well-known medicinal chemistry technique

    Learning relationships between chemical and physical stability for drug development

    No full text
    Chemical and physical stabilities are two key features considered in pharmaceutical development. Chemical stability is typically reported as a combination of potency and degradation product. For peptide products, it is common to measure physical stability via aggregation or fibrillation using the fluorescent reporter Thioflavin T. Executing stability studies is a lengthy process and requires extensive resources. To reduce the resources and shorten the process for stability studies during the development of a product, we introduce a machine learning based model for predicting the chemical stability over time using both the formulation conditions as well as the aggregation curve. In this work, we explore the relationships between the formulation, stability time point, and the measurements of chemical stability and achieve a coefficient of determination on a random test set of 0.945 and a mean absolute error (MAE) of 0.421 when using a multilayer perceptron (MLP) neural network for total degradation. Similarly, we achieve a coefficient of determination of 0.908 and an MAE of 1.435 when predicting the potency using a random forest model. When measurements of physical stability are included into the model, the MAE in the prediction of TD decreases to 0.148 for the MLP model. Using a similar strategy for the prediction of potency, the MAE decreases to 0.705 for the random forest model. Therefore, we can conclude two important points: first, chemical stability can be modeled using machine learning techniques and second there is a relationship between the physical stability of a peptide and its chemical stability
    corecore