284 research outputs found

    The ‘SAR Matrix’ method and its extensions for applications in medicinal chemistry and chemogenomics

    Get PDF
    We describe the ‘Structure-Activity Relationship (SAR) Matrix’ (SARM) methodology that is based upon a special two-step application of the matched molecular pair (MMP) formalism. The SARM method has originally been designed for the extraction, organization, and visualization of compound series and associated SAR information from compound data sets. It has been further developed and adapted for other applications including compound design, activity prediction, library extension, and the navigation of multi-target activity spaces. The SARM approach and its extensions are presented here in context to introduce different types of applications and provide an example for the evolution of a computational methodology in pharmaceutical research

    Support Vector Machine Classification and Regression Prioritize Different Structural Features for Binary Compound Activity and Potency Value Prediction

    Get PDF
    In computational chemistry and chemoinformatics, the support vector machine (SVM) algorithm is among the most widely used machine learning methods for the identification of new active compounds. In addition, support vector regression (SVR) has become a preferred approach for modeling nonlinear structure−activity relationships and predicting compound potency values. For the closely related SVM and SVR methods, fingerprints (i.e., bit string or feature set representations of chemical structure and properties) are generally preferred descriptors. Herein, we have compared SVM and SVR calculations for the same compound data sets to evaluate which features are responsible for predictions. On the basis of systematic feature weight analysis, rather surprising results were obtained. Fingerprint features were frequently identified that contributed differently to the corresponding SVM and SVR models. The overlap between feature sets determining the predictive performance of SVM and SVR was only very small. Furthermore, features were identified that had opposite effects on SVM and SVR predictions. Feature weight analysis in combination with feature mapping made it also possible to interpret individual predictions, thus balancing the black box character of SVM/SVR modeling

    Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]

    Get PDF
    For the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected from ChEMBL were traced back to thousands of original publications, activity records including compound, assay, and target information were systematically generated, and their distributions across the literature were determined. In addition, publications were categorized on the basis of activity records. Furthermore, compound promiscuity, defined as the ability of small molecules to specifically interact with multiple target proteins, was analyzed in light of publication statistics, thus adding another layer of information to promiscuity assessment. It was shown that the degree of compound promiscuity was not influenced by increasing numbers of source publications. Rather, most non-promiscuous as well as promiscuous compounds, regardless of their degree of promiscuity, originated from single publications, which emerged as a characteristic feature of the medicinal chemistry literature

    Ligand Binding Sites of Inducible Costimulator and High Avidity Mutants with Improved Function

    Get PDF
    Interaction between inducible costimulator (ICOS) and its ligand is implicated in the induction of cell-mediated and humoral immune responses. However, the molecular details of this interaction are unknown. We report here a mutagenesis analysis of residues in ICOS that are critical for ligand binding. A three-dimensional model of the extracellular immunoglobulin-like domain of ICOS was used to map the residues conserved within the CD28 family. This analysis identified a surface patch containing the characteristic “PPP” sequence and is conserved in human and mouse ICOS. Mutations in this region of human ICOS reduce or abolish ligand binding. Our results suggest that the ligand binding site in ICOS maps to a region overlapping yet distinct from the CD80/CD86 binding sites in CD28 and cytotoxic T lymphocyte antigen (CTLA)-4. Thus, the analysis suggests that differences in ligand binding specificity between these related costimulatory molecules have evolved by utilization of overlapping regions with different patterns of conserved and nonconserved residues. Two site-specific mutants generated in the course of our studies bound ICOS ligand with higher avidity than wild-type ICOS. An S76E mutant protein of ICOS blocked T cell costimulatory function of ICOS ligand and inhibited T cell response to allogeneic antigens superior to wild-type ICOS. Our studies thus identified critical residues involving in ICOS receptor–ligand interaction and provide new modulators for immune responses
    corecore