38 research outputs found

    Best of Both Worlds: On the Complementarity of Ligand-Based and Structure-Based Virtual Screening

    No full text
    Virtual screening with docking is an integral component of drug design, particularly during hit finding phases. While successful prospective studies of virtual screening exist, it remains a significant challenge to identify best practices a priori due to the many factors that influence the final outcome, including targets, data sets, software, metrics, and expert knowledge of the users. This study investigates the extent to which ligand-based methods can be applied to improve structure-based methods. The use of ligand-based methods to modulate the number of hits identified using the protein–ligand complex and also the diversity of these hits from the crystallographic ligand is discussed. In this study, 40 CDK2 ligand complexes were used together with two external data sets containing both actives and inactives from GlaxoSmithKline (GSK) and actives and decoys from the Directory of Useful Decoys (DUD). Results show how ligand-based modeling can be used to select a more appropriate protein conformation for docking, as well as to assess the reliability of the docking experiment. The time gained by reducing the pool of virtual screening candidates via ligand-based similarity can be invested in more accurate docking procedures, as well as in downstream labor-intensive approaches (e.g., visual inspection) maximizing the use of the chemical and biological information available. This provides a framework for molecular modeling scientists that are involved in initiating virtual screening campaigns with practical advice to make best use of the information available to them

    Applications of Systematic Molecular Scaffold Enumeration to Enrich Structure–Activity Relationship Information

    No full text
    Establishing structure–activity relationships (SARs) in hit identification during early stage drug discovery is important in accelerating hit confirmation and expansion. We describe the development of <i>EnCore</i>, a systematic molecular scaffold enumeration protocol using single atom mutations, to enhance the application of objective scaffold definitions and to enrich SAR information from analysis of high-throughput screening output. A list of 43 literature medicinal chemistry compound series, each containing a minimum of 100 compounds, published in the <i>Journal of Medicinal Chemistry</i> was collated to validate the protocol. Analysis using the top representative Level 1 scaffolds this list of literature compound series demonstrated that <i>EnCore</i> could mimic the scaffold exploration conducted when establishing SAR. When <i>EnCore</i> was applied to analyze an HTS library containing over 200 000 compounds, we observed that over 70% of the molecular scaffolds matched extant scaffolds within the library after enumeration. In particular, over 60% of the singleton scaffolds with only one representative compound were found to have structurally related compounds after enumeration. These results illustrate the potential of <i>EnCore</i> to enrich SAR information. A case study using literature cyclooxygenase-2 inhibitors further demonstrates the advantage of <i>EnCore</i> application in establishing SAR from structurally related scaffolds. <i>EnCore</i> complements literature enumeration methods in enabling changes to the physicochemical properties of molecular scaffolds and structural modifications to aliphatic rings and linkers. The enumerated scaffold clusters generated would constitute a comprehensive collection of scaffolds for scaffold morphing and hopping

    MOESM1 of Mapping the 3D structures of small molecule binding sites

    No full text
    Additional file 1. A list of PDB IDs for two curated datasets utilised in this study: ATP-bound cAMP-dependent Kinase ensemble (n = 5) and the Pilot Dataset (n = 1085). Figure S1. Mean ROC curves for a series of experiments to determine the optimum site size to generate binding site patches surrounding fpocket surface atoms. Table S1. Summary of proteins from the sc-PDB (2013) that were considered in ROC retrieval studies. Table S2. Summary of the Pilot Dataset

    Increasing the Coverage of Medicinal Chemistry-Relevant Space in Commercial Fragments Screening

    No full text
    Analyzing the chemical space coverage in commercial fragment screening collections revealed the overlap between bioactive medicinal chemistry substructures and rule-of-three compliant fragments is only ∼25%. We recommend including these fragments in fragment screening libraries to maximize confidence in discovering hit matter within known bioactive chemical space, while incorporation of nonoverlapping substructures could offer novel hits in screening libraries. Using principal component analysis, polar and three-dimensional substructures display a higher-than-average enrichment of bioactive compounds, indicating increasing representation of these substructures may be beneficial in fragment screening

    Plane of Best Fit: A Novel Method to Characterize the Three-Dimensionality of Molecules

    No full text
    We describe a computational method, plane of best fit (PBF), to quantify and characterize the 3D character of molecules. This method is rapid and amenable to analysis of large diverse data sets. We compare PBF with alternative literature methods used to assess 3D character and apply the method to diverse data sets of fragment-like, drug-like, and natural product compound libraries. We show that exemplar fragment libraries underexploit the potential of 3D character in fragment-like chemical space and that drug-like molecules in the libraries examined are predominantly 2D in character

    China’s rural education in transition

    No full text
    Taxonomic placement of each novel genome by 16S similarity, composition (PhyloPythiaS+), and multiple marker gene similarities (Phylosift). (CSV 312 bytes

    Scaffold Diversity of Exemplified Medicinal Chemistry Space

    No full text
    The scaffold diversity of 7 representative commercial and proprietary compound libraries is explored for the first time using both Murcko frameworks and Scaffold Trees. We show that Level 1 of the Scaffold Tree is useful for the characterization of scaffold diversity in compound libraries and offers advantages over the use of Murcko frameworks. This analysis also demonstrates that the majority of compounds in the libraries we analyzed contain only a small number of well represented scaffolds and that a high percentage of singleton scaffolds represent the remaining compounds. We use Tree Maps to clearly visualize the scaffold space of representative compound libraries, for example, to display highly populated scaffolds and clusters of structurally similar scaffolds. This study further highlights the need for diversification of compound libraries used in hit discovery by focusing library enrichment on the synthesis of compounds with novel or underrepresented scaffolds

    survey_data_5km_1

    No full text
    The attached ascII format file contains locations of all detections (1) and negative locations (0) in a 5 km x 5 km grid. All cells with no data are shown with the value -999

    Scaffold-Focused Virtual Screening: Prospective Application to the Discovery of TTK Inhibitors

    No full text
    We describe and apply a scaffold-focused virtual screen based upon scaffold trees to the mitotic kinase TTK (MPS1). Using level 1 of the scaffold tree, we perform both 2D and 3D similarity searches between a query scaffold and a level 1 scaffold library derived from a 2 million compound library; 98 compounds from 27 unique top-ranked level 1 scaffolds are selected for biochemical screening. We show that this scaffold-focused virtual screen prospectively identifies eight confirmed active compounds that are structurally differentiated from the query compound. In comparison, 100 compounds were selected for biochemical screening using a virtual screen based upon whole molecule similarity resulting in 12 confirmed active compounds that are structurally similar to the query compound. We elucidated the binding mode for four of the eight confirmed scaffold hops to TTK by determining their protein–ligand crystal structures; each represents a ligand-efficient scaffold for inhibitor design
    corecore