Search CORE

FigShare

The benefits of in silico modeling to identify possible small-molecule drugs and their off-target interactions

Author: Blomberg N
Choi SH
Hastings J
Hirschey J
Mire Zloh
Stewart B Kirton
Wang X
Publication venue: 'Future Science Ltd'
Publication date: 30/01/2019
Field of study

Accepted for publication in a future issue of Future Medicinal Chemistry.The research into the use of small molecules as drugs continues to be a key driver in the development of molecular databases, computer-aided drug design software and collaborative platforms. The evolution of computational approaches is driven by the essential criteria that a drug molecule has to fulfill, from the affinity to targets to minimal side effects while having adequate absorption, distribution, metabolism, and excretion (ADME) properties. A combination of ligand- and structure-based drug development approaches is already used to obtain consensus predictions of small molecule activities and their off-target interactions. Further integration of these methods into easy-to-use workflows informed by systems biology could realize the full potential of available data in the drug discovery and reduce the attrition of drug candidates.Peer reviewe

University of Hertfordshire Research Archive

Evolutionary conservation of influenza A PB2 sequences reveals potential target sites for small molecule inhibitors.

Author: Kukol A.
Kukol A.
Patel H.
Patel H.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

The influenza A basic polymerase protein 2 (PB2) functions as part of a heterotrimer to replicate the viral RNA genome. To investigate novel PB2 antiviral target sites, this work identified evolutionary conserved regions across the PB2 protein sequence amongst all sub-types and hosts, as well as ligand binding hot spots which overlap with highly conserved areas. Fifteen binding sites were predicted in different PB2 domains; some of which reside in areas of unknown function. Virtual screening of ~50,000 drug-like compounds showed binding affinities of up to 10.3 kcal/mol. The highest affinity molecules were found to interact with conserved residues including Gln138, Gly222, Ile529, Asn540 and Thr530. A library containing 1738 FDA approved drugs were screened additionally and revealed Paliperidone as a top hit with a binding affinity of -10 kcal/mol. Predicted ligands are ideal leads for new antivirals as they were targeted to evolutionary conserved binding sites

WestminsterResearch

University of Hertfordshire Research Archive

Domain-based small molecule binding site annotation

Author: Dumontier Michel
Feldman Howard J
Hogue Christopher WV
Salama John J
Snyder Kevin A
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Accurate small molecule binding site information for a protein can facilitate studies in drug docking, drug discovery and function prediction, but small molecule binding site protein sequence annotation is sparse. The Small Molecule Interaction Database (SMID), a database of protein domain-small molecule interactions, was created using structural data from the Protein Data Bank (PDB). More importantly it provides a means to predict small molecule binding sites on proteins with a known or unknown structure and unlike prior approaches, removes large numbers of false positive hits arising from transitive alignment errors, non-biologically significant small molecules and crystallographic conditions that overpredict ion binding sites. DESCRIPTION: Using a set of co-crystallized protein-small molecule structures as a starting point, SMID interactions were generated by identifying protein domains that bind to small molecules, using NCBI's Reverse Position Specific BLAST (RPS-BLAST) algorithm. SMID records are available for viewing at . The SMID-BLAST tool provides accurate transitive annotation of small-molecule binding sites for proteins not found in the PDB. Given a protein sequence, SMID-BLAST identifies domains using RPS-BLAST and then lists potential small molecule ligands based on SMID records, as well as their aligned binding sites. A heuristic ligand score is calculated based on E-value, ligand residue identity and domain entropy to assign a level of confidence to hits found. SMID-BLAST predictions were validated against a set of 793 experimental small molecule interactions from the PDB, of which 472 (60%) of predicted interactions identically matched the experimental small molecule and of these, 344 had greater than 80% of the binding site residues correctly identified. Further, we estimate that 45% of predictions which were not observed in the PDB validation set may be true positives. CONCLUSION: By focusing on protein domain-small molecule interactions, SMID is able to cluster similar interactions and detect subtle binding patterns that would not otherwise be obvious. Using SMID-BLAST, small molecule targets can be predicted for any protein sequence, with the only limitation being that the small molecule must exist in the PDB. Validation results and specific examples within illustrate that SMID-BLAST has a high degree of accuracy in terms of predicting both the small molecule ligand and binding site residue positions for a query protein

Maastricht University Research Portal

Springer - Publisher Connector

LigASite—a database of biologically relevant binding sites in proteins with known apo-structures

Author: Altschul
B. H. Dessailly
Barata
Berman
C. A. Orengo
Dessailly
Gold
Henrick
Ivanisenko
Jones
Kellenberger
Laskowski
M. F. Lensink
M ller
Najmanovich
Nissink
Porter
Rigden
S. J. Wodak
Skolnick
Sobolev
Wallace
Wang
Watson
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are still sorely lacking. Here, we present LigASite (‘LIGand Attachment SITE’), a gold-standard dataset of binding sites in 550 proteins of known structures. LigASite consists exclusively of biologically relevant binding sites in proteins for which at least one apo- and one holo-structure are available. In defining the binding sites for each protein, information from all holo-structures is combined, considering in each case the quaternary structure defined by the PQS server. LigASite is built using simple criteria and is automatically updated as new structures become available in the PDB, thereby guaranteeing optimal data coverage over time. Both a redundant and a culled non-redundant version of the dataset is available at http://www.scmbb.ulb.ac.be/Users/benoit/LigASite. The website interface allows users to search the dataset by PDB identifiers, ligand identifiers, protein names or sequence, and to look for structural matches as defined by the CATH homologous superfamilies. The datasets can be downloaded from the website as Schema-validated XML files or comma-separated flat files

Gypsum-DL: an open-source program for preparing small-molecule libraries for structure-based virtual screening

Author: Durrant Jacob D.
Green Harrison
Milliken Katherine A.
Morales Guillermo A.
Ringe John J.
Ropp Patrick J.
Spiegel Jacob O.
Walker Jennifer L.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/05/2019
Field of study

Computational techniques such as structure-based virtual screening require carefully prepared 3D models of potential small-molecule ligands. Though powerful, existing commercial programs for virtual-library preparation have restrictive and/or expensive licenses. Freely available alternatives, though often effective, do not fully account for all possible ionization, tautomeric, and ring-conformational variants. We here present Gypsum-DL, a free, robust open-source program that addresses these challenges. As input, Gypsum-DL accepts virtual compound libraries in SMILES or flat SDF formats. For each molecule in the virtual library, it enumerates appropriate ionization, tautomeric, chiral, cis/trans isomeric, and ring-conformational forms. As output, Gypsum-DL produces an SDF file containing each molecular form, with 3D coordinates assigned. To demonstrate its utility, we processed 1558 molecules taken from the NCI Diversity Set VI and 56,608 molecules taken from a Distributed Drug Discovery (D3) combinatorial virtual library. We also used 4463 high-quality protein-ligand complexes from the PDBBind database to show that Gypsum-DL processing can improve virtual-screening pose prediction. Gypsum-DL is available free of charge under the terms of the Apache License, Version 2.0

IUPUIScholarWorks

Pocketome: an encyclopedia of small-molecule binding sites in 4D

Author: Abagyan
Abagyan
Andrey V. Ilatovskiy
Benson
Bottegoni
Bottegoni
Carlsson
de Graaf
de Graaf
Fabbro
Günther
Iacob
Irina Kufareva
Irwin
Jaakola
Juritz
Kalinina
Katritch
Katritch
Kufareva
Kufareva
Lebon
Lee
Luo
Meslamani
Nabuurs
Raush
Rose
Ruben Abagyan
Scheer
Shoemaker
The UniProt
Vanhee
Xu
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

The importance of binding site plasticity in protein–ligand interactions is well-recognized, and so are the difficulties in predicting the nature and the degree of this plasticity by computational means. To assist in understanding the flexible protein–ligand interactions, we constructed the Pocketome, an encyclopedia of about one thousand experimentally solved conformational ensembles of druggable binding sites in proteins, grouped by location and consistent chain/cofactor composition. The multiplicity of pockets within the ensembles adds an extra, fourth dimension to the Pocketome entry data. Within each ensemble, the pockets were carefully classified by the degree of their pairwise similarity and compatibility with different ligands. The core of the Pocketome is derived regularly and automatically from the current releases of the Protein Data Bank and the Uniprot Knowledgebase; this core is complemented by entries built from manually provided seed ligand locations. The Pocketome website (www.pocketome.org) allows searching for the sites of interest, analysis of conformational clusters, important residues, binding compatibility matrices and interactive visualization of the ensembles using the ActiveICM web browser plugin. The Pocketome collection can be used to build multi-conformational docking and 3D activity models as well as to design cross-docking and virtual ligand screening benchmarks

CiteSeerX

eScholarship - University of California

Knowledge-based annotation of small molecule binding sites in proteins

Author: Bryant Stephen H
Madej Thomas
Panchenko Anna R
Shoemaker Benjamin A
Thangudu Ratna R
Tyagi Manoj
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The study of protein-small molecule interactions is vital for understanding protein function and for practical applications in drug discovery. To benefit from the rapidly increasing structural data, it is essential to improve the tools that enable large scale binding site prediction with greater emphasis on their biological validity. Results We have developed a new method for the annotation of protein-small molecule binding sites, using inference by homology, which allows us to extend annotation onto protein sequences without experimental data available. To ensure biological relevance of binding sites, our method clusters similar binding sites found in homologous protein structures based on their sequence and structure conservation. Binding sites which appear evolutionarily conserved among non-redundant sets of homologous proteins are given higher priority. After binding sites are clustered, position specific score matrices (PSSMs) are constructed from the corresponding binding site alignments. Together with other measures, the PSSMs are subsequently used to rank binding sites to assess how well they match the query and to better gauge their biological relevance. The method also facilitates a succinct and informative representation of observed and inferred binding sites from homologs with known three-dimensional structures, thereby providing the means to analyze conservation and diversity of binding modes. Furthermore, the chemical properties of small molecules bound to the inferred binding sites can be used as a starting point in small molecule virtual screening. The method was validated by comparison to other binding site prediction methods and to a collection of manually curated binding site annotations. We show that our method achieves a sensitivity of 72% at predicting biologically relevant binding sites and can accurately discriminate those sites that bind biological small molecules from non-biological ones. Conclusions A new algorithm has been developed to predict binding sites with high accuracy in terms of their biological validity. It also provides a common platform for function prediction, knowledge-based docking and for small molecule virtual screening. The method can be applied even for a query sequence without structure. The method is available at <url>http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi</url>.</p

Springer - Publisher Connector

Public Library of Science (PLOS)

The Overlap of Small Molecule and Protein Binding Sites within Families of Protein Structures

Author: A Leo-Macias
A Sali
A Shulman-Peleg
AA Bogan
AC Stuart
AG Murzin
AL Brass
AM Sanchez
Andrej Sali
AP Higueruelo
B de Chassey
B Ma
B Qian
BR Howard
CD Thanos
CL Drum
D Datta
D Dimitropoulos
D Wilson
DA Erlanson
DR Caffrey
E Sokolskaja
FP Davis
FP Davis
Fred P. Davis
GJ Kleywegt
GR Crabtree
H Zhu
J Kuriyan
JA Wells
JJ Ellis
JM Chandonia
KS Thorn
L Parthasarathi
LL Conte
M Wurtele
MA Marti-Renom
MD Dyer
MR Arkin
MR Arkin
O Keskin
Philip E. Bourne
R Elber
R Sedrani
RA Laskowski
RP Bhattacharyya
S Eyrisch
S Jones
SJ Projan
SL Lebeis
SR Collins
T Berg
T Clackson
T Kortemme
TD Bunney
X Wang
Y Ofran
Publication venue: Public Library of Science
Publication date: 01/02/2010
Field of study

Protein–protein interactions are challenging targets for modulation by small molecules. Here, we propose an approach that harnesses the increasing structural coverage of protein complexes to identify small molecules that may target protein interactions. Specifically, we identify ligand and protein binding sites that overlap upon alignment of homologous proteins. Of the 2,619 protein structure families observed to bind proteins, 1,028 also bind small molecules (250–1000 Da), and 197 exhibit a statistically significant (p<0.01) overlap between ligand and protein binding positions. These “bi-functional positions”, which bind both ligands and proteins, are particularly enriched in tyrosine and tryptophan residues, similar to “energetic hotspots” described previously, and are significantly less conserved than mono-functional and solvent exposed positions. Homology transfer identifies ligands whose binding sites overlap at least 20% of the protein interface for 35% of domain–domain and 45% of domain–peptide mediated interactions. The analysis recovered known small-molecule modulators of protein interactions as well as predicted new interaction targets based on the sequence similarity of ligand binding sites. We illustrate the predictive utility of the method by suggesting structural mechanisms for the effects of sanglifehrin A on HIV virion production, bepridil on the cellular entry of anthrax edema factor, and fusicoccin on vertebrate developmental pathways. The results, available at http://pibase.janelia.org, represent a comprehensive collection of structurally characterized modulators of protein interactions, and suggest that homologous structures are a useful resource for the rational design of interaction modulators