Search CORE

Queen's University Belfast Research Portal

UCL Discovery

BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

Author: Bing Xiong
Jie Wu
David L Burk
Mengzhu Xue
Hualiang Jiang
Jingkang Shen
WA Warr
A Kouranov
A Godzik
OC Redfern
SG Buchanan
K Lundstrom
DF Veber
D Lee
SF Altschul
A Bateman
BE Engelhardt
J Soding
C Chothia
L Holm
AG Murzin
CA Orengo
A Andreeva
TA Binkowski
GJ Kleywegt
RA Laskowski
RB Russell
S Schmitt
A Shulman-Peleg
AC Wallace
T Hamelryck
M Ashburner
P Willett
HM Berman
GP Brady
WR Pearson
A Gutteridge
T Fawcett
ND Gold
J Blaszczyk
K Yeturu
RA Laskowski
L Xie
MP Liang
M Brylinski
XY Jiang
D Pal
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Genome sequencing and post-genomics projects such as structural genomics are extending the frontier of the study of sequence-structure-function relationship of genes and their products. Although many sequence/structure-based methods have been devised with the aim of deciphering this delicate relationship, there still remain large gaps in this fundamental problem, which continuously drives researchers to develop novel methods to extract relevant information from sequences and structures and to infer the functions of newly identified genes by genomics technology. Results Here we present an ultrafast method, named BSSF(Binding Site Similarity & Function), which enables researchers to conduct similarity searches in a comprehensive three-dimensional binding site database extracted from PDB structures. This method utilizes a fingerprint representation of the binding site and a validated statistical Z-score function scheme to judge the similarity between the query and database items, even if their similarities are only constrained in a sub-pocket. This fingerprint based similarity measurement was also validated on a known binding site dataset by comparing with geometric hashing, which is a standard 3D similarity method. The comparison clearly demonstrated the utility of this ultrafast method. After conducting the database searching, the hit list is further analyzed to provide basic statistical information about the occurrences of Gene Ontology terms and Enzyme Commission numbers, which may benefit researchers by helping them to design further experiments to study the query proteins. Conclusions This ultrafast web-based system will not only help researchers interested in drug design and structural genomics to identify similar binding sites, but also assist them by providing further analysis of hit list from database searching.</p

Southampton (e-Prints Soton)

Online Research Database In Technology

BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

Author: A Andreeva
A Bateman
A Godzik
A Gutteridge
A Kouranov
A Shulman-Peleg
AC Wallace
AG Murzin
BE Engelhardt
Bing Xiong
C Chothia
CA Orengo
D Lee
D Pal
David L Burk
DF Veber
GJ Kleywegt
GP Brady
HM Berman
Hualiang Jiang
J Blaszczyk
J Soding
Jie Wu
Jingkang Shen
K Lundstrom
K Yeturu
L Holm
L Xie
M Ashburner
M Brylinski
Mengzhu Xue
MP Liang
ND Gold
OC Redfern
P Willett
RA Laskowski
RA Laskowski
RB Russell
S Schmitt
SF Altschul
SG Buchanan
T Fawcett
T Hamelryck
TA Binkowski
WA Warr
WR Pearson
XY Jiang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Using Multiple Microenvironments to Find Similar Ligand-Binding Sites: Application to Kinase Inhibitor Binding

Author: A Brakoulias
A Kahraman
A Shulman-Peleg
AR Ortiz
B Apsel
C Schalon
D Kuhn
Daniel A. Beard
DR Banatao
E Kellenberger
GW Tang
J Hert
J Overington
JA Capra
JD Benson
K Kinoshita
L Wei
L Xie
L Xie
M Weisel
MA Lill
MW Karaman
N Weill
O Dym
OK Mirzoeva
R Morphy
R Najmanovich
RJ Morris
RP Sheridan
Russ B. Altman
S Subbiah
S Wu
S Yoon
SC Bagley
SC Kellenberger E
SJ Teague
T Liu
Tianyun Liu
WS Yang
XH Ma
Publication venue: Public Library of Science
Publication date: 01/12/2011
Field of study

The recognition of cryptic small-molecular binding sites in protein structures is important for understanding off-target side effects and for recognizing potential new indications for existing drugs. Current methods focus on the geometry and detailed chemical interactions within putative binding pockets, but may not recognize distant similarities where dynamics or modified interactions allow one ligand to bind apparently divergent binding pockets. In this paper, we introduce an algorithm that seeks similar microenvironments within two binding sites, and assesses overall binding site similarity by the presence of multiple shared microenvironments. The method has relatively weak geometric requirements (to allow for conformational change or dynamics in both the ligand and the pocket) and uses multiple biophysical and biochemical measures to characterize the microenvironments (to allow for diverse modes of ligand binding). We term the algorithm PocketFEATURE, since it focuses on pockets using the FEATURE system for characterizing microenvironments. We validate PocketFEATURE first by showing that it can better discriminate sites that bind similar ligands from those that do not, and by showing that we can recognize FAD-binding sites on a proteome scale with Area Under the Curve (AUC) of 92%. We then apply PocketFEATURE to evolutionarily distant kinases, for which the method recognizes several proven distant relationships, and predicts unexpected shared ligand binding. Using experimental data from ChEMBL and Ambit, we show that at high significance level, 40 kinase pairs are predicted to share ligands. Some of these pairs offer new opportunities for inhibiting two proteins in a single pathway

Partial Order Optimum Likelihood (POOL): Maximum Likelihood Prediction of Protein Active Site Residues Using 3D Structure and Sequence Properties

Author: A Gutteridge
A Shulman-Peleg
AH Elcock
AP Bradley
C Enroth
CT Porter
D Ming
E Youn
F Glaser
F Wilcoxon
G Amitai
G Cheng
GJ Bartlett
J Ko
J Liang
JD Madura
L Xie
Leonel F. Murga
LF Murga
M Ota
M Silberstein
Mary Jo Ondrechen
Michael Levitt
MJ Best
MJ Ondrechen
MK Gilson
N Petrova
P Domingos
R Edgar
R Greaves
RA Laskowski
RA Laskowski
Ronald J. Williams
T Robertson
TA Binkowski
W Tong
W Tong
Wenxu Tong
Y Wei
Y Wei
Ying Wei
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

A new monotonicity-constrained maximum likelihood approach, called Partial Order Optimum Likelihood (POOL), is presented and applied to the problem of functional site prediction in protein 3D structures, an important current challenge in genomics. The input consists of electrostatic and geometric properties derived from the 3D structure of the query protein alone. Sequence-based conservation information, where available, may also be incorporated. Electrostatics features from THEMATICS are combined with multidimensional isotonic regression to form maximum likelihood estimates of probabilities that specific residues belong to an active site. This allows likelihood ranking of all ionizable residues in a given protein based on THEMATICS features. The corresponding ROC curves and statistical significance tests demonstrate that this method outperforms prior THEMATICS-based methods, which in turn have been shown previously to outperform other 3D-structure-based methods for identifying active site residues. Then it is shown that the addition of one simple geometric property, the size rank of the cleft in which a given residue is contained, yields improved performance. Extension of the method to include predictions of non-ionizable residues is achieved through the introduction of environment variables. This extension results in even better performance than THEMATICS alone and constitutes to date the best functional site predictor based on 3D structure only, achieving nearly the same level of performance as methods that use both 3D structure and sequence alignment data. Finally, the method also easily incorporates such sequence alignment data, and when this information is included, the resulting method is shown to outperform the best current methods using any combination of sequence alignments and 3D structures. Included is an analysis demonstrating that when THEMATICS features, cleft size rank, and alignment-based conservation scores are used individually or in combination THEMATICS features represent the single most important component of such classifiers

The Overlap of Small Molecule and Protein Binding Sites within Families of Protein Structures

Author: A Leo-Macias
A Sali
A Shulman-Peleg
AA Bogan
AC Stuart
AG Murzin
AL Brass
AM Sanchez
Andrej Sali
AP Higueruelo
B de Chassey
B Ma
B Qian
BR Howard
CD Thanos
CL Drum
D Datta
D Dimitropoulos
D Wilson
DA Erlanson
DR Caffrey
E Sokolskaja
FP Davis
FP Davis
Fred P. Davis
GJ Kleywegt
GR Crabtree
H Zhu
J Kuriyan
JA Wells
JJ Ellis
JM Chandonia
KS Thorn
L Parthasarathi
LL Conte
M Wurtele
MA Marti-Renom
MD Dyer
MR Arkin
MR Arkin
O Keskin
Philip E. Bourne
R Elber
R Sedrani
RA Laskowski
RP Bhattacharyya
S Eyrisch
S Jones
SJ Projan
SL Lebeis
SR Collins
T Berg
T Clackson
T Kortemme
TD Bunney
X Wang
Y Ofran
Publication venue: Public Library of Science
Publication date: 01/02/2010
Field of study

Protein–protein interactions are challenging targets for modulation by small molecules. Here, we propose an approach that harnesses the increasing structural coverage of protein complexes to identify small molecules that may target protein interactions. Specifically, we identify ligand and protein binding sites that overlap upon alignment of homologous proteins. Of the 2,619 protein structure families observed to bind proteins, 1,028 also bind small molecules (250–1000 Da), and 197 exhibit a statistically significant (p<0.01) overlap between ligand and protein binding positions. These “bi-functional positions”, which bind both ligands and proteins, are particularly enriched in tyrosine and tryptophan residues, similar to “energetic hotspots” described previously, and are significantly less conserved than mono-functional and solvent exposed positions. Homology transfer identifies ligands whose binding sites overlap at least 20% of the protein interface for 35% of domain–domain and 45% of domain–peptide mediated interactions. The analysis recovered known small-molecule modulators of protein interactions as well as predicted new interaction targets based on the sequence similarity of ligand binding sites. We illustrate the predictive utility of the method by suggesting structural mechanisms for the effects of sanglifehrin A on HIV virion production, bepridil on the cellular entry of anthrax edema factor, and fusicoccin on vertebrate developmental pathways. The results, available at http://pibase.janelia.org, represent a comprehensive collection of structurally characterized modulators of protein interactions, and suggest that homologous structures are a useful resource for the rational design of interaction modulators

eScholarship - University of California

Structural Similarity and Classification of Protein Interaction Interfaces

Author: A Andreeva
A Shulman-Peleg
AS Aytuna
B Alberts
BE Boser
Bin Pang
C Prieto
C Winter
CA Orengo
CD Livingstone
CE Stebbins
Chi-Ren Shyu
CJ Tsai
D Beckett
D Comaniciu
D Schneidman-Duhovny
Dmitry Korkin
ED Levy
FB Sheinerman
FP Davis
G Prehna
HM Berman
I Abbasi
I Guyon
J Fauchere
J Janin
J Teyra
JM Chandonia
M Guharoy
M Hall
M Shatsky
MF Lensink
MT Shamim
Nan Zhao
NC Elde
O Keskin
O Keskin
OV Belyaeva
P Aloy
P Aloy
P Ciaccia
P Rousseeuw
RA Laskowski
S Hubbard
S Hubbard
S Huo
S Jones
S Theodoridis
T Joachims
TS Furey
U Ogmen
Vladimir N. Uversky
ZA Hamburger
Publication venue: Public Library of Science
Publication date: 12/05/2011
Field of study

Interactions between proteins play a key role in many cellular processes. Studying protein-protein interactions that share similar interaction interfaces may shed light on their evolution and could be helpful in elucidating the mechanisms behind stability and dynamics of the protein complexes. When two complexes share structurally similar subunits, the similarity of the interaction interfaces can be found through a structural superposition of the subunits. However, an accurate detection of similarity between the protein complexes containing subunits of unrelated structure remains an open problem

Fabrication Principles and Their Contribution to the Superior In Vivo Therapeutic Efficacy of Nano-Liposomes Remote Loaded with Glucocorticoids

Author: A Abdul-Hai
A Ciccone
A Fritze
A Gabizon
A Gabizon
A Schroeder
Alberto Gabizon
Alex Sigal
AM Samuni
B Kornek
BD Anderson
BD Williams
C Chemin
D Czock
D Zucker
D Zucker
DD Breimer
DD Lasic
Deborah Tulchinsky
Dimitris Fatouros
Dina Tzemach
E Charmandari
E Kluza
E London
EFL Dubois
F Blanchette
G Haran
GJ Grant
H Maeda
H Shmeeda
IH Shaw
J Schmidt
JA Champion
JM Metselaar
JM Metselaar
JP Wong
Keren Turjeman
L Steinman
LI Mckay
M Banciu
M Buttmann
M Shinitzky
MD Smith
NC Munshi
NJ Zuidam
O Garbuzenko
O Garbuzenko
P Mukerjee
Pablo Kizelsztein
PL Chang
RH Whitham
RP Schleimer
RV Sionov
S Clerc
S Greenstein
S Yetgin
SA Abraham
T Moreau
T Peleg-Shulman
T Siegal
T Ziemssen
V Rousseau
V Wasserman
VI Kaledin
VV Ranade
WD Stein
Y Avnir
Y Avnir
Y Barenholz
Y Barenholz
Y Barenholz
Y Barenholz
Yechezkel Barenholz
Yuval Avnir
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

We report here the design, development and performance of a novel formulation of liposome- encapsulated glucocorticoids (GCs). A highly efficient (>90%) and stable GC encapsulation was obtained based on a transmembrane calcium acetate gradient driving the active accumulation of an amphipathic weak acid GC pro-drug into the intraliposome aqueous compartment, where it forms a GC-calcium precipitate. We demonstrate fabrication principles that derive from the physicochemical properties of the GC and the liposomal lipids, which play a crucial role in GC release rate and kinetics. These principles allow fabrication of formulations that exhibit either a fast, second-order (t1/2 ∼1 h), or a slow, zero-order release rate (t1/2 ∼ 50 h) kinetics. A high therapeutic efficacy was found in murine models of experimental autoimmune encephalomyelitis (EAE) and hematological malignancies

CiteSeerX

Combining specificity determining and conserved residues improves functional site prediction

Author: A Carro
A del Sol Mesa
A Shulman-Peleg
A Stark
A Stark
A Teplyakov
AE Todd
ATR Laurie
B Ma
B Mirkin
B Reva
B Zambelli
BJ Polacco
C Romier
C Yeats
CT Porter
DA Rodionov
EA Gaucher
G Dodson
G Koczyk
G Wu
GJ Kleywegt
H Yao
IM Wallace
IN Shindyalov
J Capra
J Dundas
J Pei
J-M Chandonia
JA Capra
JE Donald
JR Manning
K Ye
K Ye
KA Feenstra
KM Mayer
L Aravind
L Holm
LA Mirny
M Hendlich
M Landau
MA Willis
Mikhail S Gelfand
O Lichtarge
Olga V Kalinina
OV Kalinina
OV Kalinina
P Aloy
PP Khil
PP Khil
R Landgraf
RD Finn
RJ Edwards
Robert B Russell
S Ahmad
S Chakrabarti
S Sankararaman
S Whelan
SS Hannenhalli
T Maier
T Pupko
WR Taylor
WSJ Valdar
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities. Results Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs), as those occupied by conserved residues within sub-groups of proteins in a family having a common specificity, but differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discusse several illustrative examples. Conclusion The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.</p

Homology Inference of Protein-Protein Interactions via Conserved Binding Sites

Author: A Marchler-Bauer
A Marchler-Bauer
A Marchler-Bauer
A Shulman-Peleg
AJ Walhout
Anna R. Panchenko
B Burgess
BA Shoemaker
BA Shoemaker
BA Shoemaker
BG Ma
BH Dessailly
D Kemmer
Dachuan Zhang
E Krissinel
E Krissinel
ED Levy
ER Jefferson
H Chen
H Neuvirth
H Yu
H Zhu
HM Berman
I Ispolatov
J Chen
J Kim
J Kirn
JE Dayhoff
JF Gibrat
K Hashimoto
K Henrick
L Xue
LR Matthews
M Gribskov
M Persico
Manoj Tyagi
MP Stumpf
N Slonim
P Aloy
P Fariselli
Q Xu
QC Zhang
Ratna R. Thangudu
RH Holm
RR Thangudu
S Henikoff
S Liang
S Mika
S Mintz
SF Altschul
Stephen H. Bryant
T Reguly
Thomas Madej
Vladimir N. Uversky
WE Newton
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The coverage and reliability of protein-protein interactions determined by high-throughput experiments still needs to be improved, especially for higher organisms, therefore the question persists, how interactions can be verified and predicted by computational approaches using available data on protein structural complexes. Recently we developed an approach called IBIS (Inferred Biomolecular Interaction Server) to predict and annotate protein-protein binding sites and interaction partners, which is based on the assumption that the structural location and sequence patterns of protein-protein binding sites are conserved between close homologs. In this study first we confirmed high accuracy of our method and found that its accuracy depends critically on the usage of all available data on structures of homologous complexes, compared to the approaches where only a non-redundant set of complexes is employed. Second we showed that there exists a trade-off between specificity and sensitivity if we employ in the prediction only evolutionarily conserved binding site clusters or clusters supported by only one observation (singletons). Finally we addressed the question of identifying the biologically relevant interactions using the homology inference approach and demonstrated that a large majority of crystal packing interactions can be correctly identified and filtered by our algorithm. At the same time, about half of biological interfaces that are not present in the protein crystallographic asymmetric unit can be reconstructed by IBIS from homologous complexes without the prior knowledge of crystal parameters of the query protein

CiteSeerX