Search CORE

39,925 research outputs found

Exploring the potential of 3D Zernike descriptors and SVM for protein\u2013protein interface prediction

Author: Daberdaku Sebastian
Ferrari Carlo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Abstract Background The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. Results In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). Conclusions The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Padova

Deriving a mutation index of carcinogenicity using protein structure and protein interfaces

Author: A Custodio
A David
A Dixit
A Hamosh
A Pal
AJ Bass
Anna Tramontano
B Reva
B Vogelstein
CJ Richardson
CM Croce
D Chasman
D Sims
D Talavera
D Xu
E Krissinel
EC Chao
ER Mardis
F Damm
Frances Pearl
G Birrane
G De Baets
H Boutselakis
H Carter
H Makishima
IA Adzhubei
IS Moreira
J Carlsson
Jarle Hakas
JM Hurst
JM Izarzugaza
JR Morris
K Wang
Konstantinos Mitsopoulos
L Breiman
L Ding
M Li
M Magrane
Marketa Zvelebil
MR Stratton
MR Stratton
MS Greenblatt
MW MacArthur
MY Frederic
Octavio Espinosa
P Flicek
P Kumar
P Srivastava
PA Chan
PA Futreal
PB Crowley
PC Ng
PC Ng
PD Stenson
PH Lee
PT Wan
PV Hornbeck
PY Chou
R Ferla
R Rajasekaran
RJ Kinsella
S Jones
S Sunyaev
S Velankar
SA Forbes
TM Anne
V Ramensky
W Huang da
W Kabsch
X Wang
X Wang
Y Bromberg
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Institute of Cancer Research Repository

Sussex Research Online

FigShare

Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins

Author: Berman Helen M
Jones Susan
Shanahan Hugh P
Thornton Janet M
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2003
Field of study

A method to detect DNA-binding sites on the surface of a protein structure is important for functional annotation. This work describes the analysis of residue patches on the surface of DNA-binding proteins and the development of a method of predicting DNA-binding sites using a single feature of these surface patches. Surface patches and the DNA-binding sites were initially analysed for accessibility, electrostatic potential, residue propensity, hydrophobicity and residue conservation. From this, it was observed that the DNA-binding sites were, in general, amongst the top 10% of patches with the largest positive electrostatic scores. This knowledge led to the development of a prediction method in which patches of surface residues were selected such that they excluded residues with negative electrostatic scores. This method was used to make predictions for a data set of 56 non-homologous DNA-binding proteins. Correct predictions made for 68% of the data set

CiteSeerX

Royal Holloway Research Online

Crossref

Royal Holloway - Pure

PubMed Central

Sussex Research Online

Prediction of protein-protein interaction types using association rule based classification

Author: Gilbert D
Kim JW
Kim S
Park SH
Reyes JA
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

This article has been made available through the Brunel Open Access Publishing Fund - Copyright @ 2009 Park et alBackground: Protein-protein interactions (PPI) can be classified according to their characteristics into, for example obligate or transient interactions. The identification and characterization of these PPI types may help in the functional annotation of new protein complexes and in the prediction of protein interaction partners by knowledge driven approaches. Results: This work addresses pattern discovery of the interaction sites for four different interaction types to characterize and uses them for the prediction of PPI types employing Association Rule Based Classification (ARBC) which includes association rule generation and posterior classification. We incorporated domain information from protein complexes in SCOP proteins and identified 354 domain-interaction sites. 14 interface properties were calculated from amino acid and secondary structure composition and then used to generate a set of association rules characterizing these domain-interaction sites employing the APRIORI algorithm. Our results regarding the classification of PPI types based on a set of discovered association rules shows that the discriminative ability of association rules can significantly impact on the prediction power of classification models. We also showed that the accuracy of the classification can be improved through the use of structural domain information and also the use of secondary structure content. Conclusion: The advantage of our approach is that we can extract biologically significant information from the interpretation of the discovered association rules in terms of understandability and interpretability of rules. A web application based on our method can be found at http://bioinfo.ssu.ac.kr/~shpark/picasso/SHP was supported by the Korea Research Foundation Grant funded by the Korean Government(KRF-2005-214-E00050). JAR has been supported by the Programme Alβan, the European Union Programme of High level Scholarships for Latin America, scholarship E04D034854CL. SK was supported by Soongsil University Research Fund

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive

Abundance of intrinsic disorder in SV-IV, a multifunctional androgen-dependent protein secreted from rat seminal vesicle

Author: Ambrosone
Bairoch
Bornberg-Bauer
Bourhis
Cai
Caporale
Cheng
Coeytaux
Csizmók
Doszatányi
Dosztányi
Dunker
Dunker
Dyson
D’Ambrosio
Esposito
Ferron
Gaboriaud
Galzitskaya
Galzitskaya
Galzitskaya
Garbuzynskiy
Harris
Hirose
Ialenti
Kandala
Kyte
Li
Lin
Linding
Linding
Liu
Lupas
MacCallum
McDonald
Metafora
Metafora
Miele
Murzin
Obradovic
Obradovic
Ostrowski
Pan
Prilusky
Quevillon-Cheruel
Radivojac
Ragone
Romero
Romero
Rüping
Shimizu
Shimizu
Sickmeier
Stiuso
Stiuso
Tompa
Tufano
Uversky
Uversky
Vucetic
Ward
Weathers
Weathers
Wolf
Wootton
Wright
Yang
Publication venue
Publication date: 06/12/2007
Field of study

The potent immunomodulatory, anti-inflammatory and procoagulant properties of the
protein no. 4 secreted from the rat seminal vesicle epithelium (SV-IV) have been
previously found to be modulated by a supramolecular monomer-trimer equilibrium.
More structural details that integrate experimental data into a predictive framework
have recently been reported. Unfortunately, homology modelling and fold-recognition
strategies were not successful in creating a theoretical model of the structural
organization of SV-IV. It was inferred that the global structure of SV-IV is not similar
to any protein of known three-dimensional structure. Reversing the classical approach
to the sequence-structure-function paradigm, in this paper we report on novel
information obtained by comparing physicochemical parameters of SV-IV with two
datasets made of intrinsically unfolded and ideally globular proteins. In addition, we
have analysed the SV-IV sequence by several publicly available disorder-oriented
predictors. Overall, disorder predictions and a re-examination of existing experimental
data strongly suggest that SV-IV needs large plasticity to efficiently interact with the
different targets that characterize its multifaceted biological function and should be
therefore better classified as an intrinsically disordered protein

CiteSeerX

Crossref

Archivio della Ricerca - Università di Salerno

Nature Precedings

Sequence composition and environment effects on residue fluctuations in protein structures

Author: Anatoly M. Ruvinsky
Hubbard S. J.
Ilya A. Vakser
Publication venue: 'AIP Publishing'
Publication date: 28/07/2009
Field of study

The spectrum and scale of fluctuations in protein structures affect the range of cell phenomena, including stability of protein structures or their fragments, allosteric transitions and energy transfer. The study presents a statistical-thermodynamic analysis of relationship between the sequence composition and the distribution of residue fluctuations in protein-protein complexes. A one-node-per residue elastic network model accounting for the nonhomogeneous protein mass distribution and the inter-atomic interactions through the renormalized inter-residue potential is developed. Two factors, a protein mass distribution and a residue environment, were found to determine the scale of residue fluctuations. Surface residues undergo larger fluctuations than core residues, showing agreement with experimental observations. Ranking residues over the normalized scale of fluctuations yields a distinct classification of amino acids into three groups. The structural instability in proteins possibly relates to the high content of the highly fluctuating residues and a deficiency of the weakly fluctuating residues in irregular secondary structure elements (loops), chameleon sequences and disordered proteins. Strong correlation between residue fluctuations and the sequence composition of protein loops supports this hypothesis. Comparing fluctuations of binding site residues (interface residues) with other surface residues shows that, on average, the interface is more rigid than the rest of the protein surface and Gly, Ala, Ser, Cys, Leu and Trp have a propensity to form more stable docking patches on the interface. The findings have broad implications for understanding mechanisms of protein association and stability of protein structures.Comment: 8 pages, 4 figure

arXiv.org e-Print Archive

Crossref

KU ScholarWorks

PubMed Central

Specialized dynamical properties of promiscuous residues revealed by simulated conformational ensembles

Author: Abecasis G. R.
Alessandro Pandini
Altschul S. F.
Altshuler D. M.
Amadei A.
Aranda B.
Arianna Fornili
Bahar I.
Bahar I.
Bahar I.
Berendsen H.
Bhardwaj N.
Bobay B. G.
Boehr D. D.
Bogan A. A.
Bordogna A.
Bouvier B.
Brookes A. J.
Camacho C.
Carbonell P.
Chandonia J.-M.
Cover T. M.
Cukuroglu E.
Cumming G.
Daily M. D.
Dasgupta B.
Davis F. P.
de Groot B. L.
de Groot B. L.
De Simone A.
del Sol A.
DeLano W.
Dobbins S. E.
Dong Q.
Doruker P.
Dosztányi Z.
Dunbrack R. L.
Dyson H. J.
Echave J.
Ekman D.
Erijman A.
Essmann U.
Eyrisch S.
Fernández A.
Ferrer-Costa C.
Fong J. H.
Fornili A.
Franca Fraternali
Fraternali F.
Goldenberg O.
Haliloglu T.
Haliloglu T.
Hamosh A.
Han J.-D. J.
Hess B.
Hess B.
Higurashi M.
Higurashi M.
Hub J. S.
Hui-Chun Lu
Humphris E. L.
Jeong H.
Jones S.
Jorgensen W.
Kabsch W.
Kar G.
Keskin O.
Keskin O.
Keskin O.
Keskin O.
Keskin O.
Kiel C.
Kim P. M.
Kim P. M.
Kim S.
Kleinjung J.
Kohn J. E.
Kortemme T.
Krissinel E.
Kuttner Y. Y.
Kuzu G.
Lange O. F.
Li X.
Liu L.
Lounnas V.
Maguid S.
Margreitter C.
Martin A. C. R.
Meireles L.
Meireles L. M. C.
Micheletti C.
Mittag T.
Münz M.
Nussinov R.
Pandini A.
Pandini A.
Pandini A.
Pandini A.
Pandini A.
Park B. H.
Patil A.
Patil A.
Peters J. H.
Petrov D.
Poirot O.
Qin H.
R-Development-Core-Team
Rajamani D.
Roulston M.
Rousseeuw P.
Schlitter J.
Schäfer H.
Seeliger D.
Sherry S. T.
Stein A.
Tsai C.-J.
Tuncbag N.
Tuncbag N.
Tyagi M.
van der Spoel D.
Van Gunsteren W.
Vogel C.
Vogel C.
Volkman B. F.
Wells J. A.
Winget J. M.
Wolfe R.
Yogurtcu O. N.
Zen A.
Zen A.
Zhang Q. C.
Zheng W.
Zhu X.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 18/10/2013
Field of study

The ability to interact with different partners is one of the most important features in proteins. Proteins that bind a large number of partners (hubs) have been often associated with intrinsic disorder. However, many examples exist of hubs with an ordered structure, and evidence of a general mechanism promoting promiscuity in ordered proteins is still elusive. An intriguing hypothesis is that promiscuous binding sites have specific dynamical properties, distinct from the rest of the interface and pre-existing in the protein isolated state. Here, we present the first comprehensive study of the intrinsic dynamics of promiscuous residues in a large protein data set. Different computational methods, from coarse-grained elastic models to geometry-based sampling methods and to full-atom Molecular Dynamics simulations, were used to generate conformational ensembles for the isolated proteins. The flexibility and dynamic correlations of interface residues with a different degree of binding promiscuity were calculated and compared considering side chain and backbone motions, the latter both on a local and on a global scale. The study revealed that (a) promiscuous residues tend to be more flexible than nonpromiscuous ones, (b) this additional flexibility has a higher degree of organization, and (c) evolutionary conservation and binding promiscuity have opposite effects on intrinsic dynamics. Findings on simulated ensembles were also validated on ensembles of experimental structures extracted from the Protein Data Bank (PDB). Additionally, the low occurrence of single nucleotide polymorphisms observed for promiscuous residues indicated a tendency to preserve binding diversity at these positions. A case study on two ubiquitin-like proteins exemplifies how binding promiscuity in evolutionary related proteins can be modulated by the fine-tuning of the interface dynamics. The interplay between promiscuity and flexibility highlighted here can inspire new directions in protein-protein interaction prediction and design methods. © 2013 American Chemical Society

Crossref

PubMed Central

King's Research Portal

Brunel University Research Archive

Knowledge-based energy functions for computational studies of proteins

Author: A. Ben-Naim
A. Godzik
A. Godzik
A. Rossi
A.J. Bordner
A.V. Finkelstein
B. Fain
B. Krishnamoorthy
B. Kuhlman
B. Schölkopf
B.H. Park
B.I. Dahiyat
B.J. McConkey
B.O. Mitchell
C. Anfinsen
C. Carter Jr.
C. Czaplewski
C. Hoppe
C. Hu
C. Micheletti
C. Papadimitriou
C. Zhang
C. Zhang
C. Zhang
C. Zhang
C. Zhang
C.A. Rohl
C.B. Anfinsen
C.M.R Lemer
C.S. Mészáros
D. Gilis
D. Gilis
D. Gilis
D. Tobi
D. Xu
E. Venclovas
E.I. Shakhnovich
E.I. Shakhnovich
F.A. Momany
H. Dobbs
H. Edelsbrunner
H. Gan
H. Li
H. Li
H. Lu
H. Zhou
H.S. Chan
I. Muegge
J. Khatun
J. Liang
J.A. Kocher
J.A. Rank
J.M. Deutsch
J.R. Bienkowska
K. Nishikawa
K. Sale
K.H. Lee
K.K. Koretke
K.K. Koretke
K.T. Simons
L. Adamian
L. Adamian
L. Adamian
L.A. Mirny
L.L. Looger
L.M. Amzel
M. Karplus
M. Levitt
M. Vendruscolo
M. Vendruscolo
M.H. Hao
M.H. Hao
M.J. Sippl
M.J. Sippl
M.J. Sippl
M.P. Eastwood
M.R. Betancourt
M.S. Friedrichs
N. Karmarkar
N.V. Buchete
N.V. Buchete
P. Koehl
P. Koehl
P.D. Thomas
P.D. Thomas
P.G. Wolynes
P.J. Munson
R. Goldstein
R. Guerois
R. Jackups Jr.
R. Janicke
R. Méndez
R. Samudrala
R. Samudrala
R.B. Hill
R.I. Dima
R.J. Vanderbei
R.K. Singh
R.L. Jernigan
R.S. DeWitte
S. Liu
S. Miyazawa
S. Miyazawa
S. Miyazawa
S. Shimizu
S. Shimizu
S. Tanaka
S.J. Wodak
T. Kortemme
T. Kortemme
T. Kortemme
T. Lazaridis
T.L. Chiu
U. Bastolla
U. Bastolla
V. Vapnik
V. Vapnik
V.N. Maiorov
W.P. Russ
X. Li
X. Li
Y. Duan
Y. Park
Y. Xia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/01/2006
Field of study

This chapter discusses theoretical framework and methods for developing knowledge-based potential functions essential for protein structure prediction, protein-protein interaction, and protein sequence design. We discuss in some details about the Miyazawa-Jernigan contact statistical potential, distance-dependent statistical potentials, as well as geometric statistical potentials. We also describe a geometric model for developing both linear and non-linear potential functions by optimization. Applications of knowledge-based potential functions in protein-decoy discrimination, in protein-protein interactions, and in protein design are then described. Several issues of knowledge-based potential functions are finally discussed.Comment: 57 pages, 6 figures. To be published in a book by Springe

arXiv.org e-Print Archive

Crossref