Archivio della Ricerca - Università di Salerno

Nature Precedings

KineticDB: a database of protein folding kinetics

Author: A. A. Osypov
Alm
D. N. Ivankov
Galzitskaya
Galzitskaya
Galzitskaya
Garbuzynskiy
Gong
Gromiha
Ivankov
Jackson
Jiang
Ma
Makarov
Munoz
Murzin
N. S. Bogatyreva
Naganathan
Plaxco
Qian
Publication venue: Oxford University Press

We present KineticDB, a systematically compiled database of protein folding kinetics. The main goal of KineticDB is to provide users with a diverse set of experimentally determined protein folding rates. The search for determinants of protein folding is still in progress, aimed at obtaining a new understanding of the folding process. Comparison with experimental protein folding rates has been the main tool for validating both theoretical models and empirical relationships during the last 10 years. It is therefore necessary to provide researchers with as much data as possible in a simple and easy-to-use way. At present, KineticDB contains the results of folding kinetics measurements of single-domain proteins and separate protein domains, as well as short peptides without disulfide bonds. It includes data on about 90 unique proteins and many mutants that have been systematically accumulated over the last 10 years, and it is the largest collection of protein folding kinetic data presented as a database. KineticDB is available at http://kineticdb.protres.ru/db/index.pl

Prediction of Amyloidogenic and Disordered Regions in Protein Chains

Author: Azriel
Bairoch
Bemporad
Bodles
Bucciantini
Chamberlain
Chiti
Dosztanyi
Dunker
Dunker
Dyson
Dyson
Esteras-Chopo
Fandrich
Fauchere
Fernandez
Fink
Frare
Galzitskaya
Galzitskaya
Galzitskaya
Garbuzynskiy
Gazit
Gsponer
Guijarro
Hamidi
Haspel
Ivanova
Jaroniec
Jaroniec
Jimenez
Jones
Khare
Kozhukh
Krebs
Linding
Lopez de la Paz
MacPhee
Makin
Maries
Mazor
Michail Yurievich Lobanov
Minor
Morrissey
Murzin
Nelson
Obradovic
Ohnishi
Oxana V. Galzitskaya
Patel
Pedersen
Radivojac
Reches
Romero
Romero
Romero
Rousseau
Rudall
Sergiy O. Garbuzynskiy
Speare
Tartaglia
Tartaglia
Thompson
Thompson
Tjernberg
Torok
Torrent
Tracz
Trovato
Ueda
Uversky
Uversky
Uversky
Von Bergen
Vucetic
Vucetic
Ward
Wootton
Wright
Yamamoto
Yamin
Yoon
Publication venue: Public Library of Science
Publication date: 01/01/2005

The determination of factors that influence protein conformational changes is very important for the identification of potentially amyloidogenic and disordered regions in polypeptide chains. In our work we introduce a new parameter, mean packing density, to detect both amyloidogenic and disordered regions in a protein sequence. It has been shown that regions with strong expected packing density are responsible for amyloid formation. Our predictions are consistent with known disease-related amyloidogenic regions for eight of 12 amyloid-forming proteins and peptides in which the positions of amyloidogenic regions have been revealed experimentally. Our findings support the concept that the mechanism of amyloid fibril formation is similar for different peptides and proteins. Moreover, we have demonstrated that regions with weak expected packing density are responsible for the appearance of disordered regions. Our method has been tested on datasets of globular proteins and long disordered protein segments, and it shows improved performance over other widely used methods. Thus, we demonstrate that the expected packing density is a useful value with which one can predict both intrinsically disordered and amyloidogenic regions of a protein based on sequence alone. Our results are important for understanding the structural characteristics of protein folding and misfolding
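The window-averaging idea in this abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the per-residue scale, window size, threshold, and minimum region length are hypothetical placeholders, and the function names are invented for this example.

```python
from typing import Dict, List, Tuple


def window_means(seq: str, scale: Dict[str, float], window: int) -> List[float]:
    """Average a per-residue scale over a centred sliding window.

    Windows are truncated at the chain ends, so terminal residues
    are averaged over fewer positions.
    """
    values = [scale[aa] for aa in seq]
    half = window // 2
    profile = []
    for i in range(len(values)):
        lo, hi = max(0, i - half), min(len(values), i + half + 1)
        profile.append(sum(values[lo:hi]) / (hi - lo))
    return profile


def regions_above(profile: List[float], threshold: float,
                  min_len: int) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges where the profile
    stays at or above the threshold for at least min_len residues."""
    regions = []
    start = None
    for i, value in enumerate(profile):
        if value >= threshold:
            if start is None:
                start = i
        else:
            if start is not None and i - start >= min_len:
                regions.append((start, i))
            start = None
    if start is not None and len(profile) - start >= min_len:
        regions.append((start, len(profile)))
    return regions


# Hypothetical two-letter toy scale, NOT real packing-density values.
toy_scale = {"A": 1.0, "G": 0.0}
profile = window_means("GGGAAAAAGGG", toy_scale, window=3)
print(regions_above(profile, threshold=0.6, min_len=3))
```

Under the same assumptions, regions with a low windowed value (candidate disordered segments) could be found by running the same scan on the negated profile with a negated threshold.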

CiteSeerX

Public Library of Science (PLOS)

Prediction of peptide and protein propensity for amyloid formation

Author: A Quintas
A Trovato
A Trovato
AC Davison
AC Tsolis
Alexandre Quintas
AM Fernandez-Escamilla
AP Pawar
AV Finkelstein
B Rost
C Nerelius
Carlos Família
CM Dobson
D Eisenberg
David A. Phoenix
DJ Selkoe
DM Fowler
Eugene A. Permyakov
F Chiti
F Chiti
F Sasagawa
GG Tartaglia
GG Tartaglia
H Hu
I Cherny
I Walsh
IV Baskakov
J Palau
J Tian
JC Rochet
JD Sipe
JM Zimmerman
JW Kelly
JW Kelly
K Rajagopal
KF DuBay
KK Frousios
KT O’Neil
L Goldschmidt
LO Jimenez
M Belli
M Emily
M Hollander
M Kuhn
M López de la Paz
M Oliveberg
M Stefani
M Sunde
M Sunde
M Zamani
MB Kursa
MJ Thompson
MT Pastor
N Becker
N Qian
O Conchillo-Solé
PK Teng
PY Chou
RS Harrison
S Idicula-thomas
S Kawashima
S Kawashima
S Maurer-Stroh
S Ventura
S Yoon
S Yoon
Sarah R. Dennison
SJ Hamodrakas
SJ Hamodrakas
SK Maji
SO Garbuzynskiy
T Hothorn
T Hothorn
T Hothorn
T Scheibel
TPJ Knowles
VS Mathura
WH DePas
WT Astbury
Y Kallberg
Ž Eva
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 09/07/2014

Understanding which peptides and proteins have the potential to undergo amyloid formation and what driving forces are responsible for amyloid-like fiber formation and stabilization remains limited. This is mainly because proteins that can undergo structural changes leading to amyloid formation are quite diverse and share no obvious sequence or structural homology, despite the structural similarity found in the fibrils. To address these issues, a novel approach based on recursive feature selection and feed-forward neural networks was undertaken to identify key features highly correlated with the self-assembly problem. This approach allowed the identification of seven physicochemical and biochemical properties of the amino acids highly associated with the self-assembly of peptides and proteins into amyloid-like fibrils (normalized frequency of β-sheet, normalized frequency of β-sheet from LG, weights for β-sheet at the window position of 1, isoelectric point, atom-based hydrophobic moment, helix termination parameter at position j+1 and ΔG° values for peptides extrapolated to 0 M urea). Moreover, these features enabled the development of a new predictor (available at http://cran.r-project.org/web/packages/appnn/index.html) capable of accurately and reliably predicting the amyloidogenic propensity from the polypeptide sequence alone, with a prediction accuracy of 84.9% against an external validation dataset of sequences with experimental in vitro evidence of amyloid formation

Public Library of Science (PLOS)

FigShare

On Side-Chain Conformational Entropy of Proteins

Author: Abagyan
Banerjee
Berezovsky
Brady
Bromberg
Canutescu
Clore
Cole
Creamer
Crick
Dill
Doig
Dunbrack
Dunbrack
Eisenmesser
Garbuzynskiy
Gray
Gronenborn
Huang
Janin
Jinfeng Zhang
Jun S. Liu
Karplus
Koehl
Kussell
Liang
Liang
Lindorff-Larsen
Liu
Lovell
Ma
Misura
Mitchell
Pickett
Pokarowski
Reinhardt
Richards
Rosenbluth
Samudrala
Schafer
Shortle
Socolich
Tseng
Vajda
Vasquez
Wang
Yu
Zhang
Zhang
Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2006

The role of side-chain entropy (SCE) in protein folding has long been a subject of speculation but is still not fully understood. Utilizing a newly developed Monte Carlo method, we conducted a systematic investigation of how the SCE relates to the size of the protein and how it differs among a protein's X-ray, NMR, and decoy structures. We estimated the SCE for a set of 675 nonhomologous proteins and observed that there is a significant SCE for both exposed and buried residues in all of these proteins: the contribution of buried residues approaches ∼40% of the overall SCE. Furthermore, the SCE can be quite different for structures with similar compactness or even similar conformations. As a striking example, we found that proteins' X-ray structures appear to pack more “cleverly” than their NMR or decoy counterparts in the sense of retaining higher SCE while achieving comparable compactness, which suggests that the SCE plays an important role in favouring native protein structures. By including an SCE term in a simple free energy function, we can significantly improve the discrimination of native protein structures from decoys

ComSin: database of protein structures in bound (complex) and unbound (single) states in relation to their intrinsic disorder

Author: Altschul
Anna R. Panchenko
Bateman
Benjamin A. Shoemaker
Berman
Fong
He
Huber
Jessica H. Fong
Kabsch
Krissinel
Letunic
Linding
Loh
Marchler-Bauer
Marchler-Bauer
Meereis
Meszaros
Michail Yu. Lobanov
Mohan
Olejniczak
Oxana V. Galzitskaya
Romero
Sergiy O. Garbuzynskiy
Shoemaker
Shoemaker
Shoemaker
Sickmeier
Sigalov
Stivers
Tompa
Tompa
Uversky
Wang
Wright
Xie
Zidek
Publication venue: Oxford University Press

Most of the proteins in a cell assemble into complexes to carry out their function. In this work, we have created a new database (named ComSin) of protein structures in bound (complex) and unbound (single) states to provide researchers with exhaustive information on structures of the same or homologous proteins in bound and unbound states. From the complete Protein Data Bank (PDB), we selected 24,910 pairs of protein structures in bound and unbound states, and identified regions of intrinsic disorder. For 2448 pairs, the proteins in bound and unbound states are identical, while 7129 pairs have a sequence identity of 90% or higher. The server enables one to search for proteins in bound and unbound states with several options, including sequence similarity between the corresponding proteins in bound and unbound states and validation of interaction interfaces of protein complexes. In addition, through the web server one can obtain the information needed to study disorder-to-order and order-to-disorder transitions upon complex formation, and analyze structural differences between proteins in bound and unbound states. The database is available at http://antares.protres.ru/comsin/

Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic

Author: A Caflisch
AE Eiben
I Levner
J Cui
J Han
J Tian
KE Marshall
KK Frousios
KS Hareesha
L Goldschmidt
M López de la Paz
MJ Thompson
NV Subba Reddy
O Conchillo-Solé
OV Galzitskaya
P Han
P Moscato
S Bandyopadhyay
S Kawashima
Smitha Sunil Kumaran Nair
SO Garbuzynskiy
SSK Nair
SSK Nair
VS Mathura
WL Huang
Y Peng
Y Saeys
Z Zhang
Z Zhu
ZR Li
Publication venue: BioMed Central
Publication date: 01/01/2011

Springer - Publisher Connector

IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding

Author: Berman
Borgia
Bálint Mészáros
Disfani
Dosztányi
Dosztányi
Dosztányi
Dosztányi
Dosztányi
Dosztányi
Egan
Erdős
Fang
Ferreon
Fichó
Finn
Finn
Fraga
Garbuzynskiy
Gibson
Gontero
Gábor Erdős
He
Hornbeck
Jakob
Jones
Lobanov
Lowe
Malhis
Malhis
Meng
Meng
Miskei
Mészáros
Mészáros
Necci
Necci
Pace
Peng
Peng
Peng
Piovesan
Piovesan
Reichmann
Reichmann
Reichmann
Schad
Thomas
Tremblay
Vacic
van der Lee
Van Roey
Walsh
Ward
Wright
Xue
Yan
Zou
Zsuzsanna Dosztányi
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

The structural states of proteins include ordered globular domains as well as intrinsically disordered protein regions (IDRs) that exist as highly flexible conformational ensembles in isolation. Various computational tools have been developed to discriminate ordered and disordered segments based on the amino acid sequence. However, properties of IDRs can also depend on various conditions, including binding to globular protein partners or environmental factors, such as redox potential. These cases provide further challenges for the computational characterization of disordered segments. In this work we present IUPred2A, a combined web interface that allows users to generate energy-estimation-based predictions for ordered and disordered residues with IUPred2 and for disordered binding regions with ANCHOR2. The updated web server retains the robustness of the original programs but offers several new features. While only minor bug fixes are implemented for IUPred, the next version of ANCHOR is significantly improved through a new architecture and parameters optimized on novel datasets. In addition, redox-sensitive regions can also be highlighted through a novel experimental feature

ELTE Digital Institutional Repository (EDIT)

Predicting mostly disordered proteins by using structure-unknown protein data

Author: AK Dunker
AK Dunker
AK Dunker
AL Fink
CJ Oldfield
DT Jones
E Garner
EA Weathers
HJ Dyson
J Prilusky
JJ Ward
JJ Ward
JW Chen
Kana Shimizu
Kentaro Tomii
LM Iakoucheva
MJ Zvelebil
NS Bogatyreva
P Romero
P Tompa
P Tompa
PE Wright
R Apweiler
R Linding
R Linding
S Vucetic
S Vucetic
Shuichi Hirose
SO Garbuzynskiy
T Joachims
Tamotsu Noguchi
V Receveur-Brechot
VN Uversky
VN Uversky
VN Uversky
X Li
Y Minezaki
Yoichi Muraoka
Z Dosztanyi
Z Obradovic
ZR Yang
Publication venue: BioMed Central
Publication date: 01/03/2007

BACKGROUND: Predicting intrinsically disordered proteins is important in structural biology because they are thought to carry out various cellular functions even though they have no stable three-dimensional structure. We know the structures of far more ordered proteins than disordered proteins. The structural distribution of proteins in nature can therefore be inferred to differ from that of proteins whose structures have been determined experimentally. We know many more protein sequences than we do protein structures, and many of the known sequences can be expected to be those of disordered proteins. Thus it would be efficient to use the information of structure-unknown proteins in order to avoid training data sparseness. We propose a novel method for predicting which proteins are mostly disordered by using a spectral graph transducer (SGT) and training with a large number of structure-unknown sequences as well as structure-known sequences. RESULTS: When the proposed method was evaluated on data that included 82 disordered proteins and 526 ordered proteins, its sensitivity was 0.723 and its specificity was 0.977. It resulted in a Matthews correlation coefficient 0.202 points higher than that obtained using FoldIndex, 0.221 points higher than that obtained using the method based on plotting hydrophobicity against the number of contacts and 0.07 points higher than that obtained using support vector machines (SVMs). To examine robustness against training data sparseness, we investigated the correlation between two results obtained when the method was trained on different datasets and tested on the same dataset. The correlation coefficient for the proposed method is 0.14 higher than that for the method using SVMs. When the proposed SGT-based method was compared with four per-residue predictors (VL3, GlobPlot, DISOPRED2 and IUPred (long)), its sensitivity was 0.834 for disordered proteins, which is 0.052–0.523 higher than that of the per-residue predictors, and its specificity was 0.991 for ordered proteins, which is 0.036–0.153 higher than that of the per-residue predictors. The proposed method was also evaluated on data that included 417 partially disordered proteins. It predicted the frequency of disordered proteins to be 1.95% for the proteins with 5%–10% disordered sequences, 1.46% for the proteins with 10%–20% disordered sequences and 16.57% for proteins with 20%–40% disordered sequences. CONCLUSION: The proposed method, which utilizes the information of structure-unknown data, predicts disordered proteins more accurately than other methods and is less affected by training data sparseness
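The evaluation measures named in this abstract (sensitivity, specificity, and the Matthews correlation coefficient) can be computed from a confusion matrix roughly as sketched below. The function name is invented for this example. The class sizes (82 disordered, 526 ordered) come from the abstract, but the split into correct and incorrect predictions is a made-up illustration and not data from the paper.

```python
import math
from typing import Dict


def confusion_metrics(tp: int, fn: int, tn: int, fp: int) -> Dict[str, float]:
    """Compute sensitivity, specificity, and the Matthews correlation
    coefficient (MCC) from confusion-matrix counts.

    Here "positive" means "predicted or labelled as disordered".
    """
    sensitivity = tp / (tp + fn)  # fraction of disordered proteins recovered
    specificity = tn / (tn + fp)  # fraction of ordered proteins recovered
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, MCC is set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"sensitivity": sensitivity, "specificity": specificity, "mcc": mcc}


# Illustrative counts only: 82 disordered and 526 ordered proteins,
# with a hypothetical number of correct calls in each class.
metrics = confusion_metrics(tp=60, fn=22, tn=514, fp=12)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

MCC is useful here because the two classes are unbalanced: a predictor that labels every protein as ordered would reach a high accuracy on such data, yet its MCC would be 0.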

Springer - Publisher Connector