Search CORE

12 research outputs found

Accurate Prediction of Protein Structural Class

Author: AG Murzin
CA Orengo
CB Anfinsen
G Deleage
H Nakashima
HB Shen
I Bahar
JY Yang
JY Yang
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
KD Pruitt
L Dong
L Kurgan
L Kurgan
L Kurgan
Meng Ge
MJ Mizianty
P Baldi
RY Luo
S Costantini
S Costantini
SE Brenner
SF Altschul
SM Muska
T Liu
T Liu
TG Liu
Vladimir N. Uversky
W Li
WS Bu
X Xiao
X Xiao
Xia-Yu Xia
Xian-Ming Pan
XM Pan
Y Cai
YD Cai
YD Cai
ZC Li
Zhi-Xin Wang
ZX Wang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Because of the increasing gap between the data from sequencing and structural genomics, the accurate prediction of the structural class of a protein domain solely from the primary sequence has remained a challenging problem in structural biology. Traditional sequence-based predictors generally select several sequence features and then feed them directly into a classification program to identify the structural class. The current best sequence-based predictor achieved an overall accuracy of 74.1% when tested on a widely used, non-homologous benchmark dataset 25PDB. In the present work, we built a multiple linear regression (MLR) model to convert the 440-dimensional (440D) sequence feature vector extracted from the Position Specific Scoring Matrix (PSSM) of a protein domain to a 4-dimensinal (4D) structural feature vector, which could then be used to predict the four major structural classes. We performed 10-fold cross-validation and jackknife tests of the method on a large non-homologous dataset containing 8,244 domains distributed among the four major classes. The performance of our approach outperformed all of the existing sequence-based methods and had an overall accuracy of 83.1%, which is even higher than the results of those predicted secondary structure-based methods

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

Prediction of protein structural classes for low-homology sequences based on predicted secondary structure

Author: A Anand
A Fiser
A Murzin
C Anfinsen
C Chen
CLJ Webber
DT Jones
F Birzele
HB Shen
HJ Jeffrey
HN Lin
I Bahar
J Qi
Jian-Yi Yang
JP Eckmann
JP Zbilut
JY Yang
K Chen
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
L Kurgan
L Kurgan
LA Kurgan
M Duan
M Levitt
RO Duda
S Costantini
SF Altschul
TL Zhang
Xin Chen
Z Aydin
ZD Zhang
ZG Yu
Zhen-Ling Peng
ZX Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Prediction of protein structural classes (<it>α</it>, <it>β</it>, <it>α </it>+ <it>β </it>and <it>α</it>/<it>β</it>) from amino acid sequences is of great importance, as it is beneficial to study protein function, regulation and interactions. Many methods have been developed for high-homology protein sequences, and the prediction accuracies can achieve up to 90%. However, for low-homology sequences whose average pairwise sequence identity lies between 20% and 40%, they perform relatively poorly, yielding the prediction accuracy often below 60%. Results We propose a new method to predict protein structural classes on the basis of features extracted from the predicted secondary structures of proteins rather than directly from their amino acid sequences. It first uses PSIPRED to predict the secondary structure for each protein sequence. Then, the <it>chaos game representation </it>is employed to represent the predicted secondary structure as two time series, from which we generate a comprehensive set of 24 features using <it>recurrence quantification analysis</it>, <it>K-string based information entropy </it>and <it>segment-based analysis</it>. The resulting feature vectors are finally fed into a simple yet powerful Fisher's discriminant algorithm for the prediction of protein structural classes. We tested the proposed method on three benchmark datasets in low homology and achieved the overall prediction accuracies of 82.9%, 83.1% and 81.3%, respectively. Comparisons with ten existing methods showed that our method consistently performs better for all the tested datasets and the overall accuracy improvements range from 2.3% to 27.5%. A web server that implements the proposed method is freely available at <url>http://www1.spms.ntu.edu.sg/~chenxin/RKS_PPSC/</url>. Conclusion The high prediction accuracy achieved by our proposed method is attributed to the design of a comprehensive feature set on the predicted secondary structure sequences, which is capable of characterizing the sequence order information, local interactions of the secondary structural elements, and spacial arrangements of <it>α </it>helices and <it>β </it>strands. Thus, it is a valuable method to predict protein structural classes particularly for low-homology amino acid sequences.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DR-NTU (Digital Repository of NTU)

A Hierarchical and Scalable Strategy for Protein Structural Classification

Author: A Dalkiran
DE Pires
DL Nelson
FM Pearl
I Schomburg
I Sillitoe
J Gu
JD Tyzack
JM Chandonia
K Weinberger
KD Kedarisetti
KE Chen
L Breiman
MA Hearst
P Rogen
P Rogen
PW Rose
RC Melo
RC Melo
SA Silveira
XD Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Protein Secondary Structure Prediction Based on Data Partition and Semi-Random Subspace Method

Author: C Chang
C Fang
D Kneller
DT Jones
E Faraggi
G Wang
GD Fasman
H Bouziane
H Kim
J Garnier
J Guo
J Moult
J Moult
J Moult
J Zhou
JA Cuff
JJ Ward
K Asai
KD Kedarisetti
KJ Won
LH Holley
LJ McGuffin
M Spencer
N Qian
NK Fox
PD Yoo
PY Chou
Q Wu
R Heffernan
S Hua
S Wang
SA Malekpour
SF Altschul
TK Ho
W Kabsch
W Li
YT Tan
Z Aydin
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Clinical and neuropathological phenotype associated with the novel V189I mutation in the prion protein gene

Author: A Franceschini
A Kobayashi
A Ladogana
AR Giovagnoli
B Ghetti
C Jansen
C Mauro
DA Hall
E Bagyinszky
E Oldoni
E Tunnell
EA Stone
EV Minikel
G Giaccone
G Mackenzie
G Puoti
G Puoti
GG Kovacs
HM Schätzl
I Cali
I Zerr
IA Adzhubei
IB Kuznetsov
J Bendl
J Cheng
J Collinge
KD Kedarisetti
KJ Knaus
M Brazzelli
M Colucci
M Lek
M Mancuso
M Pocchiari
M Salvatore
M Schmitz
MO Kim
MS Palmer
N-L Sim
O Windl
P Gambetti
P Gambetti
P Parchi
P Parchi
RB Petersen
RG Will
S Capellari
SB Prusiner
V Pietrini
W Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position

Author: A Ahmadi Adl
A Andreeva
AG Murzin
AL Cuff
C Chen
C Orengo
C Zheng
DT Jones
F Birzele
HN Lin
JY Yang
K Chen
K Chou
K Chou
K Chou
K Chou
KC Chou
KD Kedarisetti
L Kurgan
L Kurgan
LA Kurgan
M Duan
M Levitt
MJ Mizianty
P Ferragina
P Klein
Pingan He
Q Dai
Q Dai
Qi Dai
RY Luo
SF Altschul
SL Zhang
SY Ding
T Liu
TL Zhang
U Hobohm
V Vapnik
XD Sun
Xiaoqing Liu
Y Cai
Y Cao
Yan Li
YS Ding
Yuhua Yao
Yunjie Cao
Z Aydin
Z Yuan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Prediction of Protein Function Improving Sequence Remote Alignment Search by a Fuzzy Logic Algorithm

Author: A Schlessinger
Antonio Gómez
Antonio Hermoso
B Rost
BE Suzek
D Devos
DT Jones
E Camon
E Jacob
Enrique Querol
ES Lander
G Yona
GP Zhou
HB Shen
HB Shen
HB Shen
HB Shen
I Friedberg
J Cedano
J Jantzen
J Kyte
J Park
Jaume Piñol
JC Venter
Jordi Espadaler
Juan Cedano
K Ginalski
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
L Menendezarias
LA Zadeh
LJ Jensen
PA Karplus
PJ Woolf
R Kato
RD King
S Hoersch
S Matsuda
S Mondal
SE Brenner
SF Altschul
SF Altschul
SW Zhang
TP Hopp
WD Tian
WR Gilks
WR Pearson
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Statistical prediction of protein structural, localization and functional properties by the analysis of its fragment mass distributions after proteolytic cleavage

Author: A Arneodo
A Arneodo
A Bunde
A Grosberg
AE Kister
AM Frank
AN Krutchinsky
B Audit
B Audit
B Rost
BI Dahiyat
C Vaillant
C Vaillant
C-K Peng
C-K Peng
D Forst
D Kozma
DT Jones
E Dudkina
EG Altmann
H Berman
H Chi
J Allmer
J Seidler
J Wang
K Chen.
KD Kedarisetti
KJ Leman
KX Wan
L Pauling
L Pauling
LA Kelley
LH Chen
LS Huang
M Biasini
M Levitt
M Sickmeier
MI Bogachev
MI Bogachev
MI Bogachev
MI Bogachev
MI Bogachev
MR Wilkins
P Mallick
P. Artimo
S Fukuchi
SV Buldyrev
T Fawcett
T Hirokawa
T Liu
TY Samgina
W Li
Y Cao
Z-X Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An ensemble of support vector machines for predicting the membrane protein type directly from the amino acid sequence

Author: Alessandra Lumini
B Niu
C Chen
C Chen
DA Doyle
DQ Liu
F Tan
G Pugalenthi
GP Zhou
GP Zhou
H Lin
H Lin
H Liu
H Lodish
HB Shen
HB Shen
HB Shen
J Cedano
J Chen
J Guo
JR Schnell
JY Shi
K Lee
K Nakai
K Nakai
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
L Nanni
L Nanni
L Nanni
L Nanni
Loris Nanni
M Wang
M Wang
N Cristianini
P Du
P Mundra
QB Gao
QB Gao
S Jahandideh
S Kawashima
S Mondal
SM Douglas
SW Zhang
TL Zhang
X Pu
X Xiao
X Xiao
X Xiao
XD Sun
Y Cao
Y Diao
Y Gao
Y Huang
YD Cai
YD Cai
YL Chen
YS Ding
YZ Guo
Z Wen
Z Yuan
ZH Zhang
Zhou
Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Comparative analysis of essential collective dynamics and NMR-derived flexibility profiles in evolutionarily diverse prion proteins

Author: Amadei A
Apetri AC
Arnold GE
Barducci A
Barducci A
Berendsen HJC
Berendsen HJC
Berjanskii M
Berjanskii MV
Berjanskii MV
Blinov N
Brown WM
Bucciantini M
Calzolai L
Calzolai L
Caughey BW
Cobb NJ
Darden T
David S Wishart
De Simone A
De Simone A
De Simone A
DeMarco L
DeMarco ML
Dima RI
Diringer H
Emberly EG
Emberly EG
Garcia AE
Garcia FL
Goldmann W
Gossert AD
Govaerts C
Gu W
Harris DA
Hartley DM
Hayward S
Hess B
Huang Z
Humphrey W
Jackson GS
Jain AK
James TL
Julien O
Kachel N
Kaneko K
Kayed R
Kedarisetti KD
Kitao A
Kolattukudy P Santo
Kurt TD
Langella E
Langella E
Lindahl E
Liu DC
Liu H
Loeffler HH
Lopez Garcia F
Lu X
Lysek DA
Lysek DA
Maria Stepanova
Mark Berjanskii
Mori H
Pan KM
Perez DR
Prusiner SB
Prusiner SB
Saborio GP
Scheraga HA
Scott WRP
Sigurdson CJ
Silveira JR
Smirnovas V
Soto C
Stepanova M
Sunde M
Sunde M
Tournier AL
Ulrich EL
Weissmann C
Wille H
Xie Zh
Yang LW
Yesylevsky SO
Zahn R
Zhong L
Publication venue: Landes Bioscience
Publication date
Field of study

Collective motions on ns-µs time scales are known to have a major impact on protein folding, stability, binding and enzymatic efficiency. It is also believed that these motions may have an important role in the early stages of prion protein misfolding and prion disease. In an effort to accurately characterize these motions and their potential influence on the misfolding and prion disease transmissibility we have conducted a combined analysis of molecular dynamic simulations and NMR-derived flexibility measurements over a diverse range of prion proteins. Using a recently developed numerical formalism, we have analyzed the essential collective dynamics (ECD) for prion proteins from eight different species including human, cow, elk, cat, hamster, chicken, turtle and frog. We also compared the numerical results with flexibility profiles generated by the random coil index (RCI) from NMR chemical shifts. Prion protein backbone flexibility derived from experimental NMR data and from theoretical computations show strong agreement with each other, demonstrating that it is possible to predict the observed RCI profiles employing the numerical ECD formalism. Interestingly, flexibility differences in the loop between second b strand (S2) and the second a helix (HB) appear to distinguish prion proteins from species that are susceptible to prion disease and those that are resistant. Our results show that the different levels of flexibility in the S2-HB loop in various species are predictable via the ECD method, indicating that ECD may be used to identify disease resistant variants of prion proteins, as well as the influence of prion proteins mutations on disease susceptibility or misfolding propensity

Crossref

PubMed Central