Search CORE

1,582 research outputs found

Analysis and Prediction of the Metabolic Stability of Proteins Based on Their Sequential Features, Subcellular Locations and Interaction Networks

Author: A Madkan
A Ruepp
Andreas Hofmann
B Niu
C Chen
C Chothia
CA Minetti
DS Wishart
FM Li
G Pollastri
G Pollastri
H Ding
H Lin
H Lin
H Peng
H Wei
HB Shen
HB Shen
HC Yen
I Dubchak
I Dubchak
J Wang
JF Wang
JF Wang
JF Wang
JF Wang
JJ Chou
JL Fauchere
JR Schnell
K Gong
K Oxenoid
Kai-Yan Feng
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
Kuo-Chen Chou
L Cristian
L Li
LeLe Hu
LJ Jensen
MM Gromiha
P Martel
P Rice
PA Fields
Ping Wang
QS Du
R Grantham
R Lumry
R Sharan
RB Huang
RM Pielak
SF Altschul
SH White
T Huang
Tao Huang
TJ Kamerzell
TL Zhang
X Xiao
Xiangyin Kong
Xiao-He Shi
Yi-Xue Li
Yu-Dong Cai
Z Qian
Zhisong He
Publication venue: Public Library of Science
Publication date: 04/06/2010
Field of study

The metabolic stability is a very important idiosyncracy of proteins that is related to their global flexibility, intramolecular fluctuations, various internal dynamic processes, as well as many marvelous biological functions. Determination of protein's metabolic stability would provide us with useful information for in-depth understanding of the dynamic action mechanisms of proteins. Although several experimental methods have been developed to measure protein's metabolic stability, they are time-consuming and more expensive. Reported in this paper is a computational method, which is featured by (1) integrating various properties of proteins, such as biochemical and physicochemical properties, subcellular locations, network properties and protein complex property, (2) using the mRMR (Maximum Relevance & Minimum Redundancy) principle and the IFS (Incremental Feature Selection) procedure to optimize the prediction engine, and (3) being able to identify proteins among the four types: “short”, “medium”, “long”, and “extra-long” half-life spans. It was revealed through our analysis that the following seven characters played major roles in determining the stability of proteins: (1) KEGG enrichment scores of the protein and its neighbors in network, (2) subcellular locations, (3) polarity, (4) amino acids composition, (5) hydrophobicity, (6) secondary structure propensity, and (7) the number of protein complexes the protein involved. It was observed that there was an intriguing correlation between the predicted metabolic stability of some proteins and the real half-life of the drugs designed to target them. These findings might provide useful insights for designing protein-stability-relevant drugs. The computational method can also be used as a large-scale tool for annotating the metabolic stability for the avalanche of protein sequences generated in the post-genomic age

Public Library of Science (PLOS)

Crossref

PubMed Central

NR-2L: A Two-Level Predictor for Identifying Nuclear Receptor Subfamilies Based on Sequence-Derived Features

Author: DJ Mangelsdorf
GP Zhou
GP Zhou
H Florence
H Mohabatkar
H Nakashima
JM Keller
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KK Kandaswamy
Kuo-Chen Chou
L Altucci
M Bhasin
M Masso
M Robinson-Rechavi
Niall James Haslam
PC Mahalanobis
Pu Wang
QB Gao
RR Joshi
SF Altschul
T Cover
T Liu
T Liu
T Wang
VD Gusev
W Li
W Liu
X Xiao
Xuan Xiao
Publication venue: Public Library of Science
Publication date
Field of study

Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. The less the number of query proteins is, the shorter the time will usually be. All the program codes for NR-2L are available for non-commercial purpose upon request

Crossref

Directory of Open Access Journals

PubMed Central

Prediction of Protein Domain with mRMR Feature Selection and Analysis

Author: AA Schaffer
AG Murzin
AK Dunker
AM Moses
AP Elhammer
B Saffari
Bi-Qing Li
Bin Xue
BQ Li
CA Orengo
D Chivian
D Li
DE Kim
E Angov
EC Mbamala
G Pugalenthi
GP Zhou
GP Zhou
H Ingolfsson
H Mohabatkar
H Peng
HB Shen
HB Shen
I Walsh
ID Campbell
IH Witten
J Chen
J Cheng
J Cheng
J Cheng
J Eickholt
J Lin
J Liu
J Liu
J Wang
JD Qiu
JE Gewehr
JJ Chou
JR Schnell
K Peng
K Shameer
K Wang
Kai-Yan Feng
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KK Kandaswamy
Kuo-Chen Chou
L Breiman
L Chen
L Holm
Le-Le Hu
Lei Chen
M Esmaeili
M Hayat
M Suyama
MJ Berardi
MK Yoon
N Nagarajan
N von Ohsen
NM Goldenberg
P Mundra
P Tompa
P Wang
PE Wright
PK Nielsen
Q Gu
R Apweiler
R Bondugula
R Guerois
R Linding
RA George
RA Poorman
S Gong
S Kawashima
S Roy
SC Jia
SF Altschul
SM Reynolds
T Ebina
T Huang
TA Holland
W Li
W Zhao
WR Atchley
WZ Lin
X Xiao
X Xiao
X Xiao
X Xiao
X Xiao
X Xiao
X Xiao
Y Zhang
YD Cai
YD Li
Yu-Dong Cai
YX Li
Z He
Z Qiu
ZC Wu
ZC Wu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The domains are the structural and functional units of proteins. With the avalanche of protein sequences generated in the postgenomic age, it is highly desired to develop effective methods for predicting the protein domains according to the sequences information alone, so as to facilitate the structure prediction of proteins and speed up their functional annotation. However, although many efforts have been made in this regard, prediction of protein domains from the sequence information still remains a challenging and elusive problem. Here, a new method was developed by combing the techniques of RF (random forest), mRMR (maximum relevance minimum redundancy), and IFS (incremental feature selection), as well as by incorporating the features of physicochemical and biochemical properties, sequence conservation, residual disorder, secondary structure, and solvent accessibility. The overall success rate achieved by the new method on an independent dataset was around 73%, which was about 28–40% higher than those by the existing method on the same benchmark dataset. Furthermore, it was revealed by an in-depth analysis that the features of evolution, codon diversity, electrostatic charge, and disorder played more important roles than the others in predicting protein domains, quite consistent with experimental observations. It is anticipated that the new method may become a high-throughput tool in annotating protein domains, or may, at the very least, play a complementary role to the existing domain prediction methods, and that the findings about the key features with high impacts to the domain prediction might provide useful insights or clues for further experimental investigations in this area. Finally, it has not escaped our notice that the current approach can also be utilized to study protein signal peptides, B-cell epitopes, HIV protease cleavage sites, among many other important topics in protein science and biomedicine

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Prediction of protein structural classes for low-homology sequences based on predicted secondary structure

Author: A Anand
A Fiser
A Murzin
C Anfinsen
C Chen
CLJ Webber
DT Jones
F Birzele
HB Shen
HJ Jeffrey
HN Lin
I Bahar
J Qi
Jian-Yi Yang
JP Eckmann
JP Zbilut
JY Yang
K Chen
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
L Kurgan
L Kurgan
LA Kurgan
M Duan
M Levitt
RO Duda
S Costantini
SF Altschul
TL Zhang
Xin Chen
Z Aydin
ZD Zhang
ZG Yu
Zhen-Ling Peng
ZX Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Prediction of protein structural classes (<it>α</it>, <it>β</it>, <it>α </it>+ <it>β </it>and <it>α</it>/<it>β</it>) from amino acid sequences is of great importance, as it is beneficial to study protein function, regulation and interactions. Many methods have been developed for high-homology protein sequences, and the prediction accuracies can achieve up to 90%. However, for low-homology sequences whose average pairwise sequence identity lies between 20% and 40%, they perform relatively poorly, yielding the prediction accuracy often below 60%. Results We propose a new method to predict protein structural classes on the basis of features extracted from the predicted secondary structures of proteins rather than directly from their amino acid sequences. It first uses PSIPRED to predict the secondary structure for each protein sequence. Then, the <it>chaos game representation </it>is employed to represent the predicted secondary structure as two time series, from which we generate a comprehensive set of 24 features using <it>recurrence quantification analysis</it>, <it>K-string based information entropy </it>and <it>segment-based analysis</it>. The resulting feature vectors are finally fed into a simple yet powerful Fisher's discriminant algorithm for the prediction of protein structural classes. We tested the proposed method on three benchmark datasets in low homology and achieved the overall prediction accuracies of 82.9%, 83.1% and 81.3%, respectively. Comparisons with ten existing methods showed that our method consistently performs better for all the tested datasets and the overall accuracy improvements range from 2.3% to 27.5%. A web server that implements the proposed method is freely available at <url>http://www1.spms.ntu.edu.sg/~chenxin/RKS_PPSC/</url>. Conclusion The high prediction accuracy achieved by our proposed method is attributed to the design of a comprehensive feature set on the predicted secondary structure sequences, which is capable of characterizing the sequence order information, local interactions of the secondary structural elements, and spacial arrangements of <it>α </it>helices and <it>β </it>strands. Thus, it is a valuable method to predict protein structural classes particularly for low-homology amino acid sequences.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DR-NTU (Digital Repository of NTU)

Predicting Transcriptional Activity of Multiple Site p53 Mutants Based on Hybrid Properties

Author: A Efeyan
AC Martin
AP Bom
B Ma
CW Lee
DP Lane
G Bossi
H Mohabatkar
H Peng
IK Jordan
JM Smith
JP Qi
K Peng
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KK Kandaswamy
Kuo-Chen Chou
L Meng
M Hayat
M Oren
MS Greenblatt
P Baldi
P Wang
P Zakeri
Q Gu
R Grantham
R Rainwater
Reiner Albert Veitia
RR Joshi
S Kato
S Kawashima
S Niu
SA Danziger
SA Danziger
SA Danziger
SF Altschul
Shen Niu
T Huang
T Huang
T Huang
T Huang
Tao Huang
UK Mukhopadhyay
WR Atchley
XB Zhou
Xiangyin Kong
Y Cai
YD Cai
Yu-Dong Cai
Yun Huang
Z Qian
Z Yang
Zhongping Xu
Publication venue: Public Library of Science
Publication date: 08/08/2011
Field of study

As an important tumor suppressor protein, reactivate mutated p53 was found in many kinds of human cancers and that restoring active p53 would lead to tumor regression. In this work, we developed a new computational method to predict the transcriptional activity for one-, two-, three- and four-site p53 mutants, respectively. With the approach from the general form of pseudo amino acid composition, we used eight types of features to represent the mutation and then selected the optimal prediction features based on the maximum relevance, minimum redundancy, and incremental feature selection methods. The Mathew's correlation coefficients (MCC) obtained by using nearest neighbor algorithm and jackknife cross validation for one-, two-, three- and four-site p53 mutants were 0.678, 0.314, 0.705, and 0.907, respectively. It was revealed by the further optimal feature set analysis that the 2D (two-dimensional) structure features composed the largest part of the optimal feature set and maybe played the most important roles in all four types of p53 mutant active status prediction. It was also demonstrated by the optimal feature sets, especially those at the top level, that the 3D structure features, conservation, physicochemical and biochemical properties of amino acid near the mutation site, also played quite important roles for p53 mutant active status prediction. Our study has provided a new and promising approach for finding functionally important sites and the relevant features for in-depth study of p53 protein and its action mechanism

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Accurate Prediction of Protein Structural Class

Author: AG Murzin
CA Orengo
CB Anfinsen
G Deleage
H Nakashima
HB Shen
I Bahar
JY Yang
JY Yang
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KD Kedarisetti
KD Pruitt
L Dong
L Kurgan
L Kurgan
L Kurgan
Meng Ge
MJ Mizianty
P Baldi
RY Luo
S Costantini
S Costantini
SE Brenner
SF Altschul
SM Muska
T Liu
T Liu
TG Liu
Vladimir N. Uversky
W Li
WS Bu
X Xiao
X Xiao
Xia-Yu Xia
Xian-Ming Pan
XM Pan
Y Cai
YD Cai
YD Cai
ZC Li
Zhi-Xin Wang
ZX Wang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Because of the increasing gap between the data from sequencing and structural genomics, the accurate prediction of the structural class of a protein domain solely from the primary sequence has remained a challenging problem in structural biology. Traditional sequence-based predictors generally select several sequence features and then feed them directly into a classification program to identify the structural class. The current best sequence-based predictor achieved an overall accuracy of 74.1% when tested on a widely used, non-homologous benchmark dataset 25PDB. In the present work, we built a multiple linear regression (MLR) model to convert the 440-dimensional (440D) sequence feature vector extracted from the Position Specific Scoring Matrix (PSSM) of a protein domain to a 4-dimensinal (4D) structural feature vector, which could then be used to predict the four major structural classes. We performed 10-fold cross-validation and jackknife tests of the method on a large non-homologous dataset containing 8,244 domains distributed among the four major classes. The performance of our approach outperformed all of the existing sequence-based methods and had an overall accuracy of 83.1%, which is even higher than the results of those predicted secondary structure-based methods

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Recommendations for a core outcome set for measuring standing balance in adult populations: a consensus-based approach

Author: A Shumway-Cook
A Shumway-Cook
AA Qutubuddin
AH Newstead
AL Leddy
AL Leddy
Antony Bayer
Brian E. Maki
CF Dillon
CH Wang
CS Tsang
CY Chou
D Donoghue
D Podsiadlo
D Tran
DC Bland
Debra J. Rose
DH Saunders
DJ Rose
DL Sturnieks
DT Felson
F Franchignoni
F Franchignoni
F La Porta
FB Horak
FB Horak
HF Mao
JF Lemay
JM Guralnik
JR Basford
K Berg
K Berg
K Fitch
Kathryn M. Sibley
KJ Brusse
KM Sibley
KM Sibley
KO Berg
KO Berg
LA Beaupre
LA King
LD Gillespie
Liza Stathokostas
LK Boulgarides
M Bergstrom
M Boers
M Conradsson
M Godi
M McGlynn
M Wirz
MA Holbein-Jenny
ME McNeely
ME Tinetti
ME Tinetti
MK Murphy
N Lofgren
P Jogi
PL Scalzo
PR Williamson
R Haas
R Orr
RA Liston
RP Duncan
S O'Hoski
S Whitney
S Wood-Dauphinee
Sarah E. Lamb
SE Lamb
SF Tyson
SF Tyson
Sharon E. Straus
Stephen R. Lord
Susan B. Jaglal
T Crocker
T Steffen
TE Howe
TE Howe
TJ Stevenson
Tracey Howe
UB Flansbjer
V Hiengkaew
Vicky Scott
YC Learmonth
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 13/03/2015
Field of study

Standing balance is imperative for mobility and avoiding falls. Use of an excessive number of standing balance measures has limited the synthesis of balance intervention data and hampered consistent clinical practice.To develop recommendations for a core outcome set (COS) of standing balance measures for research and practice among adults.A combination of scoping reviews, literature appraisal, anonymous voting and face-to-face meetings with fourteen invited experts from a range of disciplines with international recognition in balance measurement and falls prevention. Consensus was sought over three rounds using pre-established criteria.The scoping review identified 56 existing standing balance measures validated in adult populations with evidence of use in the past five years, and these were considered for inclusion in the COS.Fifteen measures were excluded after the first round of scoring and a further 36 after round two. Five measures were considered in round three. Two measures reached consensus for recommendation, and the expert panel recommended that at a minimum, either the Berg Balance Scale or Mini Balance Evaluation Systems Test be used when measuring standing balance in adult populations.Inclusion of two measures in the COS may increase the feasibility of potential uptake, but poses challenges for data synthesis. Adoption of the standing balance COS does not constitute a comprehensive balance assessment for any population, and users should include additional validated measures as appropriate.The absence of a gold standard for measuring standing balance has contributed to the proliferation of outcome measures. These recommendations represent an important first step towards greater standardization in the assessment and measurement of this critical skill and will inform clinical research and practice internationally

University of Toronto Research Repository

Crossref

Directory of Open Access Journals

PRED_PPI: a server for predicting protein-protein interactions based on sequence data with probability assignment

Author: C von Mering
D Juan
Gongbin Li
I Xenarios
JD Han
Juan Li
JW Shen
KC Chou
L Burger
M Singhal
Menglong Li
S Peri
SF Altschul
TF Wu
Wenjia Xiong
Xuanmin Guang
Xuemei Pu
Yanzhi Guo
YZ Guo
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. Given the importance of PPIs, several methods have been developed to detect them. Since the experimental methods are time-consuming and expensive, developing computational methods for effectively identifying PPIs is of great practical significance. Findings Most previous methods were developed for predicting PPIs in only one species, and do not account for probability estimations. In this work, a relatively comprehensive prediction system was developed, based on a support vector machine (SVM), for predicting PPIs in five organisms, specifically humans, yeast, <it>Drosophila</it>, <it>Escherichia coli</it>, and <it>Caenorhabditis elegans</it>. This PPI predictor includes the probability of its prediction in the output, so it can be used to assess the confidence of each SVM prediction by the probability assignment. Using a probability of 0.5 as the threshold for assigning class labels, the method had an average accuracy for detecting protein interactions of 90.67% for humans, 88.99% for yeast, 90.09% for <it>Drosophila</it>, 92.73% for <it>E. coli</it>, and 97.51% for <it>C. elegans</it>. Moreover, among the correctly predicted pairs, more than 80% were predicted with a high probability of ≥0.8, indicating that this tool could predict novel PPIs with high confidence. Conclusions Based on this work, a web-based system, Pred_PPI, was constructed for predicting PPIs from the five organisms. Users can predict novel PPIs and obtain a probability value about the prediction using this tool. Pred_PPI is freely available at <url>http://cic.scu.edu.cn/bioinformatics/predict_ppi/default.html</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Systematic review of antiepileptic drugs’ safety and effectiveness in feline epilepsy

Author: A Klang
A Pakozdy
A Pakozdy
A Pakozdy
A Pellegrini
AD Quesnel
Akos Pakozdy
AM Wahle
AS Sawchuk
C Bertolani
CR Hooijmans
CW Dewey
D Hasegawa
D Hughes
D Moher
D Schwartz-Porsche
D Schwartz-Porsche
DB Roye
DE Cuff
DL Zoran
DM Boothe
DM Brewer
G Zaccara
G Zaccara
GE Solomon
GH Guyatt
H Volk
Holger A. Volk
JA Berlin
JA Wada
JAD Gasper
JM Ducote
KE Finnerty
KS Bailey
KS Bailey
LR Barnard
M Charalambous
M Charalambous
M Lowrie
M Podell
MA Cautela
MA Holmes
Marios Charalambous
MB Carnes
MJ Baho
NM van Gelder
O Engel
P Boydell
PN Papanikolaou
R Chou
S Cizinauskas
S Schriefl
S Wagner
SA Center
SD Ross
SF Bhatti
SM Cochrane
SM Cochrane
Sofie F. M. Bhatti
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Understanding the efficacy and safety profile of antiepileptic drugs (AEDs) in feline epilepsy is a crucial consideration for managing this important brain disease. However, there is a lack of information about the treatment of feline epilepsy and therefore a systematic review was constructed to assess current evidence for the AEDs’ efficacy and tolerability in cats. The methods and materials of our former systematic reviews in canine epilepsy were mostly mirrored for the current systematic review in cats. Databases of PubMed, CAB Direct and Google scholar were searched to detect peer-reviewed studies reporting efficacy and/or adverse effects of AEDs in cats. The studies were assessed with regards to their quality of evidence, i.e. study design, study population, diagnostic criteria and overall risk of bias and the outcome measures reported, i.e. prevalence and 95% confidence interval of the successful and affected population in each study and in total

Crossref

Ghent University Academic Bibliography

Directory of Open Access Journals

ESTuber db: an online database for Tuber borchii EST sequences

Author: A Gattiker
Alessandra Stella
Andrea Caprera
Angelo Viotti
B Ewing
B Grimaldi
B Lazzari
B Montanini
B Viard
Barbara Lazzari
Cristian Cosentino
E Barbieri
G Benson
H-H Chou
I Lacourt
L Falquet
Luciano Milanesi
R Percudani
S Gabella
SF Altschul
The Gene Ontology Consortium
X Huan
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Crossref

Springer - Publisher Connector

PubMed Central