Search CORE

6 research outputs found

A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition

Author: A Bundi
A Chinnasamy
A Dehzangi
A Dehzangi
A Dehzangi
A Dehzangi
A Sharma
A Sharma
A Sharma
A Sharma
A Sharma
AA Schaffer
Abdollah Dehzangi
Alok Sharma
AW Burgess
C Ding
D Bouchaffra
D Eisenberg
DM Dawson
G Khanarian
H Cid
H Zhang
HB Shen
HR Guy
I Dubchak
IH Witten
IK Valavanis
J Janin
James Lyons
JL Fauchere
JM Zimmerman
JO Hutchens
JT Huang
K Chen
K Kavousi
KC Chou
Kuldip K Paliwal
L Kurgan
L Liu
LA Kurgan
M Charton
M Charton
M Gromiha
M Levitt
MJ Geisow
MO Dayhoff
MO Dayhoff
P Argos
P Deschavanne
P Ghanty
P Klein
PY Chou
Q Dong
R Grantham
R Najmanovich
S Kawashima
Satoru Miyano
Seiya Imoto
T Liu
T Yang
TH Cormen
TL Zhang
V Kecman
W Chmielnicki
Y Krishnaraj
Y Ying
Y-h Taguchi
YS Ding
ZC Li
ZZ Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Protein fold recognition using genetic algorithm optimized voting scheme and profile bigram

Author: Dehzangi Abdollah
Imoto S.
Lal Sunil P.
Raicar Gaurav
Saini Harsh
Sharma Alokanand
Publication venue: JSW
Publication date: 01/01/2016
Field of study

In biology, identifying the tertiary structure of a protein helps determine its functions. A step towards tertiary structure identification is predicting a protein’s fold. Computational methods have been applied to determine a protein’s fold by assembling information from its structural, physicochemical and/or evolutionary properties. It has been shown that evolutionary information helps improve prediction accuracy. In this study, a scheme is proposed that uses the genetic algorithm (GA) to optimize a weighted voting scheme to improve protein fold recognition. This scheme incorporates k-separated bigram transition probabilities for feature extraction, which are based on the Position Specific Scoring Matrix (PSSM). A set of SVM classifiers are used for initial classification, whereupon their predictions are consolidated using the optimized weighted voting scheme. This scheme has been demonstrated on the Ding and Dubchak (DD), Extended Ding and Dubchak (EDD) and Taguchi and Gromhia (TG) datasets benchmarked data sets

University of the South Pacific Electronic Research Repository

Predicting MoRFs in protein sequences using HMM profiles

Author
Publication venue: BioMed Central
Publication date
Field of study

Springer - Publisher Connector

A Tri-Gram Based Feature Extraction Technique Using Linear Probabilities of Position Specific Scoring Matrix for Protein Fold Recognition

Author: Abdollah Dehzangi
Alok Sharma
James Lyons
Kuldip K. Paliwal
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

A mixture of physicochemical and evolutionary–based feature extraction approaches for protein fold recognition

Author: Dehzangi A.
Lyons J.
Paliwal K.K.
Sattar A.
Sharma Alokanand
Publication venue: 'Inderscience Publishers'
Publication date: 01/01/2015
Field of study

Griffith Sciences, Griffith School of EngineeringFull Tex

University of the South Pacific Electronic Research Repository

Predicting Protein Contact Map By Bagging Decision Trees

Author: Ren Chuqiao
Publication venue: Bucknell Digital Commons
Publication date: 07/05/2015
Field of study

Proteins\u27 function and structure are intrinsically related. In order to understand proteins\u27 functionality, it is essential for medical and biological researchers to deter- mine proteins\u27 three-dimensional structure. The traditional method using NMR spectroscopy or X-ray crystallography are inefficient compared to computational methods. Fortunately, substantial progress has been made in the prediction of protein structure in bioinformatics. Despite these achievements, the computational complexity of protein folding remains a challenge. Instead, many methods aim to predict a protein contact map from protein sequence using machine learning algorithms. In this thesis, we introduce a novel ensemble method for protein contact map prediction based on bagging multiple decision trees. A random sampling method is used to address the large class imbalance in contact maps. To generalize the feature space, we further clustered the amino acid alphabet from twenty to ten. A software is also developed to view protein contact map at certain threshold and separation. The parameters used in decision trees are determined experimentally, and the overall results for the first L, L/2 and L/5 predictions for protein of length L are evaluated

Bucknell University