Search CORE

51 research outputs found

Application of bioinformatics in diagnosis of white spot syndrome virus

Author: Mohabatkar H.
Publication venue
Publication date: 01/01/2005
Field of study

White spot syndrome is one of the major problems in shrimp culture worldwide. There are different techniques like Dot blotting, PCR and using monoclonal antibodies for diagnosis of White Spot Syndrome Virus (WSSV). in the latter method, by using laboratory animals, monoclonal antibodies against different antigenic domains of proteins of the virus are developed. Then the reactivity of these antibodies with all proteins of shrimp can be tested by ELISA. As it is not known at the start of the test which parts of a protein are strong epitopes and so there is a need to test many peptides, this method is expensive and time consuming. One of the solutions for this problem is prediction of epitopes, synthesis of few peptides, and testing these peptides. Since VP28 is the most important protein of WSSV capsid, the sequences of amino acids of VP28 of four isolates of WSSV from different parts of the world were collected for this study. By using bioinformatic methods, after aligning of sequences the consensus sequence was identified. For prediction of antigenic domains of V28, seven different programs were used. The analysis through the computer programme resulted in prediction of five epitopes in V28. These parts of the protein can now be synthesized and tested for identification of the virus

Directory of Open Access Journals

Aquatic Commons

Plant glutathione S-transferase classification, structure and evolution

Author: Esmaeili M
Mohabatkar H
Mohsenzadeh S
Moosavi F
Saffari B
Shahrtash M
Publication venue: 'African Journals Online (AJOL)'
Publication date: 17/10/2013
Field of study

Glutathione S-transferases are multifunctional proteins involved in diverse intracellular events such as primary and secondary metabolisms, stress metabolism, herbicide detoxification and plant protection against ozone damages, heavy metals and xenobiotics. The plant glutathione S-transferase superfamily have been subdivided into eight classes. Phi, tau, zeta, theta, lambda, dehydroascorbate reductase and tetrachlorohydroquinone dehalogenase classes are soluble and one class is microsomal. Glutathione S-transferases are mostly soluble cytoplasmic enzymes. To date, the crystal structures of over 200 soluble glutathione S-transferases, present in plants, animals and bacteria have been resolved. The structures of glutathione S-transferase influence its function. Phylogenetic analysis suggests that all soluble glutathione S-transferases have arisen from an ancient progenitor gene, through both convergent and divergent pathways.Key words: Glutathione S-transferases (GST), classification, structure, evolution, phylogenetic analysis, xenobiotics

AJOL - African Journals Online

Predicting Anatomical Therapeutic Chemical (ATC) Classification of Drugs by Integrating Chemical-Chemical Interactions and Similarities

Author: DN Georgiou
GA Watson
GP Zhou
GP Zhou
GP Zhou
H Gurulingappa
H Mohabatkar
H Mohabatkar
IW Althaus
J Andraos
J Lin
Kai-Yan Feng
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
Kuo-Chen Chou
L Hu
Lei Chen
M Dunkel
M Esmaeili
M Hattori
M Kanehisa
M Kanehisa
M Kuhn
Ozlem Keskin
P Jaccard
P Wang
Q Gu
R Sharan
T Huang
U Karaoz
Wei-Ming Zeng
WZ Lin
X Xiao
YD Cai
YD Cai
Yu-Dong Cai
ZC Wu
ZC Wu
Publication venue: Public Library of Science
Publication date: 13/04/2012
Field of study

The Anatomical Therapeutic Chemical (ATC) classification system, recommended by the World Health Organization, categories drugs into different classes according to their therapeutic and chemical characteristics. For a set of query compounds, how can we identify which ATC-class (or classes) they belong to? It is an important and challenging problem because the information thus obtained would be quite useful for drug development and utilization. By hybridizing the informations of chemical-chemical interactions and chemical-chemical similarities, a novel method was developed for such purpose. It was observed by the jackknife test on a benchmark dataset of 3,883 drug compounds that the overall success rate achieved by the prediction method was about 73% in identifying the drugs among the following 14 main ATC-classes: (1) alimentary tract and metabolism; (2) blood and blood forming organs; (3) cardiovascular system; (4) dermatologicals; (5) genitourinary system and sex hormones; (6) systemic hormonal preparations, excluding sex hormones and insulins; (7) anti-infectives for systemic use; (8) antineoplastic and immunomodulating agents; (9) musculoskeletal system; (10) nervous system; (11) antiparasitic products, insecticides and repellents; (12) respiratory system; (13) sensory organs; (14) various. Such a success rate is substantially higher than 7% by the random guess. It has not escaped our notice that the current method can be straightforwardly extended to identify the drugs for their 2nd-level, 3rd-level, 4th-level, and 5th-level ATC-classifications once the statistically significant benchmark data are available for these lower levels

Public Library of Science (PLOS)

Crossref

PubMed Central

FigShare

iDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model

Author: A Bairoch
A Dehzangi
A Neumann
AA Schaffer
AK Patel
AK Patel
B Molparia
C Chen
DN Georgiou
E Nordhoff
EW Stawiski
G Nimrod
G Nimrod
G Wang
G Wang
H Mohabatkar
H Mohabatkar
HP Shanahan
J Rogers
JB Brown
JD Qiu
Jian-An Fang
JL Deng
JS Wu
K-C Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KK Kandaswamy
KK Kumar
Kuo-Chen Chou
L Breiman
L Breiman
L Nanni
L Nanni
L Yu
M Esmaeili
M Keil
M Kumar
N Bhardwaj
N Bhardwaj
Q Gu
RE Langlois
S Ahmad
S Ahmad
Vladimir N. Uversky
Wei-Zhong Lin
WR Atchley
X Shao
X Xiao
X Xiao
X Yu
XB Zhou
Xuan Xiao
Y Cai
Y Fang
YD Cai
YH Zeng
ZP Liu
Publication venue: Public Library of Science
Publication date: 15/09/2011
Field of study

DNA-binding proteins play crucial roles in various cellular processes. Developing high throughput tools for rapidly and effectively identifying DNA-binding proteins is one of the major challenges in the field of genome annotation. Although many efforts have been made in this regard, further effort is needed to enhance the prediction power

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Computer-aided design of nano-filter construction using DNA self-assembly

Author: A. Carbone
A.P. Alivisatos
B.H. Robinson
C.A. Mirkin
C.F. Monson
D.M. Hawkins
E. Braun
E.K. Freyhult
E.W. Myers
F. Yoshida
G. Braun
H. Qiu
Hassan Mohabatkar
I. Willner
J. Richter
J.D. Watson
K. Keren
K. Keren
M. Liu
M. Mertig
M.A. Batalia
N.C. Seeman
N.C. Seeman
N.C. Seeman
O. Gotoh
R.P. Fahlman
Reza Mohammadzadegan
S. Chomet
S.B. Needleman
T.F. Smith
T.G. Drummond
Z.X. Deng
Publication venue: Springer
Publication date: 01/01/2006
Field of study

Computer-aided design plays a fundamental role in both top-down and bottom-up nano-system fabrication. This paper presents a bottom-up nano-filter patterning process based on DNA self-assembly. In this study we designed a new method to construct fully designed nano-filters with the pores between 5 nm and 9 nm in diameter. Our calculations illustrated that by constructing such a nano-filter we would be able to separate many molecules

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

NR-2L: A Two-Level Predictor for Identifying Nuclear Receptor Subfamilies Based on Sequence-Derived Features

Author: DJ Mangelsdorf
GP Zhou
GP Zhou
H Florence
H Mohabatkar
H Nakashima
JM Keller
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KK Kandaswamy
Kuo-Chen Chou
L Altucci
M Bhasin
M Masso
M Robinson-Rechavi
Niall James Haslam
PC Mahalanobis
Pu Wang
QB Gao
RR Joshi
SF Altschul
T Cover
T Liu
T Liu
T Wang
VD Gusev
W Li
W Liu
X Xiao
Xuan Xiao
Publication venue: Public Library of Science
Publication date
Field of study

Nuclear receptors (NRs) are one of the most abundant classes of transcriptional regulators in animals. They regulate diverse functions, such as homeostasis, reproduction, development and metabolism. Therefore, NRs are a very important target for drug development. Nuclear receptors form a superfamily of phylogenetically related proteins and have been subdivided into different subfamilies due to their domain diversity. In this study, a two-level predictor, called NR-2L, was developed that can be used to identify a query protein as a nuclear receptor or not based on its sequence information alone; if it is, the prediction will be automatically continued to further identify it among the following seven subfamilies: (1) thyroid hormone like (NR1), (2) HNF4-like (NR2), (3) estrogen like, (4) nerve growth factor IB-like (NR4), (5) fushi tarazu-F1 like (NR5), (6) germ cell nuclear factor like (NR6), and (7) knirps like (NR0). The identification was made by the Fuzzy K nearest neighbor (FK-NN) classifier based on the pseudo amino acid composition formed by incorporating various physicochemical and statistical features derived from the protein sequences, such as amino acid composition, dipeptide composition, complexity factor, and low-frequency Fourier spectrum components. As a demonstration, it was shown through some benchmark datasets derived from the NucleaRDB and UniProt with low redundancy that the overall success rates achieved by the jackknife test were about 93% and 89% in the first and second level, respectively. The high success rates indicate that the novel two-level predictor can be a useful vehicle for identifying NRs and their subfamilies. As a user-friendly web server, NR-2L is freely accessible at either http://icpr.jci.edu.cn/bioinfo/NR2L or http://www.jci-bioinfo.cn/NR2L. Each job submitted to NR-2L can contain up to 500 query protein sequences and be finished in less than 2 minutes. The less the number of query proteins is, the shorter the time will usually be. All the program codes for NR-2L are available for non-commercial purpose upon request

Crossref

Directory of Open Access Journals

PubMed Central

Classification and Analysis of Regulatory Pathways Using Graph Property, Biochemical and Physicochemical Property, and Functional Property

Author: A Bairoch
A Barabasi
C Chen
C Chen
C Klukas
C Krieger
Cathal Seoighe
CF Gao
D Chakrabarti
D Frishman
DN Georgiou
E Camon
F Chiti
G Pollastri
GF Cooper
GP Zhou
GP Zhou
GY Zhang
H Ding
H Lin
H Mohabatkar
H Mohabatkar
H Ogata
H Peng
I Althaus
I Althaus
I Althaus
I Dubchak
I Dubchak
I Schomburg
I Schomburg
IH Witten
J Andraos
J Cheng
J Cheng
JD Qiu
JM Dale
K Chou
K Chou
K Chou
K Chou
K Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
Kuo-Chen Chou
L Chen
L Chen
L Chen
L Chen
L Chen
L Lu
L Lu
L Yu
Lei Chen
M Chang
M Esmaeili
M Kanehisa
M Kanehisa
M Kanehisa
M Kanehisa
N Chazal
N Friedman
P Carmona-Saez
P Pharkya
Q Gu
R Caspi
R Caspi
RR Bouckaert
S Salzberg
SS Keerthi
T Denoeux
T Huang
T Huang
T Huang
T Huang
T Huang
Tao Huang
U Stelzl
W Buntine
X Xiao
XB Zhou
Y Cai
Y Cai
Y Cai
Y Qi
YH Zeng
YS Lobanova
Yu-Dong Cai
Z He
ZC Wu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Given a regulatory pathway system consisting of a set of proteins, can we predict which pathway class it belongs to? Such a problem is closely related to the biological function of the pathway in cells and hence is quite fundamental and essential in systems biology and proteomics. This is also an extremely difficult and challenging problem due to its complexity. To address this problem, a novel approach was developed that can be used to predict query pathways among the following six functional categories: (i) “Metabolism”, (ii) “Genetic Information Processing”, (iii) “Environmental Information Processing”, (iv) “Cellular Processes”, (v) “Organismal Systems”, and (vi) “Human Diseases”. The prediction method was established trough the following procedures: (i) according to the general form of pseudo amino acid composition (PseAAC), each of the pathways concerned is formulated as a 5570-D (dimensional) vector; (ii) each of components in the 5570-D vector was derived by a series of feature extractions from the pathway system according to its graphic property, biochemical and physicochemical property, as well as functional property; (iii) the minimum redundancy maximum relevance (mRMR) method was adopted to operate the prediction. A cross-validation by the jackknife test on a benchmark dataset consisting of 146 regulatory pathways indicated that an overall success rate of 78.8% was achieved by our method in identifying query pathways among the above six classes, indicating the outcome is quite promising and encouraging. To the best of our knowledge, the current study represents the first effort in attempting to identity the type of a pathway system or its biological function. It is anticipated that our report may stimulate a series of follow-up investigations in this new and challenging area

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Imbalanced Multi-Modal Multi-Label Learning for Subcellular Localization Prediction of Human Proteins with Both Single and Multiple Sites

Author: A Hoglund
B Liao
CE Rasmussen
DN Georgiou
FM Li
Franca Fraternali
G Tsoumakas
GP Zhou
H Mohabatkar
H Mohabatkar
H Nakashima
HB Shen
HB Shen
HB Shen
HB Shen
HN Lin
Hong Gu
J Ma
J Ma
J Tian
J Yin
Jianjun He
JY Shi
K Imai
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KY Lee
L Chen
L Chen
L Hu
LJ Foster
LL Hu
M Esmaeili
MS Scott
O Emanuelsson
P Wang
P Wang
RE Schapire
S Briesemeister
S Hua
S Mei
S Mei
S Zhang
T Huang
T Huang
T Huang
T Liu
Wenqi Liu
WZ Lin
X Jiang
X Xiao
X Xiao
X Xiao
YH Zeng
YL Chen
YL Chen
Z He
Z Lu
ZC Wu
ZC Wu
Publication venue: Public Library of Science
Publication date: 08/06/2012
Field of study

It is well known that an important step toward understanding the functions of a protein is to determine its subcellular location. Although numerous prediction algorithms have been developed, most of them typically focused on the proteins with only one location. In recent years, researchers have begun to pay attention to the subcellular localization prediction of the proteins with multiple sites. However, almost all the existing approaches have failed to take into account the correlations among the locations caused by the proteins with multiple sites, which may be the important information for improving the prediction accuracy of the proteins with multiple sites. In this paper, a new algorithm which can effectively exploit the correlations among the locations is proposed by using Gaussian process model. Besides, the algorithm also can realize optimal linear combination of various feature extraction technologies and could be robust to the imbalanced data set. Experimental results on a human protein data set show that the proposed algorithm is valid and can achieve better performance than the existing approaches

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

A Multi-Label Predictor for Identifying the Subcellular Locations of Singleplex and Multiplex Eukaryotic Proteins

Author: A Garg
A Khan
A Pierleoni
A Reinhardt
AA Schffer
AH Millar
B Niu
C Chen
C Cortes
C Smith
CS Yu
D Georgiou
D Zou
E Camon
E Glory
FM Li
G Tsoumakas
Guo-Zheng Li
GY Zhang
GY Zhang
H Ding
H Ding
H Lin
H Lin
H Mohabatkar
H Mohabatkar
HB Shen
HB Shen
HB Shen
HB Shen
HB Shen
HB Shen
J Guo
J Lin
J Lin
J Read
J Wang
JD Qiu
JD Qiu
JD Qiu
K Nakai
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KJ Park
KK Kandaswamy
L Hao
L Hu
L Nanni
L Nanni
L Yu
Lukasz Kurgan
M Ashburner
M Bhasin
M Esmaeili
M Gerstein
P Wang
Q Gu
R Fan
S Hua
S Zhang
SS Sahu
SW Zhang
SW Zhang
WZ Lin
X Jian
X Jiang
X Xiao
X Xiao
X Xiao
XB Zhou
Xiao Wang
Y Fang
Y Huang
Y Loewenstein
YC Wang
Yh Zeng
YS Ding
Z Lu
ZC Li
Publication venue: Public Library of Science
Publication date: 22/05/2012
Field of study

Subcellular locations of proteins are important functional attributes. An effective and efficient subcellular localization predictor is necessary for rapidly and reliably annotating subcellular locations of proteins. Most of existing subcellular localization methods are only used to deal with single-location proteins. Actually, proteins may simultaneously exist at, or move between, two or more different subcellular locations. To better reflect characteristics of multiplex proteins, it is highly desired to develop new methods for dealing with them. In this paper, a new predictor, called Euk-ECC-mPLoc, by introducing a powerful multi-label learning approach which exploits correlations between subcellular locations and hybridizing gene ontology with dipeptide composition information, has been developed that can be used to deal with systems containing both singleplex and multiplex eukaryotic proteins. It can be utilized to identify eukaryotic proteins among the following 22 locations: (1) acrosome, (2) cell membrane, (3) cell wall, (4) centrosome, (5) chloroplast, (6) cyanelle, (7) cytoplasm, (8) cytoskeleton, (9) endoplasmic reticulum, (10) endosome, (11) extracellular, (12) Golgi apparatus, (13) hydrogenosome, (14) lysosome, (15) melanosome, (16) microsome, (17) mitochondrion, (18) nucleus, (19) peroxisome, (20) spindle pole body, (21) synapse, and (22) vacuole. Experimental results on a stringent benchmark dataset of eukaryotic proteins by jackknife cross validation test show that the average success rate and overall success rate obtained by Euk-ECC-mPLoc were 69.70% and 81.54%, respectively, indicating that our approach is quite promising. Particularly, the success rates achieved by Euk-ECC-mPLoc for small subsets were remarkably improved, indicating that it holds a high potential for simulating the development of the area. As a user-friendly web-server, Euk-ECC-mPLoc is freely accessible to the public at the website http://levis.tongji.edu.cn:8080/bioinfo/Euk-ECC-mPLoc/. We believe that Euk-ECC-mPLoc may become a useful high-throughput tool, or at least play a complementary role to the existing predictors in identifying subcellular locations of eukaryotic proteins

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Identification of Colorectal Cancer Related Genes with mRMR and Shortest Path in Protein-Protein Interaction Network

Author: B Bakall
B Hoeft
BC Christensen
Bi-Qing Li
C Deves
C Hiranuma
CA Borgono
D Landi
D Liu
D Menendez
D Szklarczyk
DN Georgiou
DW Parsons
E Dijkstra
E Nabieva
EP Diamandis
EP Diamandis
G Lagger
G Thomas
GP Zhou
GP Zhou
GP Zhou
GR Howe
H Mohabatkar
H Mohabatkar
H Peng
H Stohr
H Tsukahara
HE MacLean
I Niittymaki
I Ohkubo
IJ Kim
IW Althaus
J Andraos
J Cui
J Li
J Sabates-Bellver
JH Friedman
JL Huret
JR Reeves
K Hibi
K Imai
K Yu
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KC Chou
KL Ng
Kuo-Chen Chou
L Castagnetta
L Chen
L Chen
L Hu
L Hu
LD Wood
Lei Liu
LL Hu
M Esmaeili
M Katoh
M Levesque
M Talieri
M Thangaraju
MG Catalano
ML Slattery
MS Kim
MW Medina
P Bogdanov
P Polakis
Paulo Lee Ho
Q Gu
Q Liu
R Sharan
RA Irizarry
S Jones
S Letovsky
SA Gayther
SA Johnson
SH Nagaraj
SM Lipkin
T Denoeux
T Hinoue
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Huang
T Morikawa
Tao Huang
TS Keshava Prasad
U Karaoz
W Huang da
W van Criekinge
WL Allen
X Xiao
XY Yang
Y Benjamini
Y Cai
YA Kourmpetis
YD Cai
Yu-Dong Cai
ZC Wu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

One of the most important and challenging problems in biomedicine and genomics is how to identify the disease genes. In this study, we developed a computational method to identify colorectal cancer-related genes based on (i) the gene expression profiles, and (ii) the shortest path analysis of functional protein association networks. The former has been used to select differentially expressed genes as disease genes for quite a long time, while the latter has been widely used to study the mechanism of diseases. With the existing protein-protein interaction data from STRING (Search Tool for the Retrieval of Interacting Genes), a weighted functional protein association network was constructed. By means of the mRMR (Maximum Relevance Minimum Redundancy) approach, six genes were identified that can distinguish the colorectal tumors and normal adjacent colonic tissues from their gene expression profiles. Meanwhile, according to the shortest path approach, we further found an additional 35 genes, of which some have been reported to be relevant to colorectal cancer and some are very likely to be relevant to it. Interestingly, the genes we identified from both the gene expression profiles and the functional protein association network have more cancer genes than the genes identified from the gene expression profiles alone. Besides, these genes also had greater functional similarity with the reported colorectal cancer genes than the genes identified from the gene expression profiles alone. All these indicate that our method as presented in this paper is quite promising. The method may become a useful tool, or at least plays a complementary role to the existing method, for identifying colorectal cancer genes. It has not escaped our notice that the method can be applied to identify the genes of other diseases as well

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare