Search CORE

20 research outputs found

Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains

Author: Panwar Bharat
Raghava Gajendra PS
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Aminoacyl tRNA synthetases (aaRSs) catalyse the first step of protein synthesis in all organisms. They are responsible for the precise attachment of amino acids to their cognate transfer RNAs. There are twenty different types of aaRSs, unique for each amino acid. These aaRSs have been divided into two classes, each comprising ten enzymes. It is important to predict and classify aaRSs in order to understand protein synthesis. Results In this study, all models were developed on a non-redundant dataset containing 117 aaRSs and an equal number of non-aaRSs, in which no two sequences have more than 30% similarity. First, we applied the similarity search technique, BLAST, and achieved a maximum accuracy of 67.52%. We observed that 62% of tRNA synthetases contain one or more domains from amongst the following four PROSITE domains: PS50862, PS00178, PS50860 and PS50861. An SVM-based model was developed to discriminate between aaRSs, and non-aaRSs, and achieved a maximum MCC of 0.68 with accuracy of 83.73%, using selective dipeptide composition. We developed a hybrid approach and achieved a maximum MCC of 0.72 with accuracy of 85.49%, where SVM model developed using selected dipeptide composition and information of four PROSITE domains. We further developed an SVM-based model for classifying the aaRSs into class-1 and class-2, using selective dipeptide composition and achieved an MCC of 0.79. We also observed that two domains (PS00178, PS50889) in class-1 and three domains (PS50862, PS50860, PS50861) in class-2 were preferred. A hybrid method was developed using these domains as descriptor, along with selected dipeptide composition, and achieved an MCC of 0.87 with a sensitivity of 94.55% and an accuracy of 93.19%. All models were evaluated using a five-fold cross-validation technique. Conclusions We have analyzed protein sequences of aaRSs (class-1 and class-2) and non-aaRSs and identified interesting patterns. The high accuracy achieved by our SVM models using selected dipeptide composition demonstrates that certain types of dipeptide are preferred in aaRSs. We were able to identify PROSITE domains that are preferred in aaRSs and their classes, providing interesting insights into tRNA synthetases. The method developed in this study will be useful for researchers studying aaRS enzymes and tRNA biology. The web-server based on the above study, is available at <url>http://www.imtech.res.in/raghava/icaars/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

BIAdb: A curated database of benzylisoquinoline alkaloids

Author: Kaur Jasjit
Panwar Bharat
Raghava Gajendra PS
Sharma Arun
Singla Deepak
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Benzylisoquinoline is the structural backbone of many alkaloids with a wide variety of structures including papaverine, noscapine, codeine, morphine, apomorphine, berberine, protopine and tubocurarine. Many benzylisoquinoline alkaloids have been reported to show therapeutic properties and to act as novel medicines. Thus it is important to collect and compile benzylisoquinoline alkaloids in order to explore their usage in medicine. Description: We extract information about benzylisoquinoline alkaloids from various sources like PubChem, KEGG, KNApSAcK and manual curation from literature. This information was processed and compiled in order to create a comprehensive database of benzylisoquinoline alkaloids, called BIAdb. The current version of BIAdb contains information about 846 unique benzylisoquinoline alkaloids, with multiple entries in term of source, function leads to total number of 2504 records. One of the major features of this database is that it provides data about 627 different plant species as a source of benzylisoquinoline and 114 different types of function performed by these compounds. A large number of online tools have been integrated, which facilitate user in exploring full potential of BIAdb. In order to provide additional information, we give external links to other resources/databases. One of the important features of this database is that it is tightly integrated with Drugpedia, which allows managing data in fixed/flexible format. Conclusions: A database of benzylisoquinoline compounds has been created, which provides comprehensive information about benzylisoquinoline alkaloids. This database will be very useful for those who are working in the field of drug discovery based on natural products. This database will also serve researchers working in the field of synthetic biology, as developing medicinally important alkaloids using synthetic process are one of important challenges. This database is available from http://crdd.osdd.net/raghava/biadb/

Crossref

Springer - Publisher Connector

PubMed Central

Analysis and prediction of cancerlectins using evolutionary and domain information

Author: A Garg
A Thies
Bharat Panwar
C Chen
CK Ching
D Damodaran
E Gorelik
EG De Mejia
EM Zdobnov
FT Liu
Gajendra PS Raghava
GD Fasman
GR Vasta
H Ding
H Kaur
H Kaur
H Kaur
H Lis
J Adam
Jagat S Chauhan
KA Mahon
KC Chou
KV Brinda
L Jun-Wei
M Bhasin
M Jogindra Swamy
M Kumar
M Kumar
M Kumar
M Rashid
M Vijayan
N Sharon
P Baldi
R Duncan
R Kaundal
R Lotan
R Verma
R Verma
Ravi Kumar
S Dejun
S Hu
S Nakahara
SF Altschul
SF Altschul
SH Choi
T Joachims
T Shirai
T Szoke
U Schumacher
U Schumacher
V Vapnik
WR Pearson
XF Bai
Y K Song
YK Song
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Predicting the function of a protein is one of the major challenges in the post-genomic era where a large number of protein sequences of unknown function are accumulating rapidly. Lectins are the proteins that specifically recognize and bind to carbohydrate moieties present on either proteins or lipids. Cancerlectins are those lectins that play various important roles in tumor cell differentiation and metastasis. Although the two types of proteins are linked, still there is no computational method available that can distinguish cancerlectins from the large pool of non-cancerlectins. Hence, it is imperative to develop a method that can distinguish between cancer and non-cancerlectins. Results All the models developed in this study are based on a non-redundant dataset containing 178 cancerlectins and 226 non-cancerlectins in which no two sequences have more than 50% sequence similarity. We have applied the similarity search based technique, i.e. BLAST, and achieved a maximum accuracy of 43.25%. The amino acids compositional analysis have shown that certain residues (e.g. Leucine, Proline) were preferred in cancerlectins whereas some other (e.g. Asparatic acid, Asparagine) were preferred in non-cancerlectins. It has been found that the PROSITE domain "Crystalline beta gamma" was abundant in cancerlectins whereas domains like "SUEL-type lectin domain" were found mainly in non-cancerlectins. An SVM-based model has been developed to differentiate between the cancer and non-cancerlectins which achieved a maximum Matthew's correlation coefficient (MCC) value of 0.32 with an accuracy of 64.84%, using amino acid compositions. We have developed a model based on dipeptide compositions which achieved an MCC value of 0.30 with an accuracy of 64.84%. Thereafter, we have developed models based on split compositions (2 and 4 parts) and achieved an MCC value of 0.31, 0.32 with accuracies of 65.10% and 66.09%, respectively. An SVM model based on Position Specific Scoring Matrix (PSSM), generated by PSI-BLAST, was developed and achieved an MCC value of 0.36 with an accuracy of 68.34%. Finally, we have integrated the PROSITE domain information with PSSM and developed an SVM model that has achieved an MCC value of 0.38 with 69.09% accuracy. Conclusion BLAST has been found inefficient to distinguish between cancer and non-cancerlectins. We analyzed the protein sequences of cancer and non-cancerlectins and identified interesting patterns. We have been able to identify PROSITE domains that are preferred in cancer and non-cancerlectins and thus provided interesting insights into the two types of proteins. The method developed in this study will be useful for researchers studying cancerlectins, lectins and cancer biology. The web-server based on the above study, is available at <url>http://www.imtech.res.in/raghava/cancer_pred/</url></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis

Author: Aguilar Daniel
Aittokallio Tero
Allaart Cornelia F.
Ammad-ud-din Muhammad
Anton Bernat
Azencott Chloe-Agathe
Balagurusamy Venkat S. K.
Barton Anne
Bellón Víctor
Boeva Valentina
Bonet Jaume
Bridges S. Louis
Bunte Kerstin
Calaza Manuel
Cheng Lu
Chheda Himanshu
Coenen Marieke
Corander Jukka
Criswell Lindsey
Criswell Lindsey
Cui Jing
de Vries Niek
Dillenberger Donna
Dumontier Michel
Eksi Ridvan
Elmarakeby Haitham
Falcao Andre O.
Fornés Oriol
Friend Stephen
Friend Stephen
García-García Javier
Gerlag Danielle
Goldenberg Anna
Gopalacharyulu Peddinti
Greenberg Jeff
Gregersen Peter K.
Gregersen Peter K.
Guan Yuanfang
Guan Yuanfang
Guney Emre
Hajiloo Mohsen
Heath Lenwood S.
Hidru Daniel
Hoff Bruce
Huizinga Tom W. J.
Jaiswal Alok
Kaski Samuel
Khalfaoui Beyrem
Khan Suleiman Ali
Klareskog Lars
Klareskog Lars
Kramer Eric R.
Kremer Joel
Kurreeman Fina
Li Hongdong
Long Quan
Louis Bridges Jr. S.
Mangravite Lara M.
Mangravite Lara M.
Mariette Xavier
Marttinen Pekka
Marín Manuel Alejandro
Mezlini Aziz M.
Miceli Corinne
Michaud Kaleb
Molparia Bhuvan
Moore Jonathan D.
Moreland Larry
Moreland Larry
Neto Elias Chaibub
Norman Thea
Oliva Baldo
Oliva Baldo
Opiyo Stephen Obol
Padyukov Leonid
Padyukov Leonid
Pandey Gaurav
Panwar Bharat
Pappas Dimitrios
Pirinen Matti
Planas-Iglesias Joan
Plenge Robert
Plenge Robert
Poglayen Daniel
Pratap Abhishek
Saarela Janna
Saevarsdottir Saedis
Saevarsdottir Saedis
Samwald Matthias
Savage Richard S.
Shadick Nancy
Sieberts Solveig K.
Stahl Eli
Stolovitzky Gustavo
Stolovitzky Gustavo
Stoven Véronique
Suver Christine
Tak Paul P.
Tang Hao
Tang Jing
Torkamani Ali
Vert Jean-Phillipe
Wang Bo
Wang Tao
Weinblatt Michael
Wennerberg Krister
Wineinger Nathan E.
Xiao Guanghua
Xie Yang
Yeung Rae
Zhan Xiaowei
Zhao Cheng
Zhu Fan
Zhu Jun
Publication venue
Publication date: 01/01/2016
Field of study

Correction: vol 7, 13205, 2016, doi:10.1038/ncomms13205Rheumatoid arthritis (RA) affects millions world-wide. While anti-TNF treatment is widely used to reduce disease progression, treatment fails in Bone-third of patients. No biomarker currently exists that identifies non-responders before treatment. A rigorous community-based assessment of the utility of SNP data for predicting anti-TNF treatment efficacy in RA patients was performed in the context of a DREAM Challenge (http://www.synapse.org/RA_Challenge). An open challenge framework enabled the comparative evaluation of predictions developed by 73 research groups using the most comprehensive available data and covering a wide range of state-of-the-art modelling methodologies. Despite a significant genetic heritability estimate of treatment non-response trait (h(2) = 0.18, P value = 0.02), no significant genetic contribution to prediction accuracy is observed. Results formally confirm the expectations of the rheumatology community that SNP information does not significantly improve predictive performance relative to standard clinical traits, thereby justifying a refocusing of future efforts on collection of other data.Peer reviewe

Multi-cell type gene coexpression network analysis reveals coordinated interferon response and cross-cell type correlations in systemic lupus erythematosus.

Author: Panwar Bharat,
Publication venue
Publication date: 24/06/2023
Field of study

Ezid

Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information

Author: Gupta Sudheer
Panwar Bharat
Raghava Gajendra P S
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2013
Field of study

Abstract Background The vitamins are important cofactors in various enzymatic-reactions. In past, many inhibitors have been designed against vitamin binding pockets in order to inhibit vitamin-protein interactions. Thus, it is important to identify vitamin interacting residues in a protein. It is possible to detect vitamin-binding pockets on a protein, if its tertiary structure is known. Unfortunately tertiary structures of limited proteins are available. Therefore, it is important to develop in-silico models for predicting vitamin interacting residues in protein from its primary structure. Results In this study, first we compared protein-interacting residues of vitamins with other ligands using Two Sample Logo (TSL). It was observed that ATP, GTP, NAD, FAD and mannose preferred {G,R,K,S,H}, {G,K,T,S,D,N}, {T,G,Y}, {G,Y,W} and {Y,D,W,N,E} residues respectively, whereas vitamins preferred {Y,F,S,W,T,G,H} residues for the interaction with proteins. Furthermore, compositional information of preferred and non-preferred residues along with patterns-specificity was also observed within different vitamin-classes. Vitamins A, B and B6 preferred {F,I,W,Y,L,V}, {S,Y,G,T,H,W,N,E} and {S,T,G,H,Y,N} interacting residues respectively. It suggested that protein-binding patterns of vitamins are different from other ligands, and motivated us to develop separate predictor for vitamins and their sub-classes. The four different prediction modules, (i) vitamin interacting residues (VIRs), (ii) vitamin-A interacting residues (VAIRs), (iii) vitamin-B interacting residues (VBIRs) and (iv) pyridoxal-5-phosphate (vitamin B6) interacting residues (PLPIRs) have been developed. We applied various classifiers of SVM, BayesNet, NaiveBayes, ComplementNaiveBayes, NaiveBayesMultinomial, RandomForest and IBk etc., as machine learning techniques, using binary and Position-Specific Scoring Matrix (PSSM) features of protein sequences. Finally, we selected best performing SVM modules and obtained highest MCC of 0.53, 0.48, 0.61, 0.81 for VIRs, VAIRs, VBIRs, PLPIRs respectively, using PSSM-based evolutionary information. All the modules developed in this study have been trained and tested on non-redundant datasets and evaluated using five-fold cross-validation technique. The performances were also evaluated on the balanced and different independent datasets. Conclusions This study demonstrates that it is possible to predict VIRs, VAIRs, VBIRs and PLPIRs from evolutionary information of protein sequence. In order to provide service to the scientific community, we have developed web-server and standalone software VitaPred (http://crdd.osdd.net/raghava/vitapred/).</p

Directory of Open Access Journals

Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information

Author: Bharat Panwar
Gajendra P S Raghava
Sudheer Gupta
Publication venue: Springer Nature
Publication date: 07/02/2013
Field of study

BACKGROUND: The vitamins are important cofactors in various enzymatic-reactions. In past, many inhibitors have been designed against vitamin binding pockets in order to inhibit vitamin-protein interactions. Thus, it is important to identify vitamin interacting residues in a protein. It is possible to detect vitamin-binding pockets on a protein, if its tertiary structure is known. Unfortunately tertiary structures of limited proteins are available. Therefore, it is important to develop in-silico models for predicting vitamin interacting residues in protein from its primary structure. RESULTS: In this study, first we compared protein-interacting residues of vitamins with other ligands using Two Sample Logo (TSL). It was observed that ATP, GTP, NAD, FAD and mannose preferred {G,R,K,S,H}, {G,K,T,S,D,N}, {T,G,Y}, {G,Y,W} and {Y,D,W,N,E} residues respectively, whereas vitamins preferred {Y,F,S,W,T,G,H} residues for the interaction with proteins. Furthermore, compositional information of preferred and non-preferred residues along with patterns-specificity was also observed within different vitamin-classes. Vitamins A, B and B6 preferred {F,I,W,Y,L,V}, {S,Y,G,T,H,W,N,E} and {S,T,G,H,Y,N} interacting residues respectively. It suggested that protein-binding patterns of vitamins are different from other ligands, and motivated us to develop separate predictor for vitamins and their sub-classes. The four different prediction modules, (i) vitamin interacting residues (VIRs), (ii) vitamin-A interacting residues (VAIRs), (iii) vitamin-B interacting residues (VBIRs) and (iv) pyridoxal-5-phosphate (vitamin B6) interacting residues (PLPIRs) have been developed. We applied various classifiers of SVM, BayesNet, NaiveBayes, ComplementNaiveBayes, NaiveBayesMultinomial, RandomForest and IBk etc., as machine learning techniques, using binary and Position-Specific Scoring Matrix (PSSM) features of protein sequences. Finally, we selected best performing SVM modules and obtained highest MCC of 0.53, 0.48, 0.61, 0.81 for VIRs, VAIRs, VBIRs, PLPIRs respectively, using PSSM-based evolutionary information. All the modules developed in this study have been trained and tested on non-redundant datasets and evaluated using five-fold cross-validation technique. The performances were also evaluated on the balanced and different independent datasets. CONCLUSIONS: This study demonstrates that it is possible to predict VIRs, VAIRs, VBIRs and PLPIRs from evolutionary information of protein sequence. In order to provide service to the scientific community, we have developed web-server and standalone software VitaPred (http://crdd.osdd.net/raghava/vitapred/)

Springer - Publisher Connector

PubMed Central