Search CORE

34 research outputs found

Identification of the ubiquitin–proteasome pathway domain by hyperparameter optimization based on a 2D convolutional neural network

Author: Ali Ghulam
Apilak Worachartcheewan
Maha A. Thafar
Muhammad Arif
Rahu Sikander
Shabana Habib
Publication venue: 'Frontiers Media SA'
Publication date: 01/07/2022
Field of study

The major mechanism of proteolysis in the cytosol and nucleus is the ubiquitin–proteasome pathway (UPP). The highly controlled UPP has an effect on a wide range of cellular processes and substrates, and flaws in the system can lead to the pathogenesis of a number of serious human diseases. Knowledge about UPPs provide useful hints to understand the cellular process and drug discovery. The exponential growth in next-generation sequencing wet lab approaches have accelerated the accumulation of unannotated data in online databases, making the UPP characterization/analysis task more challenging. Thus, computational methods are used as an alternative for fast and accurate identification of UPPs. Aiming this, we develop a novel deep learning-based predictor named “2DCNN-UPP” for identifying UPPs with low error rate. In the proposed method, we used proposed algorithm with a two-dimensional convolutional neural network with dipeptide deviation features. To avoid the over fitting problem, genetic algorithm is employed to select the optimal features. Finally, the optimized attribute set are fed as input to the 2D-CNN learning engine for building the model. Empirical evidence or outcomes demonstrates that the proposed predictor achieved an overall accuracy and AUC (ROC) value using 10-fold cross validation test. Superior performance compared to other state-of-the art methods for discrimination the relations UPPs classification. Both on and independent test respectively was trained on 10-fold cross validation method and then evaluated through independent test. In the case where experimentally validated ubiquitination sites emerged, we must devise a proteomics-based predictor of ubiquitination. Meanwhile, we also evaluated the generalization power of our trained modal via independent test, and obtained remarkable performance in term of 0.862 accuracy, 0.921 sensitivity, 0.803 specificity 0.803, and 0.730 Matthews correlation coefficient (MCC) respectively. Four approaches were used in the sequences, and the physical properties were calculated combined. When used a 10-fold cross-validation, 2D-CNN-UPP obtained an AUC (ROC) value of 0.862 predicted score. We analyzed the relationship between UPP protein and non-UPP protein predicted score. Last but not least, this research could effectively analyze the large scale relationship between UPP proteins and non-UPP proteins in particular and other protein problems in general and our research work might improve computational biological research. Therefore, we could utilize the latest features in our model framework and Dipeptide Deviation from Expected Mean (DDE) -based protein structure features for the prediction of protein structure, functions, and different molecules, such as DNA and RNA

Directory of Open Access Journals

Rational Design of Colchicine Derivatives as anti-HIV Agents via QSAR and Molecular Docking

Crossref

Large-scale classification of P-glycoprotein inhibitors using SMILES-based descriptors

Author: A. A. Toropov (2617437)
A. P. Toropova (2617433)
A. Worachartcheewan (3620786)
C. Nantasenamat (3620792)
N. Schaduangrat (3620789)
V. Prachayasittikul (3620783)
Publication venue
Publication date: 06/01/2017
Field of study

<p>P-glycoprotein (Pgp) inhibition has been considered as an effective strategy towards combating multidrug-resistant cancers. Owing to the substrate promiscuity of Pgp, the classification of its interacting ligands is not an easy task and is an ongoing issue of debate. Chemical structures can be represented by the simplified molecular input line entry system (SMILES) in the form of linear string of symbols. In this study, the SMILES notations of 2254 Pgp inhibitors including 1341 active, and 913 inactive compounds were used for the construction of a SMILE-based classification model using CORrelation And Logic (CORAL) software. The model provided an acceptable predictive performance as observed from statistical parameters consisting of accuracy, sensitivity and specificity that afforded values greater than 70% and MCC value greater than 0.6 for training, calibration and validation sets. In addition, the CORAL method highlighted chemical features that may contribute to increased and decreased Pgp inhibitory activities. This study highlights the potential of CORAL software for rapid screening of prospective compounds from a large chemical space and provides information that could aid in the design and development of potential Pgp inhibitors.</p

The Francis Crick Institute

Large-scale structure-activity relationship study of hepatitis C virus NS5B polymerase inhibition using SMILES-based descriptors

Author: A Batra
A Toropov
A Worachartcheewan
A Worachartcheewan
A Worachartcheewan
AA Toropov
AA Toropov
AA Toropov
AA Toropov
AA Toropov
AA Toropov
AK Srivastava
Alla P. Toropova
Andrey A. Toropov
AP Toropova
AP Toropova
AP Toropova
Apilak Worachartcheewan
C Nantasenamat
C Nantasenamat
C Nantasenamat
Chanin Nantasenamat
E Pourbasheer
GS Cooke
HB El-Serag
L Wei
M Lapins
M Wang
M Wang
MP Walker
PK Ojha
S Chinnaswamy
T Liu
TJ Liang
V Prachayasittikul
Virapong Prachayasittikul
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Potential value and impact of data mining and machine learning in clinical diagnostics

Author: Aqlan F
Asad M
Firouzi Jahantigh F
Georga EI
Husain W
Khameneh ME
Kim T
Lee J-Y
Momand Z
Pan CC
Sahu H
Sarraf S
Shi CH
Winiarti S
Worachartcheewan A
Worachartcheewan A
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

VPatho: a deep learning-based two-stage approach for accurate prediction of gain-of-function and loss-of-function variants

Author: Ge F.
Iqbal S.
Li C.
Li F.
Muhammad A.
Song J.
Thafar M.A.
Worachartcheewan A.
Xu X.
Yan Z.
Yu D.J.
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2023
Field of study

Determining the pathogenicity and functional impact (i.e. gain-of-function; GOF or loss-of-function; LOF) of a variant is vital for unraveling the genetic level mechanisms of human diseases. To provide a 'one-stop' framework for the accurate identification of pathogenicity and functional impact of variants, we developed a two-stage deep-learning-based computational solution, termed VPatho, which was trained using a total of 9619 pathogenic GOF/LOF and 138 026 neutral variants curated from various databases. A total number of 138 variant-level, 262 protein-level and 103 genome-level features were extracted for constructing the models of VPatho. The development of VPatho consists of two stages: (i) a random under-sampling multi-scale residual neural network (ResNet) with a newly defined weighted-loss function (RUS-Wg-MSResNet) was proposed to predict variants' pathogenicity on the gnomAD_NV + GOF/LOF dataset; and (ii) an XGBOD model was constructed to predict the functional impact of the given variants. Benchmarking experiments demonstrated that RUS-Wg-MSResNet achieved the highest prediction performance with the weights calculated based on the ratios of neutral versus pathogenic variants. Independent tests showed that both RUS-Wg-MSResNet and XGBOD achieved outstanding performance. Moreover, assessed using variants from the CAGI6 competition, RUS-Wg-MSResNet achieved superior performance compared to state-of-the-art predictors. The fine-trained XGBOD models were further used to blind test the whole LOF data downloaded from gnomAD and accordingly, we identified 31 nonLOF variants that were previously labeled as LOF/uncertain variants. As an implementation of the developed approach, a webserver of VPatho is made publicly available at http://csbio.njust.edu.cn/bioinf/vpatho/ to facilitate community-wide efforts for profiling and prioritizing the query variants with respect to their pathogenicity and functional impact.Fang Ge, Chen Li, Shahid Iqbal, Arif Muhammad, Fuyi Li, Maha A. Thafar, Zihao Yan, Apilak Worachartcheewan, Xiaofeng Xu, Jiangning Song and Dong-Jun Y

Adelaide Research & Scholarship

Probing the origins of anticancer activity of chrysin derivatives

Author: A Tropsha
A Worachartcheewan
A Worachartcheewan
A Worachartcheewan
AK Ibrahim
Apilak Worachartcheewan
C Kandaswami
C Nantasenamat
C Nantasenamat
C Nantasenamat
C Nantasenamat
C Nantasenamat
Chanin Nantasenamat
Chartchalerm Isarankura-Na-Ayudhya
DJ Newman
GM Cragg
HA Mohammed
I Kubo
IH Witten
J Drews
J Sathiavelu
J Wang
KE Heim
KJ Woo
L Eriksson
LP Sun
M Ishihara
M Ishihara
M Karelson
M Serafini
MJ Frisch
P Batra
P Rathee
P Thanikaivelan
PG Pietta
PR Duchowicz
R DenningtonII
R Khachatoorian
RG Parr
RG Parr
RG Parr
RJ Nijveldt
S Zhang
T Takahashi
T Zhang
TP Cushnie
Virapong Prachayasittikul
X Zheng
X Zheng
Y Bae
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Predicting antimicrobial activities of benzimidazole derivatives

Crossref

Large-scale classification of P-glycoprotein inhibitors using SMILES-based descriptors

Author: A. A. Toropov
A. P. Toropova
A. Worachartcheewan
C. Nantasenamat
Chiba P.
Huber M.
Katzung B.G.
Medina-Franco J.L.
N. Schaduangrat
Nantasenamat C.
Prachayasittikul V.
Prachayasittikul V.
Seelig A.
Toropova A.P.
V. Prachayasittikul
V. Prachayasittikul
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Quasi-SMILES and nano-QFPR: The predictive model for zeta potentials of metal oxide nanoparticles

Author: Achary
Achary
Alla P. Toropova
Andrey A. Toropov
Cho
Estrada
García
Leszczynski
Mullen
P. Ganga Raju Achary
Toropov
Toropov
Toropov
Toropov
Toropova
Toropova
Toropova
Worachartcheewan
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref