4 research outputs found
The complete mitochondrial genome and phylogenetic analysis of <i>Polythlipta liquidalis</i> Leech, 1889 (Crambidae: spilomelinae)
The complete mitochondrial genome of Polythlipta liquidalis Leech, 1889 was sequenced and annotated in this study, which was the first reported complete mitogenome of the genus Polythlipta. The mitogenome of P. liquidalis is 15,305 bp in length and was predicted to encode 37 typical mitochondrial genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), 2 ribosomal RNA genes (rRNAs), and one major non-coding A-T rich region. The maximum likelihood phylogenetic analysis based on the 13 PCGs was constructed, including P. liquidalis and 15 related Spilomelinae species, using Ostrinia furnacalis as the outgroup. The result showed that P. liquidalis is grouped with Sinomphisa plagialis. These data will serve as a molecular resource for species identification of P. liquidalis and become a valuable resource for a range of genetic, functional, evolutionary and comparative genomic studies on members of Spilomelinae.</p
Table1_Study of PARP inhibitors for breast cancer based on enhanced multiple kernel function SVR with PSO.docx
PARP1 is one of six enzymes required for the highly error-prone DNA repair pathway microhomology-mediated end joining (MMEJ) and needs to be inhibited when over-expressed. In order to study the PARP1 inhibitory effect of fused tetracyclic or pentacyclic dihydrodiazepinoindolone derivatives (FTPDDs) by quantitative structure-activity relationship technique, six models were established by four kinds of methods, heuristic method, gene expression programming, random forester, and support vector regression with single, double, and triple kernel function respectively. The single, double, and triple kernel functions were RBF kernel function, the integration of RBF and polynomial kernel functions, and the integration of RBF, polynomial, and linear kernel functions respectively. The problem of multi-parameter optimization introduced in the support vector regression model was solved by the particle swarm optimization algorithm. Among the models, the model established by support vector regression with triple kernel function, in which the optimal R2 and RMSE of training set and test set were 0.9353, 0.9348 and 0.0157, 0.0288, and R2cv of training set and test set were 0.9090 and 0.8971, shows the strongest prediction ability and robustness. The method of support vector regression with triple kernel function is a great promotion in the field of quantitative structure-activity relationship, which will contribute a lot to designing and screening new drug molecules. The information contained in the model can provide important factors that guide drug design. Based on these factors, six new FTPDDs have been designed. Using molecular docking experiments to determine the properties of new derivatives, the new drug was ultimately successfully designed.</p
DataSheet1_Study of PARP inhibitors for breast cancer based on enhanced multiple kernel function SVR with PSO.ZIP
PARP1 is one of six enzymes required for the highly error-prone DNA repair pathway microhomology-mediated end joining (MMEJ) and needs to be inhibited when over-expressed. In order to study the PARP1 inhibitory effect of fused tetracyclic or pentacyclic dihydrodiazepinoindolone derivatives (FTPDDs) by quantitative structure-activity relationship technique, six models were established by four kinds of methods, heuristic method, gene expression programming, random forester, and support vector regression with single, double, and triple kernel function respectively. The single, double, and triple kernel functions were RBF kernel function, the integration of RBF and polynomial kernel functions, and the integration of RBF, polynomial, and linear kernel functions respectively. The problem of multi-parameter optimization introduced in the support vector regression model was solved by the particle swarm optimization algorithm. Among the models, the model established by support vector regression with triple kernel function, in which the optimal R2 and RMSE of training set and test set were 0.9353, 0.9348 and 0.0157, 0.0288, and R2cv of training set and test set were 0.9090 and 0.8971, shows the strongest prediction ability and robustness. The method of support vector regression with triple kernel function is a great promotion in the field of quantitative structure-activity relationship, which will contribute a lot to designing and screening new drug molecules. The information contained in the model can provide important factors that guide drug design. Based on these factors, six new FTPDDs have been designed. Using molecular docking experiments to determine the properties of new derivatives, the new drug was ultimately successfully designed.</p
Enhancing Protein Solubility via Glycosylation: From Chemical Synthesis to Machine Learning Predictions
Glycosylation is a valuable tool for modulating protein
solubility;
however, the lack of reliable research strategies has impeded efficient
progress in understanding and applying this modification. This study
aimed to bridge this gap by investigating the solubility of a model
glycoprotein molecule, the carbohydrate-binding module (CBM), through
a two-stage process. In the first stage, an approach involving chemical
synthesis, comparative analysis, and molecular dynamics simulations
of a library of glycoforms was employed to elucidate the effect of
different glycosylation patterns on solubility and the key factors
responsible for the effect. In the second stage, a predictive mathematical
formula, innovatively harnessing machine learning algorithms, was
derived to relate solubility to the identified key factors and accurately
predict the solubility of the newly designed glycoforms. Demonstrating
feasibility and effectiveness, this two-stage approach offers a valuable
strategy for advancing glycosylation research, especially for the
discovery of glycoforms with increased solubility