Search CORE

4 research outputs found

The complete mitochondrial genome and phylogenetic analysis of <i>Polythlipta liquidalis</i> Leech, 1889 (Crambidae: spilomelinae)

Author: Hua Rong (4163989)
Lingdi Yang (13169454)
Ruihan Wang (234022)
Yanyao Zhang (11565751)
Yu Wang (12152)
Publication venue
Publication date: 05/10/2023
Field of study

The complete mitochondrial genome of Polythlipta liquidalis Leech, 1889 was sequenced and annotated in this study, which was the first reported complete mitogenome of the genus Polythlipta. The mitogenome of P. liquidalis is 15,305 bp in length and was predicted to encode 37 typical mitochondrial genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), 2 ribosomal RNA genes (rRNAs), and one major non-coding A-T rich region. The maximum likelihood phylogenetic analysis based on the 13 PCGs was constructed, including P. liquidalis and 15 related Spilomelinae species, using Ostrinia furnacalis as the outgroup. The result showed that P. liquidalis is grouped with Sinomphisa plagialis. These data will serve as a molecular resource for species identification of P. liquidalis and become a valuable resource for a range of genetic, functional, evolutionary and comparative genomic studies on members of Spilomelinae.</p

The Francis Crick Institute

Table1_Study of PARP inhibitors for breast cancer based on enhanced multiple kernel function SVR with PSO.docx

Author: Haohan Xue (17892128)
Peijian Zhang (7798271)
Ruihan Wang (234022)
Ruixuan Zhang (10491569)
Xudong Yan (6338561)
Publication venue
Publication date: 02/02/2024
Field of study

PARP1 is one of six enzymes required for the highly error-prone DNA repair pathway microhomology-mediated end joining (MMEJ) and needs to be inhibited when over-expressed. In order to study the PARP1 inhibitory effect of fused tetracyclic or pentacyclic dihydrodiazepinoindolone derivatives (FTPDDs) by quantitative structure-activity relationship technique, six models were established by four kinds of methods, heuristic method, gene expression programming, random forester, and support vector regression with single, double, and triple kernel function respectively. The single, double, and triple kernel functions were RBF kernel function, the integration of RBF and polynomial kernel functions, and the integration of RBF, polynomial, and linear kernel functions respectively. The problem of multi-parameter optimization introduced in the support vector regression model was solved by the particle swarm optimization algorithm. Among the models, the model established by support vector regression with triple kernel function, in which the optimal R2 and RMSE of training set and test set were 0.9353, 0.9348 and 0.0157, 0.0288, and R2cv of training set and test set were 0.9090 and 0.8971, shows the strongest prediction ability and robustness. The method of support vector regression with triple kernel function is a great promotion in the field of quantitative structure-activity relationship, which will contribute a lot to designing and screening new drug molecules. The information contained in the model can provide important factors that guide drug design. Based on these factors, six new FTPDDs have been designed. Using molecular docking experiments to determine the properties of new derivatives, the new drug was ultimately successfully designed.</p

The Francis Crick Institute

DataSheet1_Study of PARP inhibitors for breast cancer based on enhanced multiple kernel function SVR with PSO.ZIP

Author: Haohan Xue (17892128)
Peijian Zhang (7798271)
Ruihan Wang (234022)
Ruixuan Zhang (10491569)
Xudong Yan (6338561)
Publication venue
Publication date: 02/02/2024
Field of study

The Francis Crick Institute

Enhancing Protein Solubility via Glycosylation: From Chemical Synthesis to Machine Learning Predictions

Author: Bo Ma (284545)
Boxue Tian (1334118)
Hedi Chen (18339834)
Jinyuan Gong (18339837)
Meng Li (79487)
Ruihan Wang (234022)
Shiying Shang (1551991)
Wenqiang Liu (1282782)
Xin Li (51274)
Xiuli Wei (4636498)
Yajing Zhang (1760800)
Yani Wang (4841685)
Yaohao Li (2581240)
Zhongping Tan (1867627)
Publication venue
Publication date: 10/04/2024
Field of study

Glycosylation is a valuable tool for modulating protein solubility; however, the lack of reliable research strategies has impeded efficient progress in understanding and applying this modification. This study aimed to bridge this gap by investigating the solubility of a model glycoprotein molecule, the carbohydrate-binding module (CBM), through a two-stage process. In the first stage, an approach involving chemical synthesis, comparative analysis, and molecular dynamics simulations of a library of glycoforms was employed to elucidate the effect of different glycosylation patterns on solubility and the key factors responsible for the effect. In the second stage, a predictive mathematical formula, innovatively harnessing machine learning algorithms, was derived to relate solubility to the identified key factors and accurately predict the solubility of the newly designed glycoforms. Demonstrating feasibility and effectiveness, this two-stage approach offers a valuable strategy for advancing glycosylation research, especially for the discovery of glycoforms with increased solubility

The Francis Crick Institute