Search CORE

242 research outputs found

Genomic Biomarkers for Personalized Medicine in Breast Cancer

Author: Huixiao Hong
Minjun Chen
Weida Tong
Publication venue
Publication date: 20/11/2012
Field of study

Breast cancer is the most common malignant disease in Western women. Historically, breast cancer was perceived as a single disease with various clinicalpathological features, and therefore, “one drug fits all” approaches drove the treatment regimens. The advent of genomics studies has led to a new paradigm in which breast cancer is heterogeneous consisting of different diseases from the same organ site. For example, gene expression profiling analysis revealed that estrogen receptor (ER)-positive and ER negative breast cancer are two distinct diseases with different risk factors, clinical presentations, outcomes, and responses to systemic therapies. Consequently, the new paradigm demands a personalized strategy in cancer medicine, in which the selection of treatment regimens for each cancer patient will largely rely on assessment by predictive biomarkers and study of the anatomical and pathological features of the cancer

ZENODO

Editorial: Deep Learning for Toxicity and Disease Prediction

Author: Chen Minjun
Gong Ping
Zhang Chaoyang
Publication venue: The Aquila Digital Community
Publication date: 26/02/2020
Field of study

Aquila Digital Community (University of Southern Mississippi, USM)

An Efficient Source Model Selection Framework in Model Databases

Author: Chen Lu
Du Yuntao
Gao Yunjun
Yang Keyu
Zhao Minjun
Publication venue
Publication date: 24/11/2021
Field of study

With the explosive increase of big data, training a Machine Learning (ML) model becomes a computation-intensive workload, which would take days or even weeks. Thus, reusing an already trained model has received attention, which is called transfer learning. Transfer learning avoids training a new model from scratch by transferring knowledge from a source task to a target task. Existing transfer learning methods mostly focus on how to improve the performance of the target task through a specific source model, and assume that the source model is given. Although many source models are available, it is difficult for data scientists to select the best source model for the target task manually. Hence, how to efficiently select a suitable source model in a model database for model reuse is an interesting but unsolved problem. In this paper, we propose SMS, an effective, efficient, and flexible source model selection framework. SMS is effective even when the source and target datasets have significantly different data labels, and is flexible to support source models with any type of structure, and is efficient to avoid any training process. For each source model, SMS first vectorizes the samples in the target dataset into soft labels by directly applying this model to the target dataset, then uses Gaussian distributions to fit for clusters of soft labels, and finally measures the distinguishing ability of the source model using Gaussian mixture-based metric. Moreover, we present an improved SMS (I-SMS), which decreases the output number of the source model. I-SMS can significantly reduce the selection time while retaining the selection performance of SMS. Extensive experiments on a range of practical model reuse workloads demonstrate the effectiveness and efficiency of SMS

arXiv.org e-Print Archive

Computational models for predicting liver toxicity in the deep learning era

Author: Fahad Mostafa
Fahad Mostafa
Minjun Chen
Publication venue: Frontiers Media S.A.
Publication date: 01/01/2024
Field of study

Drug-induced liver injury (DILI) is a severe adverse reaction caused by drugs and may result in acute liver failure and even death. Many efforts have centered on mitigating risks associated with potential DILI in humans. Among these, quantitative structure-activity relationship (QSAR) was proven to be a valuable tool for early-stage hepatotoxicity screening. Its advantages include no requirement for physical substances and rapid delivery of results. Deep learning (DL) made rapid advancements recently and has been used for developing QSAR models. This review discusses the use of DL in predicting DILI, focusing on the development of QSAR models employing extensive chemical structure datasets alongside their corresponding DILI outcomes. We undertake a comprehensive evaluation of various DL methods, comparing with those of traditional machine learning (ML) approaches, and explore the strengths and limitations of DL techniques regarding their interpretability, scalability, and generalization. Overall, our review underscores the potential of DL methodologies to enhance DILI prediction and provides insights into future avenues for developing predictive models to mitigate DILI risk in humans

Directory of Open Access Journals

SparDL: Distributed Deep Learning Training with Efficient Sparse Communication

Author: Chen Lu
Gao Yunjun
Liu Qing
Mao Yuren
Yin Yichen
Zhao Minjun
Publication venue
Publication date: 23/02/2024
Field of study

Top-k sparsification has recently been widely used to reduce the communication volume in distributed deep learning. However, due to the Sparse Gradient Accumulation (SGA) dilemma, the performance of top-k sparsification still has limitations. Recently, a few methods have been put forward to handle the SGA dilemma. Regrettably, even the state-of-the-art method suffers from several drawbacks, e.g., it relies on an inefficient communication algorithm and requires extra transmission steps. Motivated by the limitations of existing methods, we propose a novel efficient sparse communication framework, called SparDL. Specifically, SparDL uses the Spar-Reduce-Scatter algorithm, which is based on an efficient Reduce-Scatter model, to handle the SGA dilemma without additional communication operations. Besides, to further reduce the latency cost and improve the efficiency of SparDL, we propose the Spar-All-Gather algorithm. Moreover, we propose the global residual collection algorithm to ensure fast convergence of model training. Finally, extensive experiments are conducted to validate the superiority of SparDL

arXiv.org e-Print Archive

Selecting a single model or combining multiple models for microarray-based classifier development? – A comparative analysis based on large and diverse datasets generated from the MAQC-II project

Author: Chen Minjun
Fang Hong
Kelly Reagan
Perkins Roger
Shi Leming
Tong Weida
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Genomic biomarkers play an increasing role in both preclinical and clinical application. Development of genomic biomarkers with microarrays is an area of intensive investigation. However, despite sustained and continuing effort, developing microarray-based predictive models (i.e., genomics biomarkers) capable of reliable prediction for an observed or measured outcome (i.e., endpoint) of unknown samples in preclinical and clinical practice remains a considerable challenge. No straightforward guidelines exist for selecting a single model that will perform best when presented with unknown samples. In the second phase of the MicroArray Quality Control (MAQC-II) project, 36 analysis teams produced a large number of models for 13 preclinical and clinical endpoints. Before external validation was performed, each team nominated one model per endpoint (referred to here as 'nominated models') from which MAQC-II experts selected 13 'candidate models' to represent the best model for each endpoint. Both the nominated and candidate models from MAQC-II provide benchmarks to assess other methodologies for developing microarray-based predictive models. Methods We developed a simple ensemble method by taking a number of the top performing models from cross-validation and developing an ensemble model for each of the MAQC-II endpoints. We compared the ensemble models with both nominated and candidate models from MAQC-II using blinded external validation. Results For 10 of the 13 MAQC-II endpoints originally analyzed by the MAQC-II data analysis team from the National Center for Toxicological Research (NCTR), the ensemble models achieved equal or better predictive performance than the NCTR nominated models. Additionally, the ensemble models had performance comparable to the MAQC-II candidate models. Most ensemble models also had better performance than the nominated models generated by five other MAQC-II data analysis teams that analyzed all 13 endpoints. Conclusions Our findings suggest that an ensemble method can often attain a higher average predictive performance in an external validation set than a corresponding “optimized” model method. Using an ensemble method to determine a final model is a potentially important supplement to the good modeling practices recommended by the MAQC-II project for developing microarray-based genomic biomarkers.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Selective Adsorption of Ionic Species Using Macroporous Monodispersed Polyethylene Glycol Diacrylate/Acrylic Acid Microgels with Tunable Negative Charge

Author: Bolognesi Guido
Chen Minjun
Kumrić Ksenija
Prodanović Radivoje
Thacker Conner
Vladisavljević Goran T.
Publication venue
Publication date: 01/01/2023
Field of study

In this work, the possibility of fabricating composite magneto-optical ceramics by electrophoretic deposition (EPD) of nanopowders and high-temperature vacuum sintering of the compacts was investigated. Holmium oxide was chosen as a magneto-optical material for the study because of its transparency in the mid-IR range. Nanopowders of magneto-optical (Ho0.95La0.05)2O3 (HoLa) material were made by self-propagating high-temperature synthesis. Nanopowders of (Y0.9La0.1)2O3 (YLa) were made by laser synthesis for an inactive matrix. The process of formation of one- and two-layer compacts by EPD of the nanopowders from alcohol suspensions was studied in detail. Acetylacetone was shown to be a good dispersant to obtain alcohol suspensions of the nanopowders, characterized by high zeta potential values (+29–+80 mV), and to carry out a stable EPD process. One-layer compacts were made from the HoLa and YLa nanopowders with a density of 30–43%. It was found out that the introduction of polyvinyl butyral (PVB) into the suspension leads to a decrease in the mass and thickness of the green bodies deposited, but does not significantly affect their density. The possibility of making two-layer (YLa/HoLa) compacts with a thickness of up to 2.6 mm and a density of up to 46% was demonstrated. Sintering such compacts in a vacuum at a temperature of 1750 °C for 10 h leads to the formation of ceramics with a homogeneous boundary between the YLa/HoLa layers and a thickness of the interdiffused ion layer of about 30 μm

Faculty of Chemistry Repository - Cherry

Repository of the Vinča Nuclear Institute (VinaR)

Cuproptosis-related gene FDX1 as a prognostic biomarker for kidney renal clear cell carcinoma correlates with immune checkpoints and immune cell infiltration

Author: Haixin Chen
Minjun Lou
Tingting Chen
Yimin Yao
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2023
Field of study

Background: Kidney renal clear cell carcinoma (KIRC) is not sensitive to radiotherapy and chemotherapy, and only some KIRC patients can benefit from immunotherapy and targeted therapy. Cuproptosis is a new mechanism of cell death, which is closely related to tumor progression, prognosis and immunity. The identification of prognostic markers related to cuproptosis in KIRC may provide targets for treatment and improve the prognosis of KIRC patients.Methods: Ten cuproptosis-related genes were analyzed for differential expression in KIRC-TCGA and a prognostic model was constructed. Nomogram diagnostic model was used to screen independent prognostic molecules. The screened molecules were verified in multiple datasets (GSE36895 and GSE53757), and in KIRC tumor tissues by RT-PCR and immunohistochemistry (IHC). Clinical correlation of cuproptosis-related independent prognostic molecules was analyzed. According to the molecular expression, the two groups were divided into high and low expression groups, and the differences of immune checkpoint and tumor infiltrating lymphocytes (TILs) between the two groups were compared by EPIC algorithm. The potential Immune checkpoint blocking (ICB) response of high and low expression groups was predicted by the “TIDE” algorithm.Results: FDX1 and DLAT were protective factors, while CDKN2A was a risk factor. FDX1 was an independent prognostic molecule by Nomogram, and low expressed in tumor tissues compared with adjacent tissues (p < 0.05). FDX1 was positively correlated with CD274, HAVCR2, PDCD1LG2, and negatively correlated with CTLA4, LAG3, and PDCD1. The TIDE score of low-FDX1 group was higher than that of high-FDX1 group. The abundance of CD4+ T cells, CD8+ T cells and Endothelial cells in FDX1-low group was lower than that in FDX1-high group (p < 0.05).Conclusion: FDX1, as a key cuproptosis-related gene, was also an independent prognostic molecule of KIRC. FDX1 might become an interesting biomarker and potential therapeutic target for KIRC

Directory of Open Access Journals

PubMed Central