Search CORE

849 research outputs found

Gene Co-expression Network and Copy Number Variation Analyses Identify Transcription Factors Associated With Multiple Myeloma Progression

Author: Abu Zaid Mohammad
Han Zhi
Huang Kun
Huang Zhi
Johnson Travis S.
Xiang Shunian
Yu Christina Y.
Zhan Xiaohui
Publication venue: 'Frontiers Media SA'
Publication date: 17/05/2019
Field of study

Multiple myeloma (MM) has two clinical precursor stages of disease: monoclonal gammopathy of undetermined significance (MGUS) and smoldering multiple myeloma (SMM). However, the mechanism of progression is not well understood. Because gene co-expression network analysis is a well-known method for discovering new gene functions and regulatory relationships, we utilized this framework to conduct differential co-expression analysis to identify interesting transcription factors (TFs) in two publicly available datasets. We then used copy number variation (CNV) data from a third public dataset to validate these TFs. First, we identified co-expressed gene modules in two publicly available datasets each containing three conditions: normal, MGUS, and SMM. These modules were assessed for condition-specific gene expression, and then enrichment analysis was conducted on condition-specific modules to identify their biological function and upstream TFs. TFs were assessed for differential gene expression between normal and MM precursors, then validated with CNV analysis to identify candidate genes. Functional enrichment analysis reaffirmed known functional categories in MM pathology, the main one relating to immune function. Enrichment analysis revealed a handful of differentially expressed TFs between normal and either MGUS or SMM in gene expression and/or CNV. Overall, we identified four genes of interest (MAX, TCF4, ZNF148, and ZNF281) that aid in our understanding of MM initiation and progression

IUPUIScholarWorks

FigShare

Recommended from our members

Cirmtuzumab inhibits Wnt5a-induced Rac1 activation in chronic lymphocytic leukemia treated with ibrutinib.

Author: Chen L
Chen Y
Choi MY
Cui B
Kipps TJ
Rassenti LZ
Widhopf Ii GF
Wu Christina
Yu J
Zhang L
Publication venue: eScholarship, University of California
Publication date: 01/06/2017
Field of study

Signaling via the B cell receptor (BCR) plays an important role in the pathogenesis and progression of chronic lymphocytic leukemia (CLL). This is underscored by the clinical effectiveness of ibrutinib, an inhibitor of Bruton's tyrosine kinase (BTK) that can block BCR-signaling. However, ibrutinib cannot induce complete responses (CR) or durable remissions without continued therapy, suggesting alternative pathways also contribute to CLL growth/survival that are independent of BCR-signaling. ROR1 is a receptor for Wnt5a, which can promote activation of Rac1 to enhance CLL-cell proliferation and survival. In this study, we found that CLL cells of patients treated with ibrutinib had activated Rac1. Moreover, Wnt5a could induce Rac1 activation and enhance proliferation of CLL cells treated with ibrutinib at concentrations that were effective in completely inhibiting BTK and BCR-signaling. Wnt5a-induced Rac1 activation could be blocked by cirmtuzumab (UC-961), an anti-ROR1 mAb. We found that treatment with cirmtuzumab and ibrutinib was significantly more effective than treatment with either agent alone in clearing leukemia cells in vivo. This study indicates that cirmtuzumab may enhance the activity of ibrutinib in the treatment of patients with CLL or other ROR1+ B-cell malignancies

eScholarship - University of California

TPQCI: A topology potential-based method to quantify functional influence of copy number variations

Author: Huang Kun
Liu Yusong
Ye Xiufen
Yu Christina Y.
Zhan Xiaohui
Zhang Jie
Publication venue: 'Elsevier BV'
Publication date: 01/08/2021
Field of study

Copy number variation (CNV) is a major type of chromosomal structural variation that play important roles in many diseases including cancers. Due to genome instability, a large number of CNV events can be detected in diseases such as cancer. Therefore, it is important to identify the functionally important CNVs in diseases, which currently still poses a challenge in genomics. One of the critical steps to solve the problem is to define the influence of CNV. In this paper, we provide a topology potential based method, TPQCI, to quantify this kind of influence by integrating statistics, gene regulatory associations, and biological function information. We used this metric to detect functionally enriched genes on genomic segments with CNV in breast cancer and multiple myeloma and discovered biological functions influenced by CNV. Our results demonstrate that, by using our proposed TPQCI metric, we can detect disease-specific genes that are influenced by CNVs. Source codes of TPQCI are provided in Github (https://github.com/usos/TPQCI)

IUPUIScholarWorks

Improving accuracy of GPT-3/4 results on biomedical data using a retrieval-augmented language model

Author: Guan Meijian
Hamadeh Hisham
Higgs Brandon W
Karagoz Kubra
Si Han
Soong David
Sridhar Sriram
Sá Ana Caroline Costa
Wagner Jan-Samuel
Yu Christina Y
Publication venue
Publication date: 30/05/2023
Field of study

Large language models (LLMs) have made significant advancements in natural language processing (NLP). Broad corpora capture diverse patterns but can introduce irrelevance, while focused corpora enhance reliability by reducing misleading information. Training LLMs on focused corpora poses computational challenges. An alternative approach is to use a retrieval-augmentation (RetA) method tested in a specific domain. To evaluate LLM performance, OpenAI's GPT-3, GPT-4, Bing's Prometheus, and a custom RetA model were compared using 19 questions on diffuse large B-cell lymphoma (DLBCL) disease. Eight independent reviewers assessed responses based on accuracy, relevance, and readability (rated 1-3). The RetA model performed best in accuracy (12/19 3-point scores, total=47) and relevance (13/19, 50), followed by GPT-4 (8/19, 43; 11/19, 49). GPT-4 received the highest readability scores (17/19, 55), followed by GPT-3 (15/19, 53) and the RetA model (11/19, 47). Prometheus underperformed in accuracy (34), relevance (32), and readability (38). Both GPT-3.5 and GPT-4 had more hallucinations in all 19 responses compared to the RetA model and Prometheus. Hallucinations were mostly associated with non-existent references or fabricated efficacy data. These findings suggest that RetA models, supplemented with domain-specific corpora, may outperform general-purpose LLMs in accuracy and relevance within specific domains. However, this evaluation was limited to specific questions and metrics and may not capture challenges in semantic search and other NLP tasks. Further research will explore different LLM architectures, RetA methodologies, and evaluation methods to assess strengths and limitations more comprehensively

arXiv.org e-Print Archive

Prenatal Vitamin D Supplementation and Child Respiratory Health: A Randomised Controlled Trial

Author: A Merewood
AA Litonjua
Adrian R. Martineau
AE Millen
B Novakovic
BW Hollis
CA Camargo Jr
CA Camargo Jr
Chris J. Griffiths
Christina Yu
CK Yu
CR Gale
D Bikle
DD Johnson
DR Murdoch
E Cremers
E Hypponen
F Groenman
FD Martinez
G Devereux
GR Zosky
Heinz Fehrenbach
Jane C. Kirkby
Janet Stocks
John O. Warner
M Erkkola
MF Holick
MJ Ege
N Beydon
Richard Hooper
Robert J. Boyle
Seif O. Shaheen
SH Pearce
Sheree Poulton
Stephen Robinson
Stephen T. Goldring
SW Turner
X Liu
Y Miyake
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 07/05/2013
Field of study

PMCID: PMC3691177This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

UCL Discovery

Spiral - Imperial College Digital Repository

Queen Mary Research Online

FigShare

Gene Co-expression Network and Copy Number Variation Analyses Identify Transcription Factors Associated With Multiple Myeloma Progression

Author: Christina Y. Yu
Christina Y. Yu
Kun Huang
Kun Huang
Mohammad Abu Zaid
Shunian Xiang
Shunian Xiang
Travis S. Johnson
Travis S. Johnson
Xiaohui Zhan
Xiaohui Zhan
Zhi Han
Zhi Han
Zhi Huang
Zhi Huang
Publication venue: 'Frontiers Media SA'
Publication date: 01/05/2019
Field of study

Directory of Open Access Journals

Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations

Author: Cao Sha
Cheng Jun
Han Zhi
Helm Bryan
Huang Kun
Huang Zhi
Johnson Travis S.
Rizkalla Maher
Salama Paul
Xiang Shunian
Yu Christina Y.
Zhan Xiaohui
Zhang Chi
Zhang Jie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Background: Recent advances in kernel-based Deep Learning models have introduced a new era in medical research. Originally designed for pattern recognition and image processing, Deep Learning models are now applied to survival prognosis of cancer patients. Specifically, Deep Learning versions of the Cox proportional hazards models are trained with transcriptomic data to predict survival outcomes in cancer patients. Methods: In this study, a broad analysis was performed on TCGA cancers using a variety of Deep Learning-based models, including Cox-nnet, DeepSurv, and a method proposed by our group named AECOX (AutoEncoder with Cox regression network). Concordance index and p-value of the log-rank test are used to evaluate the model performances. Results: All models show competitive results across 12 cancer types. The last hidden layers of the Deep Learning approaches are lower dimensional representations of the input data that can be used for feature reduction and visualization. Furthermore, the prognosis performances reveal a negative correlation between model accuracy, overall survival time statistics, and tumor mutation burden (TMB), suggesting an association among overall survival time, TMB, and prognosis prediction accuracy. Conclusions: Deep Learning based algorithms demonstrate superior performances than traditional machine learning based models. The cancer prognosis results measured in concordance index are indistinguishable across models while are highly variable across cancers. These findings shedding some light into the relationships between patient characteristics and survival learnability on a pan-cancer level

IUPUIScholarWorks

A pan-kidney cancer study identifies subtype specific perturbations on pathways with potential drivers in renal cell carcinoma

Author: Huang Kun
Liu Yusong
Ni Dong
Wang Tian‑Fu
Yu Christina Y.
Zhan Xiaohui
Zhang Jie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/12/2020
Field of study

Background: Renal cell carcinoma (RCC) is a complex disease and is comprised of several histological subtypes, the most frequent of which are clear cell renal cell carcinoma (ccRCC), papillary renal cell carcinoma (PRCC) and chromophobe renal cell carcinoma (ChRCC). While lots of studies have been performed to investigate the molecular characterizations of different subtypes of RCC, our knowledge regarding the underlying mechanisms are still incomplete. As molecular alterations are eventually reflected on the pathway level to execute certain biological functions, characterizing the pathway perturbations is crucial for understanding tumorigenesis and development of RCC. Methods: In this study, we investigated the pathway perturbations of various RCC subtype against normal tissue based on differential expressed genes within a certain pathway. We explored the potential upstream regulators of subtype-specific pathways with Ingenuity Pathway Analysis (IPA). We also evaluated the relationships between subtype-specific pathways and clinical outcome with survival analysis. Results: In this study, we carried out a pathway-based analysis to explore the mechanisms of various RCC subtypes with TCGA RNA-seq data. Both commonly altered pathways and subtype-specific pathways were detected. To identify the distinctive characteristics of each subtype, we focused on subtype-specific perturbed pathways. Specifically, we observed that some of the altered pathways were regulated by several recurrent upstream regulators which presenting different expression patterns among distinct RCC subtypes. We also noticed that a large number of perturbed pathways were controlled by the subtype-specific upstream regulators. Moreover, we also evaluated the relationships between perturbed pathways and clinical outcome. Prognostic pathways were identified and their roles in tumor development and progression were inferred. Conclusions: In summary, we evaluated the relationships among pathway perturbations, upstream regulators and clinical outcome for differential subtypes in RCC. We hypothesized that the alterations of common upstream regulators as well as subtype-specific upstream regulators work together to affect the downstream pathway perturbations and drive cancer initialization and prognosis. Our findings not only increase our understanding of the mechanisms of various RCC subtypes, but also provide targets for personalized therapeutic intervention

IUPUIScholarWorks

TPSC: a module detection method based on topology potential and spectral clustering in weighted networks and its application in gene co-expression module discovery

Author: Feng Weixing
Hou Jie
Huang Kun
Liu Yusong
Shao Wei
Ye Xiufen
Yu Christina Y.
Zhang Jie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2021
Field of study

Background: Gene co-expression networks are widely studied in the biomedical field, with algorithms such as WGCNA and lmQCM having been developed to detect co-expressed modules. However, these algorithms have limitations such as insufficient granularity and unbalanced module size, which prevent full acquisition of knowledge from data mining. In addition, it is difficult to incorporate prior knowledge in current co-expression module detection algorithms. Results: In this paper, we propose a novel module detection algorithm based on topology potential and spectral clustering algorithm to detect co-expressed modules in gene co-expression networks. By testing on TCGA data, our novel method can provide more complete coverage of genes, more balanced module size and finer granularity than current methods in detecting modules with significant overall survival difference. In addition, the proposed algorithm can identify modules by incorporating prior knowledge. Conclusion: In summary, we developed a method to obtain as much as possible information from networks with increased input coverage and the ability to detect more size-balanced and granular modules. In addition, our method can integrate data from different sources. Our proposed method performs better than current methods with complete coverage of input genes and finer granularity. Moreover, this method is designed not only for gene co-expression networks but can also be applied to any general fully connected weighted network

IUPUIScholarWorks

Directory of Open Access Journals

SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer

Author: Bryan Helm
Christina Y. Yu
Christina Y. Yu
Jie Zhang
Kun Huang
Kun Huang
Kun Huang
Maher Rizkalla
Paul Salama
Shunian Xiang
Shunian Xiang
Travis S. Johnson
Travis S. Johnson
Xiaohui Zhan
Xiaohui Zhan
Zhi Han
Zhi Han
Zhi Huang
Zhi Huang
Zhi Huang
Publication venue: 'Frontiers Media SA'
Publication date: 01/03/2019
Field of study

Improved cancer prognosis is a central goal for precision health medicine. Though many models can predict differential survival from data, there is a strong need for sophisticated algorithms that can aggregate and filter relevant predictors from increasingly complex data inputs. In turn, these models should provide deeper insight into which types of data are most relevant to improve prognosis. Deep Learning-based neural networks offer a potential solution for both problems because they are highly flexible and account for data complexity in a non-linear fashion. In this study, we implement Deep Learning-based networks to determine how gene expression data predicts Cox regression survival in breast cancer. We accomplish this through an algorithm called SALMON (Survival Analysis Learning with Multi-Omics Neural Networks), which aggregates and simplifies gene expression data and cancer biomarkers to enable prognosis prediction. The results revealed improved performance when more omics data were used in model construction. Rather than use raw gene expression values as model inputs, we innovatively use eigengene modules from the result of gene co-expression network analysis. The corresponding high impact co-expression modules and other omics data are identified by feature selection technique, then examined by conducting enrichment analysis and exploiting biological functions, escalated the interpretation of input feature from gene level to co-expression modules level. Our study shows the feasibility of discovering breast cancer related co-expression modules, sketch a blueprint of future endeavors on Deep Learning-based survival analysis. SALMON source code is available at https://github.com/huangzhii/SALMON/

Directory of Open Access Journals