Search CORE

12,020 research outputs found

Literature-aided interpretation of gene expression data with the weighted global test

Author: Dunnen J. den
Goeman J.J.
Hettne K.M.
Hoen P.A.C. 't
Jelier R.
Schuemie M.J.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2011
Field of study

Most methods for the interpretation of gene expression profiling experiments rely on the categorization of genes, as provided by the Gene Ontology (GO) and pathway databases. Due to the manual curation process, such databases are never up-to-date and tend to be limited in focus and coverage. Automated literature mining tools provide an attractive, alternative approach. We review how they can be employed for the interpretation of gene expression profiling experiments. We illustrate that their comprehensive scope aids the interpretation of data from domains poorly covered by GO or alternative databases, and allows for the linking of gene expression with diseases, drugs, tissues and other types of concepts. A framework for proper statistical evaluation of the associations between gene expression values and literature concepts was lacking and is now implemented in a weighted extension of global test. The weights are the literature association scores and reflect the importance of a gene for the concept of interest. In a direct comparison with classical GO-based gene sets, we show that use of literature-based associations results in the identification of much more specific GO categories. We demonstrate the possibilities for linking of gene expression data to patient survival in breast cancer and the action and metabolism of drugs. Coupling with online literature mining tools ensures transparency and allows further study of the identified associations. Literature mining tools are therefore powerful additions to the toolbox for the interpretation of high-throughput genomics data.UB – Publicatie

EUR Research Repository

Leiden University Scholary Publications

Quantitative proteomics in resected renal cancer tissue for biomarker discovery and profiling

Author: Atrih A.
Barton G.
Bray S. E.
Fleming S.
Huang J. T.-J.
Lamont D. J.
Mudaliar M. A. V.
Nabi G.
Zakikhani P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Background: Proteomics-based approaches for biomarker discovery are promising strategies used in cancer research. We present state-of-art label-free quantitative proteomics method to assess proteome of renal cell carcinoma (RCC) compared with noncancer renal tissues. Methods: Fresh frozen tissue samples from eight primary RCC lesions and autologous adjacent normal renal tissues were obtained from surgically resected tumour-bearing kidneys. Proteins were extracted by complete solubilisation of tissues using filter-aided sample preparation (FASP) method. Trypsin digested proteins were analysed using quantitative label-free proteomics approach followed by data interpretation and pathways analysis. Results: A total of 1761 proteins were identified and quantified with high confidence (MASCOT ion score threshold of 35 and P-value <0.05). Of these, 596 proteins were identified as differentially expressed between cancer and noncancer tissues. Two upregulated proteins in tumour samples (adipose differentiation-related protein and Coronin 1A) were further validated by immunohistochemistry. Pathway analysis using IPA, KOBAS 2.0, DAVID functional annotation and FLink tools showed enrichment of many cancer-related biological processes and pathways such as oxidative phosphorylation, glycolysis and amino acid synthetic pathways. Conclusions: Our study identified a number of differentially expressed proteins and pathways using label-free proteomics approach in RCC compared with normal tissue samples. Two proteins validated in this study are the focus of on-going research in a large cohort of patients.</p&gt

Crossref

PubMed Central

Enlighten

University of Dundee Online Publications

Infectious Disease Ontology

Technological developments have resulted in tremendous increases in the volume and diversity of the data and information that must be processed in the course of biomedical and clinical research and practice. Researchers are at the same time under ever greater pressure to share data and to take steps to ensure that data resources are interoperable. The use of ontologies to annotate data has proven successful in supporting these goals and in providing new possibilities for the automated processing of data and information. In this chapter, we describe different types of vocabulary resources and emphasize those features of formal ontologies that make them most useful for computational applications. We describe current uses of ontologies and discuss future goals for ontology-based computing, focusing on its use in the field of infectious diseases. We review the largest and most widely used vocabulary resources relevant to the study of infectious diseases and conclude with a description of the Infectious Disease Ontology (IDO) suite of interoperable ontology modules that together cover the entire infectious disease domain

PhilPapers

CiteSeerX

Crossref

Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data

Author: Boorsma J. (Jeffrey)
Dartel D.A.M. (Dorien A M) van
Goeman J.J. (Jelle)
Hettne K.M. (Kristina)
Jong E.C. (Esther) de
Kleinjans J. (Jos)
Kors J.A. (Jan)
Piersma A.H. (Aldert)
Stierum R.H. (Rob)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/01/2013
Field of study

Background: Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles. Methods. We created 30,211 chemical response-specific gene sets for human and mouse by next-gen TM, and derived 1,189 (human) and 588 (mouse) gene sets from the Comparative Toxicogenomics Database (CTD). We tested for significant differential expression (SDE) (false discovery rate -corrected p-values < 0.05) of the next-gen TM-derived gene sets and the CTD-derived gene sets in gene expression (GE) data sets of five chemicals (from experimental models). We tested for SDE of gene sets for six fibrates in a peroxisome proliferator-activated receptor alpha (PPARA) knock-out GE dataset and compared to results from the Connectivity Map. We tested for SDE of 319 next-gen TM-derived gene sets for environmental toxicants in three GE data sets of triazoles, and tested for SDE of 442 gene sets associated with embryonic structures. We compared the gene sets to triazole effects seen in the Whole Embryo Culture (WEC), and used principal component analysis (PCA) to discriminate triazoles from other chemicals. Results: Next-gen TM-derived gene sets matching the chemical treatment were significantly altered in three GE data sets, and the corresponding CTD-derived gene sets were significantly altered in five GE data sets. Six next-gen TM-derived and four CTD-derived fibrate gene sets were significantly altered in the PPARA knock-out GE dataset. None of the fibrate signatures in cMap scored significant against the PPARA GE signature. 33 environmental toxicant gene sets were significantly altered in the triazole GE data sets. 21 of these toxicants had a similar toxicity pattern as the triazoles. We confirmed embryotoxic effects, and discriminated triazoles from other chemicals. Conclusions: Gene set analysis with next-gen TM-derived chemical response-specific gene sets is a scalable method for identifying similarities in gene responses to other chemicals, from which one may infer potential mode of action and/or toxic effect

Erasmus University Digital Repository

Texture Analysis Platform for Imaging Biomarker Research

Author
Publication venue
Publication date: 01/01/2017
Field of study

abstract: The rate of progress in improving survival of patients with solid tumors is slow due to late stage diagnosis and poor tumor characterization processes that fail to effectively reflect the nature of tumor before treatment or the subsequent change in its dynamics because of treatment. Further advancement of targeted therapies relies on advancements in biomarker research. In the context of solid tumors, bio-specimen samples such as biopsies serve as the main source of biomarkers used in the treatment and monitoring of cancer, even though biopsy samples are susceptible to sampling error and more importantly, are local and offer a narrow temporal scope. Because of its established role in cancer care and its non-invasive nature imaging offers the potential to complement the findings of cancer biology. Over the past decade, a compelling body of literature has emerged suggesting a more pivotal role for imaging in the diagnosis, prognosis, and monitoring of diseases. These advances have facilitated the rise of an emerging practice known as Radiomics: the extraction and analysis of large numbers of quantitative features from medical images to improve disease characterization and prediction of outcome. It has been suggested that radiomics can contribute to biomarker discovery by detecting imaging traits that are complementary or interchangeable with other markers. This thesis seeks further advancement of imaging biomarker discovery. This research unfolds over two aims: I) developing a comprehensive methodological pipeline for converting diagnostic imaging data into mineable sources of information, and II) investigating the utility of imaging data in clinical diagnostic applications. Four validation studies were conducted using the radiomics pipeline developed in aim I. These studies had the following goals: (1 distinguishing between benign and malignant head and neck lesions (2) differentiating benign and malignant breast cancers, (3) predicting the status of Human Papillomavirus in head and neck cancers, and (4) predicting neuropsychological performances as they relate to Alzheimer’s disease progression. The long-term objective of this thesis is to improve patient outcome and survival by facilitating incorporation of routine care imaging data into decision making processes.Dissertation/ThesisDoctoral Dissertation Biomedical Informatics 201

ASU Digital Repository

Multi-lectin Affinity Chromatography and Quantitative Proteomic Analysis Reveal Differential Glycoform Levels between Prostate Cancer and Benign Prostatic Hyperplasia Sera.

Author: Adusumilli Ravali
Brooks James D
Kullolli Majlinda
Mallick Parag
Pitteri Sharon J
Tanimoto Cheylene
Totten Sarah M
Publication venue: eScholarship, University of California
Publication date: 01/04/2018
Field of study

Currently prostate-specific antigen is used for prostate cancer (PCa) screening, however it lacks the necessary specificity for differentiating PCa from other diseases of the prostate such as benign prostatic hyperplasia (BPH), presenting a clinical need to distinguish these cases at the molecular level. Protein glycosylation plays an important role in a number of cellular processes involved in neoplastic progression and is aberrant in PCa. In this study, we systematically interrogate the alterations in the circulating levels of hundreds of serum proteins and their glycoforms in PCa and BPH samples using multi-lectin affinity chromatography and quantitative mass spectrometry-based proteomics. Specific lectins (AAL, PHA-L and PHA-E) were used to target and chromatographically separate core-fucosylated and highly-branched protein glycoforms for analysis, as differential expression of these glycan types have been previously associated with PCa. Global levels of CD5L, CFP, C8A, BST1, and C7 were significantly increased in the PCa samples. Notable glycoform-specific alterations between BPH and PCa were identified among proteins CD163, C4A, and ATRN in the PHA-L/E fraction and among C4BPB and AZGP1 glycoforms in the AAL fraction. Despite these modest differences, substantial similarities in glycoproteomic profiles were observed between PCa and BPH sera

Crossref

Directory of Open Access Journals

eScholarship - University of California

Deep Learning Models For Biomedical Data Analysis

Author: Nwosu Lucy
Publication venue: Digital Commons @PVAMU
Publication date: 01/08/2023
Field of study

The field of biomedical data analysis is a vibrant area of research dedicated to extracting valuable insights from a wide range of biomedical data sources, including biomedical images and genomics data. The emergence of deep learning, an artificial intelligence approach, presents significant prospects for enhancing biomedical data analysis and knowledge discovery. This dissertation focused on exploring innovative deep-learning methods for biomedical image processing and gene data analysis. During the COVID-19 pandemic, biomedical imaging data, including CT scans and chest x-rays, played a pivotal role in identifying COVID-19 cases by categorizing patient chest x-ray outcomes as COVID-19-positive or negative. While supervised deep learning methods have effectively recognized COVID-19 patterns in chest x-ray datasets, the availability of annotated training data remains limited. To address this challenge, the thesis introduced a semi-supervised deep learning model named ssResNet, built upon the Residual Neural Network (ResNet) architecture. The model combines supervised and unsupervised paths, incorporating a weighted supervised loss function to manage data imbalance. The strategies to diminish prediction uncertainty in deep learning models for critical applications like medical image processing is explore. It achieves this through an ensemble deep learning model, integrating bagging deep learning and model calibration techniques. This ensemble model not only boosts biomedical image segmentation accuracy but also reduces prediction uncertainty, as validated on a comprehensive chest x-ray image segmentation dataset. Furthermore, the thesis introduced an ensemble model integrating Proformer and ensemble learning methodologies. This model constructs multiple independent Proformers for predicting gene expression, their predictions are combined through weighted averaging to generate final predictions. Experimental outcomes underscore the efficacy of this ensemble model in enhancing prediction performance across various metrics. In conclusion, this dissertation advances biomedical data analysis by harnessing the potential of deep learning techniques. It devises innovative approaches for processing biomedical images and gene data. By leveraging deep learning\u27s capabilities, this work paves the way for further progress in biomedical data analytics and its applications within clinical contexts. Index Terms- biomedical data analysis, COVID-19, deep learning, ensemble learning, gene data analytics, medical image segmentation, prediction uncertainty, Proformer, Residual Neural Network (ResNet), semi-supervised learning

Digital Commons @ PVAMU (Prairie View A&M Univ)

MR Imaging Radiomics Signatures for Predicting the Risk of Breast Cancer Recurrence as Given by Research Versions of MammaPrint, Oncotype DX, and PAM50 Gene Assays

Author: Burnside Elizabeth S.
Conzen Suzanne D.
Drukker Karen
Fan Cheng
Ganott Marie
Giger Maryellen L.
Hoadley Katherine A.
Huang Erich
Ji Yuan
Li Hui
Morris Elizabeth A.
Net Jose M.
Perou Charles M.
Sutton Elizabeth J.
Whitman Gary J.
Zhu Yitan
Publication venue
Publication date: 01/01/2016
Field of study

To investigate relationships between computer-extracted breast magnetic resonance (MR) imaging phenotypes with multigene assays of MammaPrint, Oncotype DX, and PAM50 to assess the role of radiomics in evaluating the risk of breast cancer recurrence

PubMed Central

Carolina Digital Repository

Multiscale, multimodal analysis of tumor heterogeneity in IDH1 mutant vs wild-type diffuse gliomas.

Author: Adkins Jonathan
Al-Kofahi Yousef
Barnholtz-Sloan Jill S
Berens Michael E
Byron Sara A
Cho Sanghee
Couce Marta
Cuyugan Lori
Devine Karen
Dinn Sean
Ginty Fiona
Graf John F
Halperin Rebecca F
Kiefer Jeffrey
Kim Seungchan
Liang Winnie S
McDonough Elizabeth
Nasser Sara
Nelson Sarah J
Ostrom Quinn
Phillips Joanna J
Prados Michael
Rusu Mirabela
Schyberg Shannon
Sloan Andrew E
Sood Anup
Wolansky Leo
Zavodszky Maria I
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Glioma is recognized to be a highly heterogeneous CNS malignancy, whose diverse cellular composition and cellular interactions have not been well characterized. To gain new clinical- and biological-insights into the genetically-bifurcated IDH1 mutant (mt) vs wildtype (wt) forms of glioma, we integrated data from protein, genomic and MR imaging from 20 treatment-naïve glioma cases and 16 recurrent GBM cases. Multiplexed immunofluorescence (MxIF) was used to generate single cell data for 43 protein markers representing all cancer hallmarks, Genomic sequencing (exome and RNA (normal and tumor) and magnetic resonance imaging (MRI) quantitative features (protocols were T1-post, FLAIR and ADC) from whole tumor, peritumoral edema and enhancing core vs equivalent normal region were also collected from patients. Based on MxIF analysis, 85,767 cells (glioma cases) and 56,304 cells (GBM cases) were used to generate cell-level data for 24 biomarkers. K-means clustering was used to generate 7 distinct groups of cells with divergent biomarker profiles and deconvolution was used to assign RNA data into three classes. Spatial and molecular heterogeneity metrics were generated for the cell data. All features were compared between IDH mt and IDHwt patients and were finally combined to provide a holistic/integrated comparison. Protein expression by hallmark was generally lower in the IDHmt vs wt patients. Molecular and spatial heterogeneity scores for angiogenesis and cell invasion also differed between IDHmt and wt gliomas irrespective of prior treatment and tumor grade; these differences also persisted in the MR imaging features of peritumoral edema and contrast enhancement volumes. A coherent picture of enhanced angiogenesis in IDHwt tumors was derived from multiple platforms (genomic, proteomic and imaging) and scales from individual proteins to cell clusters and heterogeneity, as well as bulk tumor RNA and imaging features. Longer overall survival for IDH1mt glioma patients may reflect mutation-driven alterations in cellular, molecular, and spatial heterogeneity which manifest in discernable radiological manifestations

Directory of Open Access Journals

eScholarship - University of California