35,961 research outputs found

Network-based stratification of tumor mutations.

Authors: Carter Hannah; Gross Andrew; Hofree Matan; Ideker Trey; Shen John P
Publication venue: eScholarship, University of California
Publication date: 01/01/2013

Many forms of cancer have multiple subtypes with different causes and clinical outcomes. Somatic tumor genome sequences provide a rich new source of data for uncovering these subtypes but have proven difficult to compare, as two tumors rarely share the same mutations. Here we introduce network-based stratification (NBS), a method to integrate somatic tumor genomes with gene networks. This approach allows for stratification of cancer into informative subtypes by clustering together patients with mutations in similar network regions. We demonstrate NBS in ovarian, uterine and lung cancer cohorts from The Cancer Genome Atlas. For each tissue, NBS identifies subtypes that are predictive of clinical outcomes such as patient survival, response to therapy or tumor histology. We identify network regions characteristic of each subtype and show how mutation-derived subtypes can be used to train an mRNA expression signature, which provides similar information in the absence of DNA sequence.
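The core idea in this abstract, that two tumors with no shared mutations can still look similar once their mutations are spread over a gene network, can be illustrated with a small sketch. The abstract does not give the propagation rule, so the update below (a random-walk-with-restart style smoothing) is an assumption chosen for illustration, as are the 5-gene network, the three patients, and the value of `alpha`. The clustering step that would follow is not shown.

```python
# Toy sketch only: the 5-gene network, the patients, and alpha are made-up assumptions.
import math


def normalize_rows(adj):
    """Divide each row by its sum so every gene spreads its score evenly to its neighbours."""
    out = []
    for row in adj:
        total = sum(row)
        out.append([x / total if total else 0.0 for x in row])
    return out


def propagate(profile, norm_adj, alpha=0.7, tol=1e-9, max_iter=1000):
    """Iterate F <- alpha * (F A) + (1 - alpha) * F0 until the largest change is below tol."""
    n = len(profile)
    f0 = [float(x) for x in profile]
    f = f0[:]
    for _ in range(max_iter):
        nxt = [
            alpha * sum(f[i] * norm_adj[i][j] for i in range(n)) + (1 - alpha) * f0[j]
            for j in range(n)
        ]
        if max(abs(a - b) for a, b in zip(nxt, f)) < tol:
            return nxt
        f = nxt
    return f


def cosine(u, v):
    """Cosine similarity between two score vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


# Genes 0-1-2 form a path; genes 3-4 form a separate, disconnected pair.
adjacency = [
    [0, 1, 0, 0, 0],
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 1, 0],
]
network = normalize_rows(adjacency)

patient_a = [1, 0, 0, 0, 0]  # mutated in gene 0
patient_b = [0, 0, 1, 0, 0]  # mutated in gene 2 (same network region as gene 0)
patient_c = [0, 0, 0, 0, 1]  # mutated in gene 4 (different network region)

smooth_a, smooth_b, smooth_c = (
    propagate(p, network) for p in (patient_a, patient_b, patient_c)
)

print(cosine(patient_a, patient_b))  # raw profiles share no mutated gene
print(cosine(smooth_a, smooth_b))    # positive: both profiles now carry weight on genes 0-2
print(cosine(smooth_a, smooth_c))    # still zero: the two regions are disconnected
```

In this toy setting, patients A and B become comparable after smoothing because their mutations fall in the same connected region, while patient C remains distinct.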

CiteSeerX

PubMed Central

eScholarship - University of California

Maximal information component analysis: a novel non-linear network analysis method.

Authors: Bennett Brian; Lusis Aldons J; Orozco Luz D; Rau Christoph D; Weiss James; Wisniewski Nicholas
Publication venue: eScholarship, University of California
Publication date: 01/01/2013

Background: Network construction and analysis algorithms provide scientists with the ability to sift through high-throughput biological outputs, such as transcription microarrays, for small groups of genes (modules) that are relevant for further research. Most of these algorithms ignore the important role of non-linear interactions in the data, and the ability of genes to operate in multiple functional groups at once, despite clear evidence for both of these phenomena in observed biological systems. Results: We have created a novel co-expression network analysis algorithm that incorporates both of these principles by combining the information-theoretic association measure of the maximal information coefficient (MIC) with an Interaction Component Model. We evaluate the performance of this approach on two datasets collected from a large panel of mice, one from macrophages and the other from liver, by comparing the two measures based on a measure of module entropy, Gene Ontology (GO) enrichment, and scale-free topology (SFT) fit. Our algorithm outperforms a widely used co-expression analysis method, weighted gene co-expression network analysis (WGCNA), in the macrophage data, while returning comparable results in the liver dataset when using these criteria. We demonstrate that the macrophage data has more non-linear interactions than the liver dataset, which may explain the increased performance of our method, termed Maximal Information Component Analysis (MICA), in that case. Conclusions: In making our network algorithm more accurately reflect known biological principles, we are able to generate modules with improved relevance, particularly in networks with confounding factors such as gene-by-environment interactions.

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

eScholarship - University of California

Machine Learning and Integrative Analysis of Biomedical Big Data.

Authors: Choi Howard; Chung Neo Christopher; Mirza Bilal; Ping Peipei; Wang Jie; Wang Wei
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges and exacerbates those associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues.

Multidisciplinary Digital Publishing Institute

Ezid

Directory of Open Access Journals

eScholarship - University of California

A Variable Polyglutamine Repeat Affects Subcellular Localization and Regulatory Activity of a Populus ANGUSTIFOLIA Protein.

Authors: Barry Kerrie; Bryan Anthony C; Chen Jin-Gui; Guo Jianjun; Jacobson Daniel; Jawdy Sara; Muchero Wellington; Ranjan Priya; Schmutz Jeremy; Singan Vasanth; Tuskan Gerald A; Weighill Deborah; Zhang Jin
Publication venue: eScholarship, University of California
Publication date: 01/07/2018

Polyglutamine (polyQ) stretches have been reported to occur in proteins across many organisms including animals, fungi and plants. Expansion of these repeats has attracted much attention due to their associations with numerous human diseases including Huntington's and other neurological maladies. This suggests that the relative length of polyQ stretches is an important modulator of their function. Here, we report the identification of a Populus C-terminus binding protein (CtBP) ANGUSTIFOLIA (PtAN1) which contains a polyQ stretch whose functional relevance had not been established. Analysis of 917 resequenced Populus trichocarpa genotypes revealed three allelic variants at this locus encoding 11-, 13- and 15-glutamine residues. Transient expression assays using Populus leaf mesophyll protoplasts revealed that the 11Q variant exhibited strong nuclear localization whereas the 15Q variant was only found in the cytosol, with the 13Q variant exhibiting localization in both subcellular compartments. We assessed functional implications by evaluating expression changes of putative PtAN1 targets in response to overexpression of the three allelic variants and observed allele-specific differences in expression levels of putative targets. Our results provide evidence that variation in polyQ length modulates PtAN1 function by altering subcellular localization.

eScholarship - University of California

WIPI1, BAG1 and PEX3 autophagy-related genes are relevant melanoma markers

Authors: D’Arcangelo Daniela; Facchiano Antonio; Facchiano Francesco; Giampietri Claudia; Muscio Mario; Scatozza Francesca
Publication venue: Hindawi Limited
Publication date: 01/01/2018

ROS and oxidative stress may promote autophagy; on the other hand, autophagy may help reduce oxidative damage. Given the known interplay of ROS, autophagy, and melanoma onset, we hypothesized that autophagy-related genes (ARGs) may represent useful melanoma biomarkers. We therefore analyzed the gene and protein expression of 222 ARGs in human melanoma samples from 5 independent expression databases (572 patients overall). Gene expression was first evaluated in the GEO database. Forty-two genes showed extremely high ability to discriminate melanoma from nevi (63 samples) according to ROC (AUC ≥ 0.85) and Mann-Whitney (p < 0.0001) analyses. The 9 genes not previously linked to melanoma were then validated in silico in the IST online database. BAG1, CHMP2B, PEX3, and WIPI1 confirmed a strong differential gene expression in 355 samples. A second-round validation performed on the Human Protein Atlas database showed strong differential protein expression for BAG1, PEX3, and WIPI1 in melanoma vs control samples, according to the image analysis of 80 human histological sections. WIPI1 gene expression also showed a significant prognostic value (p < 0.0001) according to survival data from 102 melanoma patients. Finally, we examined in the Oncomine database whether WIPI1 overexpression is melanoma-specific. Across more than 20 cancer types, the most relevant WIPI1 expression change (p = 0.00002; fold change = 3.1) was observed in melanoma. Molecular/functional relationships of the investigated molecules with melanoma and their molecular/functional network were analyzed via Chilibot software, STRING analysis, and gene ontology enrichment analysis. We conclude that WIPI1 (AUC = 0.99), BAG1 (AUC = 1), and PEX3 (AUC = 0.93) are relevant novel melanoma markers at both gene and protein levels.
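The abstract reports both ROC AUC values and Mann-Whitney tests for each gene. The two are closely related: the AUC equals the Mann-Whitney U statistic divided by the number of case-control pairs, which is the fraction of pairs in which the case scores higher (ties counted as half). The sketch below computes AUC that way. The function name and the expression values are hypothetical and are not taken from the study.

```python
# Illustrative sketch: the expression values below are invented, not study data.

def auc_pairwise(cases, controls):
    """AUC as U / (n_cases * n_controls): share of pairs where the case value is higher."""
    wins = 0.0
    for case_value in cases:
        for control_value in controls:
            if case_value > control_value:
                wins += 1.0
            elif case_value == control_value:
                wins += 0.5  # ties contribute half a win
    return wins / (len(cases) * len(controls))


# Hypothetical expression levels of one gene in melanoma and nevus samples.
melanoma = [7.9, 8.4, 9.1, 8.8]
nevi = [5.2, 6.0, 5.7]

print(auc_pairwise(melanoma, nevi))  # every melanoma value exceeds every nevus value
```

An AUC of 1, as reported for BAG1, corresponds to complete separation of this kind, while 0.5 would mean the gene discriminates no better than chance.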

Directory of Open Access Journals

Archivio della ricerca- Università di Roma La Sapienza