Search CORE

990 research outputs found

Machine Learning and Integrative Analysis of Biomedical Big Data.

Author: Choi Howard
Chung Neo Christopher
Mirza Bilal
Ping Peipei
Wang Jie
Wang Wei
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Recent developments in high-throughput technologies have accelerated the accumulation of massive amounts of omics data from multiple sources: genome, epigenome, transcriptome, proteome, metabolome, etc. Traditionally, data from each source (e.g., genome) is analyzed in isolation using statistical and machine learning (ML) methods. Integrative analysis of multi-omics and clinical data is key to new biomedical discoveries and advancements in precision medicine. However, data integration poses new computational challenges as well as exacerbates the ones associated with single-omics studies. Specialized computational approaches are required to effectively and efficiently perform integrative analysis of biomedical data acquired from diverse modalities. In this review, we discuss state-of-the-art ML-based approaches for tackling five specific computational challenges associated with integrative analysis: curse of dimensionality, data heterogeneity, missing data, class imbalance and scalability issues

Directory of Open Access Journals

eScholarship - University of California

Lessons from genetic profiling in soft tissue sarcomas

Author: Berner J. M.
Fernebro Josefin
Francis Princy
Meza-Zepeda L. A.
Myklebost O.
Namløs H. M.
Nilbert Mef
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2004
Field of study

Lund University Publications

Computational strategies in cardiometabolic diseases:a portal to deeper mechanistic understanding

Author: Lu Chang
Publication venue: 'University of Maastricht'
Publication date: 01/01/2022
Field of study

Gene expression studies from basic research to the clinic

Author: Karjalainen Juha
Publication venue: 'University of Groningen Press'
Publication date: 01/01/2018
Field of study

Dissertations of the University of Groningen

Randomization in Laboratory Procedure Is Key to Obtaining Reproducible Microarray Results

Author: A Brazma
BJ Singer
Christina A. Harrington
Christopher D. Coldren
D Seo
Gary A. Churchill
GK Smyth
Hyuna Yang
JA Hartigan
JD Storey
JE Larkin
JF Waring
JT Leek
KR Shockley
Kristina Vartanian
L Bullinger
P Tamayo
PJ Valk
R Ihaka
R Opgen-Rhein
RA Irizarry
RA Irizarry
Rob Hall
S Dudoit
S Falcon
Thomas Preiss
X Cui
Y Benjamini
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

The quality of gene expression microarray data has improved dramatically since the first arrays were introduced in the late 1990s. However, the reproducibility of data generated at multiple laboratory sites remains a matter of concern, especially for scientists who are attempting to combine and analyze data from public repositories. We have carried out a study in which a common set of RNA samples was assayed five times in four different laboratories using Affymetrix GeneChip arrays. We observed dramatic differences in the results across laboratories and identified batch effects in array processing as one of the primary causes for these differences. When batch processing of samples is confounded with experimental factors of interest it is not possible to separate their effects, and lists of differentially expressed genes may include many artifacts. This study demonstrates the substantial impact of sample processing on microarray analysis results and underscores the need for randomization in the laboratory as a means to avoid confounding of biological factors with procedural effects

Directory of Open Access Journals

Statistical Methods in Integrative Genomics

Author: Richardson Sylvia
Sun Wei
Tseng George C.
Publication venue
Publication date: 01/01/2016
Field of study

Statistical methods in integrative genomics aim to answer important biology questions by jointly analyzing multiple types of genomic data (vertical integration) or aggregating the same type of data across multiple studies (horizontal integration). In this article, we introduce different types of genomic data and data resources, and then review statistical methods of integrative genomics, with emphasis on the motivation and rationale of these methods. We conclude with some summary points and future research directions

Carolina Digital Repository