Search CORE

35 research outputs found

Distributed learning on 20 000+ lung cancer patients - The Personal Health Train

Author: Bussink Johan
Button Mick
Chen Jiayan
Dankers Frank J. W. M.
Deist Timo M.
Dekker Andre
Faivre-Finn Corinne
Janssen Tomas
Jochems Arthur
Lambin Philippe
Marshall M. Scott
Masciocchi Carlotta
Monshouwer René
Nuyttens Joost Jan
Ojha Priyanka
Price Gareth
Spezi Emiliano
Valentini Vincenzo
van Soest Johan
Vernhout René
Wang Jiazhou
Zhang Zhen
Publication venue: 'Elsevier BV'
Publication date: 01/01/2020
Field of study

Background and purpose Access to healthcare data is indispensable for scientific progress and innovation. Sharing healthcare data is time-consuming and notoriously difficult due to privacy and regulatory concerns. The Personal Health Train (PHT) provides a privacy-by-design infrastructure connecting FAIR (Findable, Accessible, Interoperable, Reusable) data sources and allows distributed data analysis and machine learning. Patient data never leaves a healthcare institute. Materials and methods Lung cancer patient-specific databases (tumor staging and post-treatment survival information) of oncology departments were translated according to a FAIR data model and stored locally in a graph database. Software was installed locally to enable deployment of distributed machine learning algorithms via a central server. Algorithms (MATLAB, code and documentation publicly available) are patient privacy-preserving as only summary statistics and regression coefficients are exchanged with the central server. A logistic regression model to predict post-treatment two-year survival was trained and evaluated by receiver operating characteristic curves (ROC), root mean square prediction error (RMSE) and calibration plots. Results In 4 months, we connected databases with 23 203 patient cases across 8 healthcare institutes in 5 countries (Amsterdam, Cardiff, Maastricht, Manchester, Nijmegen, Rome, Rotterdam, Shanghai) using the PHT. Summary statistics were computed across databases. A distributed logistic regression model predicting post-treatment two-year survival was trained on 14 810 patients treated between 1978 and 2011 and validated on 8 393 patients treated between 2012 and 2015. Conclusion The PHT infrastructure demonstrably overcomes patient privacy barriers to healthcare data sharing and enables fast data analyses across multiple institutes from different countries with different regulatory regimens. This infrastructure promotes global evidence-based medicine while prioritizing patient privacy

Maastricht University Research Portal

Online Research @ Cardiff

PubliCatt

EUR Research Repository

The University of Manchester - Institutional Repository

The EGFRvIII transcriptome in glioblastoma, a meta-omics analysis.

BACKGROUND: EGFR is among the genes most frequently altered in glioblastoma, with exons 2-7 deletions (EGFRvIII) being amongst its most common genomic mutations. There are conflicting reports about its prognostic role and it remains unclear whether and how it differs in signalling compared with wildtype EGFR. METHODS: To better understand the oncogenic role of EGFRvIII, we leveraged four large datasets into one large glioblastoma transcriptome dataset (n=741) alongside 81 whole-genome samples from two datasets. RESULTS: The EGFRvIII/EGFR expression ratios differ strongly between tumours and ranges from 1% to 95%. Interestingly, the slope of relative EGFRvIII expression is near-linear, which argues against a more positive selection pressure than EGFR wildtype. An absence of selection pressure is also suggested by the similar survival between EGFRvIII positive and negative glioblastoma patients. EGFRvIII levels are inversely correlated with pan-EGFR (all wildtype and mutant variants) expression, which indicates that EGFRvIII has a higher potency in downstream pathway activation. EGFRvIII-positive glioblastomas have a lower CDK4 or MDM2 amplification incidence than EGFRvIII-negative (p=0.007), which may point towards crosstalk between these pathways. EGFRvIII-expressing tumours have an upregulation of 'classical' subtype genes compared to those with EGFR-amplification only (p=3.873e-6). Genomic breakpoints of the EGFRvIII deletions have a preference towards the 3' end of the large intron-1. These preferred breakpoints preserve a cryptic exon resulting in a novel EGFRvIII variant and preserve an intronic enhancer. CONCLUSIONS: These data provide deeper insights into the complex EGFRvIII biology and provide new insights for targeting EGFRvIII mutated tumours

Maastricht University Research Portal

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

EUR Research Repository

Open Repository and Bibliography - Liège

Dissertations of the University of Groningen