TOWARDS AN ACCURATE CANCER DIAGNOSIS MODELIZATION:COMPARISON OF RANDOM FOREST STRATEGIES

Azencott, Chloe-Agathe; BOURS, Vincent; Debit, Ahmed; Jerusalem, Guy; JOSSE, Claire; Poulet, Christophe; Van Steen, Kristel

TOWARDS AN ACCURATE CANCER DIAGNOSIS MODELIZATION:COMPARISON OF RANDOM FOREST STRATEGIES

Authors: Chloe-Agathe Azencott
Vincent BOURS
Ahmed Debit
Guy Jerusalem
Claire JOSSE
Christophe Poulet
Kristel Van Steen
Publication date: 15 March 2019
Publisher

Abstract

Machine learning approaches are heavily used to produce models that will one day support clinical decisions. To be reliably used as a medical decision, such diagnosis and prognosis tools have to harbor a high-level of precision. Random Forests have been already used in cancer diagnosis, prognosis, and screening. Numerous Random Forests methods have been derived from the original random forest algorithm from Breiman et al. in 2001. Nevertheless, the precision of their generated models remains unknown when facing biological data. The precision of such models can be therefore too variable to produce models with the same accuracy of classification, making them useless in daily clinics. Here, we perform an empirical comparison of Random Forest based strategies, looking for their precision in model accuracy and overall computational time. An assessment of 15 methods is carried out for the classification of paired normal -tumor patients, from 3 TCGA RNA-Seq datasets: BRCA (Breast Invasive Carcinoma), LUSC (Lung Squamous Cell Carcinoma), and THCA (Thyroid Carcinoma). Results demonstrate noteworthy differences in the precisions of the model accuracy and the overall time processing, between the strategies for one dataset, as well as between datasets for one strategy. Therefore, we highly recommend to test each random forest strategy prior to modelization. This will certainly improve the precision in model accuracy while revealing the method of choice for the candidate data.WALInnov-NACATS 161012

Similar works

Full text

Available Versions

Open Repository and Bibliography - Liège

oai:orbi.ulg.ac.be:2268/251112

Last time updated on 06/10/2020