Machine Learning Approaches to Determine Feature Importance for Predicting Infant Autopsy Outcome

Booth, J; Bryant, W; Hutchinson, C; Issitt, R; Margetts, B; Martin, N; Sebire, NJ

Machine Learning Approaches to Determine Feature Importance for Predicting Infant Autopsy Outcome

Authors: J Booth
W Bryant
C Hutchinson
R Issitt
B Margetts
N Martin
NJ Sebire
Publication date: 30 March 2021
Publisher: SAGE PUBLICATIONS INC

Abstract

Introduction: Sudden unexpected death in infancy (SUDI) represents the commonest presentation of postneonatal death. We explored whether machine learning could be used to derive data driven insights for prediction of infant autopsy outcome. Methods: A paediatric autopsy database containing >7,000 cases, with >300 variables, was analysed by examination stage and autopsy outcome classified as ‘explained (medical cause of death identified)’ or ‘unexplained’. Decision tree, random forest, and gradient boosting models were iteratively trained and evaluated. Results: Data from 3,100 infant and young child (<2 years) autopsies were included. Naïve decision tree using external examination data had performance of 68% for predicting an explained death. Core data items were identified using model feature importance. The most effective model was XG Boost, with overall predictive performance of 80%, demonstrating age at death, and cardiovascular and respiratory histological findings as the most important variables associated with determining medical cause of death. Conclusion: This study demonstrates feasibility of using machine-learning to evaluate component importance of complex medical procedures (paediatric autopsy) and highlights value of collecting routine clinical data according to defined standards. This approach can be applied to a range of clinical and operational healthcare scenario

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

UCL Discovery

oai:eprints.ucl.ac.uk.OAI2:101...

Last time updated on 25/05/2021