57 research outputs found

    Algorithmic encoding of protected characteristics in chest X-ray disease detection models

    Get PDF
    Background It has been rightfully emphasized that the use of AI for clinical decision making could amplify health disparities. An algorithm may encode protected characteristics, and then use this information for making predictions due to undesirable correlations in the (historical) training data. It remains unclear how we can establish whether such information is actually used. Besides the scarcity of data from underserved populations, very little is known about how dataset biases manifest in predictive models and how this may result in disparate performance. This article aims to shed some light on these issues by exploring methodology for subgroup analysis in image-based disease detection models. Methods We utilize two publicly available chest X-ray datasets, CheXpert and MIMIC-CXR, to study performance disparities across race and biological sex in deep learning models. We explore test set resampling, transfer learning, multitask learning, and model inspection to assess the relationship between the encoding of protected characteristics and disease detection performance across subgroups. Findings We confirm subgroup disparities in terms of shifted true and false positive rates which are partially removed after correcting for population and prevalence shifts in the test sets. We find that transfer learning alone is insufficient for establishing whether specific patient information is used for making predictions. The proposed combination of test-set resampling, multitask learning, and model inspection reveals valuable insights about the way protected characteristics are encoded in the feature representations of deep neural networks. Interpretation Subgroup analysis is key for identifying performance disparities of AI models, but statistical differences across subgroups need to be taken into account when analyzing potential biases in disease detection. The proposed methodology provides a comprehensive framework for subgroup analysis enabling further research into the underlying causes of disparities. Funding European Research Council Horizon 2020, UK Research and Innovation

    Development of machine learning support for reading whole body diffusion-weighted MRI (WB-MRI) in myeloma for the detection and quantification of the extent of disease before and after treatment (MALIMAR): protocol for a cross-sectional diagnostic test accuracy study.

    Get PDF
    INTRODUCTION: Whole-body MRI (WB-MRI) is recommended by the National Institute of Clinical Excellence as the first-line imaging tool for diagnosis of multiple myeloma. Reporting WB-MRI scans requires expertise to interpret and can be challenging for radiologists who need to meet rapid turn-around requirements. Automated computational tools based on machine learning (ML) could assist the radiologist in terms of sensitivity and reading speed and would facilitate improved accuracy, productivity and cost-effectiveness. The MALIMAR study aims to develop and validate a ML algorithm to increase the diagnostic accuracy and reading speed of radiological interpretation of WB-MRI compared with standard methods. METHODS AND ANALYSIS: This phase II/III imaging trial will perform retrospective analysis of previously obtained clinical radiology MRI scans and scans from healthy volunteers obtained prospectively to implement training and validation of an ML algorithm. The study will comprise three project phases using approximately 633 scans to (1) train the ML algorithm to identify active disease, (2) clinically validate the ML algorithm and (3) determine change in disease status following treatment via a quantification of burden of disease in patients with myeloma. Phase 1 will primarily train the ML algorithm to detect active myeloma against an expert assessment ('reference standard'). Phase 2 will use the ML output in the setting of radiology reader study to assess the difference in sensitivity when using ML-assisted reading or human-alone reading. Phase 3 will assess the agreement between experienced readers (with and without ML) and the reference standard in scoring both overall burden of disease before and after treatment, and response. ETHICS AND DISSEMINATION: MALIMAR has ethical approval from South Central-Oxford C Research Ethics Committee (REC Reference: 17/SC/0630). IRAS Project ID: 233501. CPMS Portfolio adoption (CPMS ID: 36766). Participants gave informed consent to participate in the study before taking part. MALIMAR is funded by National Institute for Healthcare Research Efficacy and Mechanism Evaluation funding (NIHR EME Project ID: 16/68/34). Findings will be made available through peer-reviewed publications and conference dissemination. TRIAL REGISTRATION NUMBER: NCT03574454

    Post-acute blood biomarkers and disease progression in traumatic brain injury

    Get PDF
    There is substantial interest in the potential for traumatic brain injury to result in progressive neurological deterioration. While blood biomarkers such as glial fibrillary acid protein (GFAP) and neurofilament light have been widely explored in characterizing acute traumatic brain injury (TBI), their use in the chronic phase is limited. Given increasing evidence that these proteins may be markers of ongoing neurodegeneration in a range of diseases, we examined their relationship to imaging changes and functional outcome in the months to years following TBI.Two-hundred and three patients were recruited in two separate cohorts; 6 months post-injury (n = 165); and >5 years post-injury (n = 38; 12 of whom also provided data ∼8 months post-TBI). Subjects underwent blood biomarker sampling (n = 199) and MRI (n = 172; including diffusion tensor imaging). Data from patient cohorts were compared to 59 healthy volunteers and 21 non-brain injury trauma controls. Mean diffusivity and fractional anisotropy were calculated in cortical grey matter, deep grey matter and whole brain white matter. Accelerated brain ageing was calculated at a whole brain level as the predicted age difference defined using T1-weighted images, and at a voxel-based level as the annualized Jacobian determinants in white matter and grey matter, referenced to a population of 652 healthy control subjects.Serum neurofilament light concentrations were elevated in the early chronic phase. While GFAP values were within the normal range at ∼8 months, many patients showed a secondary and temporally distinct elevations up to >5 years after injury. Biomarker elevation at 6 months was significantly related to metrics of microstructural injury on diffusion tensor imaging. Biomarker levels at ∼8 months predicted white matter volume loss at >5 years, and annualized brain volume loss between ∼8 months and 5 years. Patients who worsened functionally between ∼8 months and >5 years showed higher than predicted brain age and elevated neurofilament light levels.GFAP and neurofilament light levels can remain elevated months to years after TBI, and show distinct temporal profiles. These elevations correlate closely with microstructural injury in both grey and white matter on contemporaneous quantitative diffusion tensor imaging. Neurofilament light elevations at ∼8 months may predict ongoing white matter and brain volume loss over >5 years of follow-up. If confirmed, these findings suggest that blood biomarker levels at late time points could be used to identify TBI survivors who are at high risk of progressive neurological damage.</p

    Better together: data harmonization and cross-study analysis of abdominal MRI data from UK Biobank and the German National Cohort

    Get PDF
    OBJECTIVES: The UK Biobank (UKBB) and German National Cohort (NAKO) are among the largest cohort studies, capturing a wide range of health-related data from the general population, including comprehensive magnetic resonance imaging (MRI) examinations. The purpose of this study was to demonstrate how MRI data from these large-scale studies can be jointly analyzed and to derive comprehensive quantitative image-based phenotypes across the general adult population. MATERIALS AND METHODS: Image-derived features of abdominal organs (volumes of liver, spleen, kidneys, and pancreas; volumes of kidney hilum adipose tissue; and fat fractions of liver and pancreas) were extracted from T1-weighted Dixon MRI data of 17,996 participants of UKBB and NAKO based on quality-controlled deep learning generated organ segmentations. To enable valid cross-study analysis, we first analyzed the data generating process using methods of causal discovery. We subsequently harmonized data from UKBB and NAKO using the ComBat approach for batch effect correction. We finally performed quantile regression on harmonized data across studies providing quantitative models for the variation of image-derived features stratified for sex and dependent on age, height, and weight. RESULTS: Data from 8791 UKBB participants (49.9% female; age, 63 ± 7.5 years) and 9205 NAKO participants (49.1% female, age: 51.8 ± 11.4 years) were analyzed. Analysis of the data generating process revealed direct effects of age, sex, height, weight, and the data source (UKBB vs NAKO) on image-derived features. Correction of data source-related effects resulted in markedly improved alignment of image-derived features between UKBB and NAKO. Cross-study analysis on harmonized data revealed comprehensive quantitative models for the phenotypic variation of abdominal organs across the general adult population. CONCLUSIONS: Cross-study analysis of MRI data from UKBB and NAKO as proposed in this work can be helpful for future joint data analyses across cohorts linking genetic, environmental, and behavioral risk factors to MRI-derived phenotypes and provide reference values for clinical diagnostics

    Predictors of Access to Rehabilitation in the Year Following Traumatic Brain Injury : A European Prospective and Multicenter Study

    Get PDF
    Background Although rehabilitation is beneficial for individuals with traumatic brain injury (TBI), a significant proportion of them do not receive adequate rehabilitation after acute care. Objective Therefore, the goal of this prospective and multicenter study was to investigate predictors of access to rehabilitation in the year following injury in patients with TBI. Methods Data from a large European study (CENTER-TBI), including TBIs of all severities between December 2014 and December 2017 were used (N = 4498 patients). Participants were dichotomized into those who had and those who did not have access to rehabilitation in the year following TBI. Potential predictors included sociodemographic factors, psychoactive substance use, preinjury medical history, injury-related factors, and factors related to medical care, complications, and discharge. Results In the year following traumatic injury, 31.4% of patients received rehabilitation services. Access to rehabilitation was positively and significantly predicted by female sex (odds ratio [OR] = 1.50), increased number of years of education completed (OR = 1.05), living in Northern (OR = 1.62; reference: Western Europe) or Southern Europe (OR = 1.74), lower prehospital Glasgow Coma Scale score (OR = 1.03), higher Injury Severity Score (OR = 1.01), intracranial (OR = 1.33) and extracranial (OR = 1.99) surgery, and extracranial complication (OR = 1.75). On contrast, significant negative predictors were lack of preinjury employment (OR = 0.80), living in Central and Eastern Europe (OR = 0.42), and admission to hospital ward (OR = 0.47; reference: admission to intensive care unit) or direct discharge from emergency room (OR = 0.24). Conclusions Based on these findings, there is an urgent need to implement national and international guidelines and strategies for access to rehabilitation after TBI.Peer reviewe

    Tracheal intubation in traumatic brain injury: a multicentre prospective observational study

    Get PDF
    Background We aimed to study the associations between pre- and in-hospital tracheal intubation and outcomes in traumatic brain injury (TBI), and whether the association varied according to injury severity. Methods Data from the international prospective pan-European cohort study, Collaborative European NeuroTrauma Effectiveness Research for TBI (CENTER-TBI), were used (n=4509). For prehospital intubation, we excluded self-presenters. For in-hospital intubation, patients whose tracheas were intubated on-scene were excluded. The association between intubation and outcome was analysed with ordinal regression with adjustment for the International Mission for Prognosis and Analysis of Clinical Trials in TBI variables and extracranial injury. We assessed whether the effect of intubation varied by injury severity by testing the added value of an interaction term with likelihood ratio tests. Results In the prehospital analysis, 890/3736 (24%) patients had their tracheas intubated at scene. In the in-hospital analysis, 460/2930 (16%) patients had their tracheas intubated in the emergency department. There was no adjusted overall effect on functional outcome of prehospital intubation (odds ratio=1.01; 95% confidence interval, 0.79–1.28; P=0.96), and the adjusted overall effect of in-hospital intubation was not significant (odds ratio=0.86; 95% confidence interval, 0.65–1.13; P=0.28). However, prehospital intubation was associated with better functional outcome in patients with higher thorax and abdominal Abbreviated Injury Scale scores (P=0.009 and P=0.02, respectively), whereas in-hospital intubation was associated with better outcome in patients with lower Glasgow Coma Scale scores (P=0.01): in-hospital intubation was associated with better functional outcome in patients with Glasgow Coma Scale scores of 10 or lower. Conclusion The benefits and harms of tracheal intubation should be carefully evaluated in patients with TBI to optimise benefit. This study suggests that extracranial injury should influence the decision in the prehospital setting, and level of consciousness in the in-hospital setting. Clinical trial registration NCT02210221

    How do 66 European institutional review boards approve one protocol for an international prospective observational study on traumatic brain injury? Experiences from the CENTER-TBI study

    Get PDF
    Background The European Union (EU) aims to optimize patient protection and efficiency of health-care research by harmonizing procedures across Member States. Nonetheless, further improvements are required to increase multicenter research efficiency. We investigated IRB procedures in a large prospective European multicenter study on traumatic brain injury (TBI), aiming to inform and stimulate initiatives to improve efficiency. Methods We reviewed relevant documents regarding IRB submission and IRB approval from European neurotrauma centers participating in the Collaborative European NeuroTrauma Effectiveness Research in Traumatic Brain Injury (CENTER-TBI). Documents included detailed information on IRB procedures and the duration from IRB submission until approval(s). They were translated and analyzed to determine the level of harmonization of IRB procedures within Europe. Results From 18 countries, 66 centers provided the requested documents. The primary IRB review was conducted centrally (N = 11, 61%) or locally (N = 7, 39%) and primary IRB approval was obtained after one (N = 8, 44%), two (N = 6, 33%) or three (N = 4, 23%) review rounds with a median duration of respectively 50 and 98 days until primary IRB approval. Additional IRB approval was required in 55% of countries and could increase duration to 535 days. Total duration from submission until required IRB approval was obtained was 114 days (IQR 75-224) and appeared to be shorter after submission to local IRBs compared to central IRBs (50 vs. 138 days, p = 0.0074). Conclusion We found variation in IRB procedures between and within European countries. There were differences in submission and approval requirements, number of review rounds and total duration. Research collaborations could benefit from the implementation of more uniform legislation and regulation while acknowledging local cultural habits and moral values between countries.Peer reviewe

    Quality indicators for patients with traumatic brain injury in European intensive care units

    Get PDF
    Background: The aim of this study is to validate a previously published consensus-based quality indicator set for the management of patients with traumatic brain injury (TBI) at intensive care units (ICUs) in Europe and to study its potential for quality measur

    Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury

    Get PDF
    Objective: We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury. Study Design and Setting: We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma Scale <13, n = 1,554). Both calibration (calibration slope/intercept) and discrimination (area under the curve) was quantified. Results: In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study. Conclusion: ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations

    Changing care pathways and between-center practice variations in intensive care for traumatic brain injury across Europe

    Get PDF
    Purpose: To describe ICU stay, selected management aspects, and outcome of Intensive Care Unit (ICU) patients with traumatic brain injury (TBI) in Europe, and to quantify variation across centers. Methods: This is a prospective observational multicenter study conducted across 18 countries in Europe and Israel. Admission characteristics, clinical data, and outcome were described at patient- and center levels. Between-center variation in the total ICU population was quantified with the median odds ratio (MOR), with correction for case-mix and random variation between centers. Results: A total of 2138 patients were admitted to the ICU, with median age of 49 years; 36% of which were mild TBI (Glasgow Coma Scale; GCS 13–15). Within, 72 h 636 (30%) were discharged and 128 (6%) died. Early deaths and long-stay patients (> 72 h) had more severe injuries based on the GCS and neuroimaging characteristics, compared with short-stay patients. Long-stay patients received more monitoring and were treated at higher intensity, and experienced worse 6-month outcome compared to short-stay patients. Between-center variations were prominent in the proportion of short-stay patients (MOR = 2.3, p < 0.001), use of intracranial pressure (ICP) monitoring (MOR = 2.5, p < 0.001) and aggressive treatme
    corecore