210 research outputs found

    Generalised Dice overlap as a deep learning loss function for highly unbalanced segmentations

    Get PDF
    Deep-learning has proved in recent years to be a powerful tool for image analysis and is now widely used to segment both 2D and 3D medical images. Deep-learning segmentation frameworks rely not only on the choice of network architecture but also on the choice of loss function. When the segmentation process targets rare observations, a severe class imbalance is likely to occur between candidate labels, thus resulting in sub-optimal performance. In order to mitigate this issue, strategies such as the weighted cross-entropy function, the sensitivity function or the Dice loss function, have been proposed. In this work, we investigate the behavior of these loss functions and their sensitivity to learning rate tuning in the presence of different rates of label imbalance across 2D and 3D segmentation tasks. We also propose to use the class re-balancing properties of the Generalized Dice overlap, a known metric for segmentation assessment, as a robust and accurate deep-learning loss function for unbalanced tasks

    A Heteroscedastic Uncertainty Model for Decoupling Sources of MRI Image Quality

    Get PDF
    Quality control (QC) of medical images is essential to ensure that downstream analyses such as segmentation can be performed successfully. Currently, QC is predominantly performed visually at significant time and operator cost. We aim to automate the process by formulating a probabilistic network that estimates uncertainty through a heteroscedastic noise model, hence providing a proxy measure of task-specific image quality that is learnt directly from the data. By augmenting the training data with different types of simulated k-space artefacts, we propose a novel cascading CNN architecture based on a student-teacher framework to decouple sources of uncertainty related to different k-space augmentations in an entirely self-supervised manner. This enables us to predict separate uncertainty quantities for the different types of data degradation. While the uncertainty measures reflect the presence and severity of image artefacts, the network also provides the segmentation predictions given the quality of the data. We show models trained with simulated artefacts provide informative measures of uncertainty on real-world images and we validate our uncertainty predictions on problematic images identified by human-raters

    Where is VALDO? VAscular Lesions Detection and segmentatiOn challenge at MICCAI 2021

    Full text link
    Imaging markers of cerebral small vessel disease provide valuable information on brain health, but their manual assessment is time-consuming and hampered by substantial intra- and interrater variability. Automated rating may benefit biomedical research, as well as clinical assessment, but diagnostic reliability of existing algorithms is unknown. Here, we present the results of the \textit{VAscular Lesions DetectiOn and Segmentation} (\textit{Where is VALDO?}) challenge that was run as a satellite event at the international conference on Medical Image Computing and Computer Aided Intervention (MICCAI) 2021. This challenge aimed to promote the development of methods for automated detection and segmentation of small and sparse imaging markers of cerebral small vessel disease, namely enlarged perivascular spaces (EPVS) (Task 1), cerebral microbleeds (Task 2) and lacunes of presumed vascular origin (Task 3) while leveraging weak and noisy labels. Overall, 12 teams participated in the challenge proposing solutions for one or more tasks (4 for Task 1 - EPVS, 9 for Task 2 - Microbleeds and 6 for Task 3 - Lacunes). Multi-cohort data was used in both training and evaluation. Results showed a large variability in performance both across teams and across tasks, with promising results notably for Task 1 - EPVS and Task 2 - Microbleeds and not practically useful results yet for Task 3 - Lacunes. It also highlighted the performance inconsistency across cases that may deter use at an individual level, while still proving useful at a population level

    A Decoupled Uncertainty Model for MRI Segmentation Quality Estimation

    Get PDF
    Quality control (QC) of MR images is essential to ensure that downstream analyses such as segmentation can be performed successfully. Currently, QC is predominantly performed visually and subjectively, at significant time and operator cost. We aim to automate the process using a probabilistic network that estimates segmentation uncertainty through a heteroscedastic noise model, providing a measure of task-specific quality. By augmenting training images with k-space artefacts, we propose a novel CNN architecture to decouple sources of uncertainty related to the task and different k-space artefacts in a self-supervised manner. This enables the prediction of separate uncertainties for different types of data degradation. While the uncertainty predictions reflect the presence and severity of artefacts, the network provides more robust and generalisable segmentation predictions given the quality of the data. We show that models trained with artefact augmentation provide informative measures of uncertainty on both simulated artefacts and problematic real-world images identified by human raters, both qualitatively and quantitatively in the form of error bars on volume measurements. Relating artefact uncertainty to segmentation Dice scores, we observe that our uncertainty predictions provide a better estimate of MRI quality from the point of view of the task (gray matter segmentation) compared to commonly used metrics of quality including signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR), hence providing a real-time quality metric indicative of segmentation quality

    Hierarchical brain parcellation with uncertainty

    Get PDF
    Many atlases used for brain parcellation are hierarchically organised, progressively dividing the brain into smaller sub-regions. However, state-of-the-art parcellation methods tend to ignore this structure and treat labels as if they are `flat'. We introduce a hierarchically-aware brain parcellation method that works by predicting the decisions at each branch in the label tree. We further show how this method can be used to model uncertainty separately for every branch in this label tree. Our method exceeds the performance of flat uncertainty methods, whilst also providing decomposed uncertainty estimates that enable us to obtain self-consistent parcellations and uncertainty maps at any level of the label hierarchy. We demonstrate a simple way these decision-specific uncertainty maps may be used to provided uncertainty-thresholded tissue maps at any level of the label tree.Comment: To be published in the MICCAI 2020 workshop: Uncertainty for Safe Utilization of Machine Learning in Medical Imagin

    APOE ?4 status is associated with white matter hyperintensities volume accumulation rate independent of AD diagnosis.

    Get PDF
    To assess the relationship between carriage of APOE ?4 allele and evolution of white matter hyperintensities (WMHs) volume, we longitudinally studied 339 subjects from the Alzheimer's Disease Neuroimaging Initiative cohort with diagnoses ranging from normal controls to probable Alzheimer's disease (AD). A purpose-built longitudinal automatic method was used to segment WMH using constraints derived from an atlas-based model selection applied to a time-averaged image. Linear mixed models were used to evaluate the differences in rate of change across diagnosis and genetic groups. After adjustment for covariates (age, sex, and total intracranial volume), homozygous APOE ?4?4 subjects had a significantly higher rate of WMH accumulation (22.5% per year 95% CI [14.4, 31.2] for a standardized population having typical values of covariates) compared with the heterozygous (?4?3) subjects (10.0% per year [6.7, 13.4]) and homozygous ?3?3 (6.6% per year [4.1, 9.3]) subjects. Rates of accumulation increased with diagnostic severity; controls accumulated 5.8% per year 95% CI: [2.2, 9.6] for the standardized population, early mild cognitive impairment 6.6% per year [3.9, 9.4], late mild cognitive impairment 12.5% per year [8.2, 17.0] and AD subjects 14.7% per year [6.0, 24.0]. Following adjustment for APOE status, these differences became nonstatistically significant suggesting that APOE ?4 genotype is the major driver of accumulation of WMH volume rather than diagnosis of AD

    Let's agree to disagree: learning highly debatable multirater labelling

    Full text link
    Classification and differentiation of small pathological objects may greatly vary among human raters due to differences in training, expertise and their consistency over time. In a radiological setting, objects commonly have high within-class appearance variability whilst sharing certain characteristics across different classes, making their distinction even more difficult. As an example, markers of cerebral small vessel disease, such as enlarged perivascular spaces (EPVS) and lacunes, can be very varied in their appearance while exhibiting high inter-class similarity, making this task highly challenging for human raters. In this work, we investigate joint models of individual rater behaviour and multirater consensus in a deep learning setting, and apply it to a brain lesion object-detection task. Results show that jointly modelling both individual and consensus estimates leads to significant improvements in performance when compared to directly predicting consensus labels, while also allowing the characterization of human-rater consistency.Comment: Accepted at MICCAI 201

    Common infections and neuroimaging markers of dementia in three UK cohort studies

    Get PDF
    INTRODUCTION: We aimed to investigate associations between common infections and neuroimaging markers of dementia risk (brain volume, hippocampal volume, white matter lesions) across three population-based studies. METHODS: We tested associations between serology measures (pathogen serostatus, cumulative burden, continuous antibody responses) and outcomes using linear regression, including adjustments for total intracranial volume and scanner/clinic information (basic model), age, sex, ethnicity, education, socioeconomic position, alcohol, body mass index, and smoking (fully adjusted model). Interactions between serology measures and apolipoprotein E (APOE) genotype were tested. Findings were meta-analyzed across cohorts (Nmain  = 2632; NAPOE-interaction  = 1810). RESULTS: Seropositivity to John Cunningham virus associated with smaller brain volumes in basic models (β = -3.89 mL [-5.81, -1.97], Padjusted  < 0.05); these were largely attenuated in fully adjusted models (β = -1.59 mL [-3.55, 0.36], P = 0.11). No other relationships were robust to multiple testing corrections and sensitivity analyses, but several suggestive associations were observed. DISCUSSION: We did not find clear evidence for relationships between common infections and markers of dementia risk. Some suggestive findings warrant testing for replication
    corecore