42 research outputs found
Explainable artificial intelligence (XAI) in deep learning-based medical image analysis
With an increase in deep learning-based methods, the call for explainability
of such methods grows, especially in high-stakes decision making areas such as
medical image analysis. This survey presents an overview of eXplainable
Artificial Intelligence (XAI) used in deep learning-based medical image
analysis. A framework of XAI criteria is introduced to classify deep
learning-based medical image analysis methods. Papers on XAI techniques in
medical image analysis are then surveyed and categorized according to the
framework and according to anatomical location. The paper concludes with an
outlook of future opportunities for XAI in medical image analysis.Comment: Submitted for publication. Comments welcome by email to first autho
Volumetric breast density estimation on MRI using explainable deep learning regression
To purpose of this paper was to assess the feasibility of volumetric breast density estimations on MRI without segmentations accompanied with an explainability step. A total of 615 patients with breast cancer were included for volumetric breast density estimation. A 3-dimensional regression convolutional neural network (CNN) was used to estimate the volumetric breast density. Patients were split in training (N = 400), validation (N = 50), and hold-out test set (N = 165). Hyperparameters were optimized using Neural Network Intelligence and augmentations consisted of translations and rotations. The estimated densities were evaluated to the ground truth using Spearman's correlation and Bland-Altman plots. The output of the CNN was visually analyzed using SHapley Additive exPlanations (SHAP). Spearman's correlation between estimated and ground truth density was ρ = 0.81 (N = 165, P < 0.001) in the hold-out test set. The estimated density had a median bias of 0.70% (95% limits of agreement = - 6.8% to 5.0%) to the ground truth. SHAP showed that in correct density estimations, the algorithm based its decision on fibroglandular and fatty tissue. In incorrect estimations, other structures such as the pectoral muscle or the heart were included. To conclude, it is feasible to automatically estimate volumetric breast density on MRI without segmentations, and to provide accompanying explanations
Multi-modal volumetric concept activation to explain detection and classification of metastatic prostate cancer on PSMA-PET/CT
Explainable artificial intelligence (XAI) is increasingly used to analyze the
behavior of neural networks. Concept activation uses human-interpretable
concepts to explain neural network behavior. This study aimed at assessing the
feasibility of regression concept activation to explain detection and
classification of multi-modal volumetric data.
Proof-of-concept was demonstrated in metastatic prostate cancer patients
imaged with positron emission tomography/computed tomography (PET/CT).
Multi-modal volumetric concept activation was used to provide global and local
explanations.
Sensitivity was 80% at 1.78 false positive per patient. Global explanations
showed that detection focused on CT for anatomical location and on PET for its
confidence in the detection. Local explanations showed promise to aid in
distinguishing true positives from false positives. Hence, this study
demonstrated feasibility to explain detection and classification of multi-modal
volumetric data using regression concept activation.Comment: Accepted as: Kraaijveld, R.C.J., Philippens, M.E.P., Eppinga, W.S.C.,
J\"urgenliemk-Schulz, I.M., Gilhuijs, K.G.A., Kroon, P.S., van der Velden,
B.H.M. "Multi-modal volumetric concept activation to explain detection and
classification of metastatic prostate cancer on PSMA-PET/CT." MICCAI workshop
on Interpretability of Machine Intelligence in Medical Image Computing
(iMIMIC), 202
Radiologic versus Segmentation Measurements to Quantify Wilms Tumor Volume on MRI in Pediatric Patients
Wilms tumor is a common pediatric solid tumor. To evaluate tumor response to chemotherapy and decide whether nephron-sparing surgery is possible, tumor volume measurements based on magnetic resonance imaging (MRI) are important. Currently, radiological volume measurements are based on measuring tumor dimensions in three directions. Manual segmentation-based volume measurements might be more accurate, but this process is time-consuming and user-dependent. The aim of this study was to investigate whether manual segmentation-based volume measurements are more accurate and to explore whether these segmentations can be automated using deep learning. We included the MRI images of 45 Wilms tumor patients (age 0-18 years). First, we compared radiological tumor volumes with manual segmentation-based tumor volume measurements. Next, we created an automated segmentation method by training a nnU-Net in a five-fold cross-validation. Segmentation quality was validated by comparing the automated segmentation with the manually created ground truth segmentations, using Dice scores and the 95th percentile of the Hausdorff distances (HD95). On average, manual tumor segmentations result in larger tumor volumes. For automated segmentation, the median dice was 0.90. The median HD95 was 7.2 mm. We showed that radiological volume measurements underestimated tumor volume by about 10% when compared to manual segmentation-based volume measurements. Deep learning can potentially be used to replace manual segmentation to benefit from accurate volume measurements without time and observer constraints
Automated rating of background parenchymal enhancement in MRI of extremely dense breasts without compromising the association with breast cancer in the DENSE trial
Objectives: Background parenchymal enhancement (BPE) on dynamic contrast-enhanced MRI (DCE-MRI) as rated by radiologists is subject to inter- and intrareader variability. We aim to automate BPE category from DCE-MRI. Methods: This study represents a secondary analysis of the Dense Tissue and Early Breast Neoplasm Screening trial. 4553 women with extremely dense breasts who received supplemental breast MRI screening in eight hospitals were included. Minimal, mild, moderate and marked BPE rated by radiologists were used as reference. Fifteen quantitative MRI features of the fibroglandular tissue were extracted to predict BPE using Random Forest, Naïve Bayes, and KNN classifiers. Majority voting was used to combine the predictions. Internal-external validation was used for training and validation. The inverse-variance weighted mean accuracy was used to express mean performance across the eight hospitals. Cox regression was used to verify non inferiority of the association between automated rating and breast cancer occurrence compared to the association for manual rating. Results: The accuracy of majority voting ranged between 0.56 and 0.84 across the eight hospitals. The weighted mean prediction accuracy for the four BPE categories was 0.76. The hazard ratio (HR) of BPE for breast cancer occurrence was comparable between automated rating and manual rating (HR = 2.12 versus HR = 1.97, P = 0.65 for mild/moderate/marked BPE relative to minimal BPE). Conclusion: It is feasible to rate BPE automatically in DCE-MRI of women with extremely dense breasts without compromising the underlying association between BPE and breast cancer occurrence. The accuracy for minimal BPE is superior to that for other BPE categories
Contralateral parenchymal enhancement on breast MRI before and during neoadjuvant endocrine therapy in relation to the preoperative endocrine prognostic index
OBJECTIVES: To investigate whether contralateral parenchymal enhancement (CPE) on MRI during neoadjuvant endocrine therapy (NET) is associated with the preoperative endocrine prognostic index (PEPI) of ER+/HER2- breast cancer. METHODS: This retrospective observational cohort study included 40 unilateral ER+/HER2- breast cancer patients treated with NET. Patients received NET for 6 to 9 months with MRI response monitoring after 3 and/or 6 months. PEPI was used as endpoint. PEPI is based on surgery-derived pathology (pT- and pN-stage, Ki67, and ER-status) and stratifies patients in three groups with distinct prognoses. Mixed effects and ROC analysis were performed to investigate whether CPE was associated with PEPI and to assess discriminatory ability. RESULTS: The median patient age was 61 (interquartile interval: 52, 69). Twelve patients had PEPI-1 (good prognosis), 15 PEPI-2 (intermediate), and 13 PEPI-3 (poor). High pretreatment CPE was associated with PEPI-3: pretreatment CPE was 39.4% higher on average (95% CI = 1.3, 91.9%; p = .047) compared with PEPI-1. CPE decreased after 3 months in PEPI-2 and PEPI-3. The average reduction was 24.4% (95% CI = 2.6, 41.3%; p = .032) in PEPI-2 and 29.2% (95% CI = 7.8, 45.6%; p = .011) in PEPI-3 compared with baseline. Change in CPE was predictive of PEPI-1 vs PEPI-2+3 (AUC = 0.77; 95% CI = 0.57, 0.96). CONCLUSIONS: CPE during NET is associated with PEPI-group in ER+/HER2- breast cancer: a high pretreatment CPE and a decrease in CPE during NET were associated with a poor prognosis after NET on the basis of PEPI. KEY POINTS: • Change in contralateral breast parenchymal enhancement on MRI during neoadjuvant endocrine therapy distinguished between patients with a good and intermediate/poor prognosis at final pathology. • Patients with a poor prognosis at final pathology showed higher baseline parenchymal enhancement on average compared to patients with a good prognosis. • Patients with an intermediate/poor prognosis at final pathology showed a higher average reduction in parenchymal enhancement after 3 months of neoadjuvant endocrine therapy
Long-term Survival in Breast Cancer Patients Is Associated with Contralateral Parenchymal Enhancement at MRI: Outcomes of the SELECT Study
Background Several single-center studies found that high contralateral parenchymal enhancement (CPE) at breast MRI was associated with improved long-term survival in patients with estrogen receptor (ER)-positive and human epidermal growth factor receptor 2 (HER2)-negative breast cancer. Due to varying sample sizes, population characteristics, and follow-up times, consensus of the association is currently lacking. Purpose To confirm whether CPE is associated with long-term survival in a large multicenter retrospective cohort, and to investigate if CPE is associated with endocrine therapy effectiveness. Materials and Methods This multicenter observational cohort included women with unilateral ER-positive HER2-negative breast cancer (tumor size ≤50 mm and ≤three positive lymph nodes) who underwent MRI from January 2005 to December 2010. Overall survival (OS), recurrence-free survival (RFS), and distant RFS (DRFS) were assessed. Kaplan-Meier analysis was performed to investigate differences in absolute risk after 10 years, stratified according to CPE tertile. Multivariable Cox proportional hazards regression analysis was performed to investigate whether CPE was associated with prognosis and endocrine therapy effectiveness. Results Overall, 1432 women (median age, 54 years [IQR, 47-63 years]) were included from 10 centers. Differences in absolute OS after 10 years were stratified according to CPE tertile as follows: 88.5% (95% CI: 88.1, 89.1) in tertile 1, 85.8% (95% CI: 85.2, 86.3) in tertile 2, and 85.9% (95% CI: 85.4, 86.4) in tertile 3. CPE was independently associated with OS, with a hazard ratio (HR) of 1.17 (95% CI: 1.0, 1.36; P = .047), but was not associated with RFS (HR, 1.11; P = .16) or DRFS (HR, 1.11; P = .19). The effect of endocrine therapy on survival could not be accurately assessed; therefore, the association between endocrine therapy efficacy and CPE could not reliably be estimated. Conclusion High contralateral parenchymal enhancement was associated with a marginally decreased overall survival in patients with estrogen receptor-positive and human epidermal growth factor receptor 2-negative breast cancer, but was not associated with recurrence-free survival (RFS) or distant RFS. Published under a CC BY 4.0 license. Supplemental material is available for this article. See also the editorial by Honda and Iima in this issue