21 research outputs found
Lack of agreement between radiologists: implications for image-based model observers
We tested the agreement of radiologists' rankings of different reconstructions of breast computed tomography images based on their diagnostic (classification) performance and on their subjective image quality assessments. We used 102 pathology proven cases (62 malignant, 40 benign), and an iterative image reconstruction (IIR) algorithm to obtain 24 reconstructions per case with different image appearances. Using image feature analysis, we selected 3 IIRs and 1 clinical reconstruction and 50 lesions. The reconstructions produced a range of image quality from smooth/low-noise to sharp/high-noise, which had a range in classifier performance corresponding to AUCs of 0.62 to 0.96. Six experienced Mammography Quality Standards Act (MQSA) radiologists rated the likelihood of malignancy for each lesion. We conducted an additional reader study with the same radiologists and a subset of 30 lesions. Radiologists ranked each reconstruction according to their preference. There was disagreement among the six radiologists on which reconstruction produced images with the highest diagnostic content, but they preferred the midsharp/noise image appearance over the others. However, the reconstruction they preferred most did not match with their performance. Due to these disagreements, it may be difficult to develop a single image-based model observer that is representative of a population of radiologists for this particular imaging task
Recommended from our members
Lack of agreement between radiologists: implications for image-based model observers
We tested the agreement of radiologists' rankings of different reconstructions of breast computed tomography images based on their diagnostic (classification) performance and on their subjective image quality assessments. We used 102 pathology proven cases (62 malignant, 40 benign), and an iterative image reconstruction (IIR) algorithm to obtain 24 reconstructions per case with different image appearances. Using image feature analysis, we selected 3 IIRs and 1 clinical reconstruction and 50 lesions. The reconstructions produced a range of image quality from smooth/low-noise to sharp/high-noise, which had a range in classifier performance corresponding to AUCs of 0.62 to 0.96. Six experienced Mammography Quality Standards Act (MQSA) radiologists rated the likelihood of malignancy for each lesion. We conducted an additional reader study with the same radiologists and a subset of 30 lesions. Radiologists ranked each reconstruction according to their preference. There was disagreement among the six radiologists on which reconstruction produced images with the highest diagnostic content, but they preferred the midsharp/noise image appearance over the others. However, the reconstruction they preferred most did not match with their performance. Due to these disagreements, it may be difficult to develop a single image-based model observer that is representative of a population of radiologists for this particular imaging task
Recommended from our members
Relationships Between Human-Extracted MRI Tumor Phenotypes of Breast Cancer and Clinical Prognostic Indicators Including Receptor Status and Molecular Subtype
The purpose of this study was to investigate if human-extracted MRI tumor phenotypes of breast cancer could predict receptor status and tumor molecular subtype using MRIs from The Cancer Genome Atlas project.
Our retrospective interpretation study utilized the analysis of HIPAA-compliant breast MRI data from The Cancer Imaging Archive. One hundred and seven preoperative breast MRIs of biopsy proven invasive breast cancers were analyzed by 3 fellowship-trained breast-imaging radiologists. Each study was scored according to the Breast Imaging Reporting and Data System lexicon for mass and nonmass features. The Spearman rank correlation was used for association analysis of continuous variables; the Kruskal-Wallis test was used for associating continuous outcomes with categorical variables. The Fisher-exact test was used to assess correlations between categorical image-derived features and receptor status. Prediction of estrogen receptor (ER), progesterone receptor, human epidermal growth factor receptor, and molecular subtype were performed using random forest classifiers.
ER+ tumors were associated with the absence of rim enhancement (P = 0.019, odds ratio [OR] 5.5), heterogeneous internal enhancement (P = 0.02, OR 6.5), peritumoral edema (P = 0.0001, OR 10.0), and axillary adenopathy (P = 0.04, OR 4.4). ER+ tumors were smaller than ER- tumors (23.7 mm vs 29.2 mm, P = 0.02, OR 8.2). All of these variables except the lack of axillary adenopathy were also associated with progesterone receptor+ status. Luminal A tumors (n = 57) were smaller compared to nonLuminal A (21.8 mm vs 27.5 mm, P = 0.035, OR 7.3) and lacked peritumoral edema (P = 0.001, OR 6.8). Basal like tumors were associated with heterogeneous internal enhancement (P = 0.05, OR 10.1), rim enhancement (P = 0.05, OR6.9), and perituomral edema (P = 0.0001, OR 13.8).
Human extracted MRI tumor phenotypes may be able to differentiate those tumors with a more favorable clinical prognosis from their more aggressive counterparts
Recommended from our members
Linkage of the ACR National Mammography Database to the Network of State Cancer Registries: Proof of Concept Evaluation by the ACR National Mammography Database Committee
PurposeThe National Mammography Database (NMD) contains nearly 20 million examinations from 693 facilities; it is the largest information source for use and effectiveness of breast imaging in the United States. NMD collects demographic, imaging, interpretation, biopsy, and basic pathology results, enabling facility and physician comparison for quality improvement. However, NMD lacks treatment and clinical outcomes data. The network of state cancer registries (CRs) contains detailed pathologic, treatment, and clinical outcomes data. This pilot study assessed electronic linkage of NMD and CR data at a multicenter institution as proof of concept.Materials and methodsWe obtained Quality Oversight Committee approval for this retrospective study. Data of patients diagnosed with breast cancer in 2014 and 2015 were retrieved from our NMD-approved radiology information system (RIS) and matched with reportable patients in our CR using social security number (SSN), first name (fname), last name (lname), and date of birth (DOB). Matching was repeated without SSN. Percentage and reasons for mismatch were evaluated.ResultsThe RIS query identified 1,316 patients. CR linkage was 99.2% successful (n = 1,305 of 1,316) using SSN, fname, lname, and DOB. Eleven mismatches included four CR case-finding failures, one NMD fname error, five nonreportable in the CR, and one with correct identifiers in both databases. Without SSN, linkage was 97.3% successful (n = 1,281 of 1,316); name errors accounted for 19 and DOB accounted for 5 additional mismatches.ConclusionUsing common data elements, linkage between the NMD and state CRs may be feasible and could provide critical outcomes information to advance accurate assessment of breast imaging in the United States
Recommended from our members
COVID-19 and Breast Radiologist Wellness: Impact of Gender, Financial Loss, and Childcare Need.
PurposeThe purpose of this study was to evaluate the emotional and financial impact of coronavirus disease 2019 (COVID-19) on breast radiologists to understand potential consequences on physician wellness and gender disparities in radiology.MethodsA 41-question survey was distributed from June to September 2020 to members of the Society of Breast Imaging and the National Consortium of Breast Centers. Psychological distress and financial loss scores were calculated on the basis of survey responses and compared across gender and age subgroups. A multivariate logistic model was used to identify factors associated with psychological distress scores.ResultsA total of 628 surveys were completed (18% response rate); the mean respondent age was 52 ± 10 years, and 79% were women. Anxiety was reported by 68% of respondents, followed by sadness (41%), sleep problems (36%), anger (25%), and depression (23%). A higher psychological distress score correlated with female gender (odds ratio [OR], 1.9; P = .001), younger age (OR, 0.8 per SD; P = .005), and a higher financial loss score (OR, 1.4; P < .0001). Participants whose practices had not initiated wellness efforts specific to COVID-19 (54%) had higher psychological distress scores (OR, 1.4; P = .03). Of those with children at home, 38% reported increased childcare needs, higher in women than men (40% versus 29%, P < .001). Thirty-seven percent reported that childcare needs had adversely affected their jobs, which correlated with higher psychological distress scores (OR, 2.2-3.3; P < .05).ConclusionsPsychological distress was highest among younger and female respondents and those with greater pandemic-specific childcare needs and financial loss. Practice-initiated COVID-19-specific wellness efforts were associated with decreased psychological distress. Policies are needed to mitigate pandemic-specific burnout and worsening gender disparities