11 research outputs found

    Cytologic predictors of malignancy in bile duct brushings: a multi-reviewer analysis of 60 cases

    No full text
    Diagnosing malignancy in bile duct brushings is highly challenging. Seven reviewers of variable backgrounds and levels of participation in bile duct brushing sign out blindly reviewed 60 specimens (30 malignant with histologic confirmation and 30 benign (15 stented) with resection or >= 18 months of uneventful follow-up), testing the utility of 14 malignant characteristics. Eleven characteristics were statistically significantly associated with malignancy including 3-dimensional clusters (63% in malignant vs 3% in benign, odds ratio 50, P=0.0003), pleomorphism (62 vs 3, odds ratio 48, P=0.0004), 2-cell population (60% vs 3, odds ratio 44, P=0.0005), chromatin pattern (hypo/hyperchromasia) changes (70% vs 7%, odds ratio 33, P= 3 malignant characteristics, while 23 (77%) benign brushings had none. Of 20 brushings with >= 4 characteristics, 1(5%) proved benign and showed detachment atypia, a close malignant mimicker in brushings. Identification of 3 characteristics maximized the combined sensitivity (70%), specificity (97%) and accuracy (83%), but sensitivity dropped as number of characteristics increased. Identification of 3/11 characteristics (3-dimensional clusters, pleomorphism, high nuclear-to-cytoplasmic ratio, nuclear irregularity, hypercellularity, discohesion, chromatin changes, vacuoles, prominent nucleoli, molding and 2-cell population) improves pathologists' overall performance greatly

    Multi-institutional Assessment of Pathologist Scoring HER2 Immunohistochemistry.

    No full text
    The HercepTest was approved 20+ years ago as the companion diagnostic test for trastuzumab in human epidermal growth factor 2 (HER2) or ERBB2 gene-amplified/overexpressing breast cancers. Subsequent HER2 immunohistochemistry (IHC) assays followed, including the now most common Ventana 4B5 assay. Although this IHC assay has become the clinical standard, its reliability, reproducibility, and accuracy have largely been approved and accepted on the basis of concordance among small numbers of pathologists without validation in a real-world setting. In this study, we evaluated the concordance and interrater reliability of scoring HER2 IHC in 170 breast cancer biopsies by 18 breast cancer-specialized pathologists from 15 institutions. We used the Observers Needed to Evaluate Subjective Tests method to determine the plateau of concordance and the minimum number of pathologists needed to estimate interrater agreement values for large numbers of raters, as seen in the real-world setting. We report substantial discordance within the intermediate categories (\u3c1% agreement for 1+ and 3.6% agreement for 2+) in the 4-category HER2 IHC scoring system. The discordance within the IHC 0 cases is also substantial with an overall percent agreement (OPA) of only 25% and poor interrater reliability metrics (0.49 Fleiss\u27 kappa, 0.55 intraclass correlation coefficient). This discordance can be partially reduced by using a 3-category system (28.8% vs 46.5% OPA for 4-category and 3-category scoring systems, respectively). Observers Needed to Evaluate Subjective Tests plots suggest that the OPA for the task of determining a HER2 IHC score 0 from not 0 plateaus statistically around 59.4% at 10 raters. Conversely, at the task of scoring HER2 IHC as 3+ or not 3+ pathologists\u27 concordance was much higher with an OPA that plateaus at 87.1% with 6 raters. This suggests that legacy HER2 IHC remains valuable for finding the patients in whom the ERBB2 gene is amplified but unacceptably discordant in assigning HER2-low or HER2-negative status for the emerging HER2-low therapies
    corecore