13 research outputs found

    Analytical validation of a standardised scoring protocol for Ki67 immunohistochemistry on breast cancer excision whole sections: an international multicentre collaboration

    Get PDF
    Aims The nuclear proliferation marker Ki67 assayed by immunohistochemistry has multiple potential uses in breast cancer, but an unacceptable level of interlaboratory variability has hampered its clinical utility. The International Ki67 in Breast Cancer Working Group has undertaken a systematic programme to determine whether Ki67 measurement can be analytically validated and standardised among laboratories. This study addresses whether acceptable scoring reproducibility can be achieved on excision whole sections. Methods and results Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 and sections were circulated among 23 pathologists in 12 countries. All pathologists scored Ki67 by two methods: (i) global: four fields of 100 tumour cells each were selected to reflect observed heterogeneity in nuclear staining; (ii) hot-spot: the field with highest apparent Ki67 index was selected and up to 500 cells scored. The intraclass correlation coefficient (ICC) for the global method [confidence interval (CI) = 0.87; 95% CI = 0.799-0.93] marginally met the prespecified success criterion (lower 95% CI >= 0.8), while the ICC for the hot-spot method (0.83; 95% CI = 0.74-0.90) did not. Visually, interobserver concordance in location of selected hot-spots varies between cases. The median times for scoring were 9 and 6 min for global and hot-spot methods, respectively. Conclusions The global scoring method demonstrates adequate reproducibility to warrant next steps towards evaluation for technical and clinical validity in appropriate cohorts of cases. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between laboratories further

    Analytical validation of a standardized scoring protocol for Ki67: phase 3 of an international multicenter collaboration

    Get PDF
    Pathological analysis of the nuclear proliferation biomarker Ki67 has multiple potential roles in breast and other cancers. However, clinical utility of the immunohistochemical (IHC) assay for Ki67 immunohistochemistry has been hampered by unacceptable between-laboratory analytical variability. The International Ki67 Working Group has conducted a series of studies aiming to decrease this variability and improve the evaluation of Ki67. This study tries to assess whether acceptable performance can be achieved on prestained core-cut biopsies using a standardized scoring method. Sections from 30 primary ER+ breast cancer core biopsies were centrally stained for Ki67 and circulated among 22 laboratories in 11 countries. Each laboratory scored Ki67 using three methods: (1) global (4 fields of 100 cells each); (2) weighted global (same as global but weighted by estimated percentages of total area); and (3) hot-spot (single field of 500 cells). The intraclass correlation coefficient (ICC), a measure of interlaboratory agreement, for the unweighted global method (0.87; 95% credible interval (CI): 0.81–0.93) met the prespecified success criterion for scoring reproducibility, whereas that for the weighted global (0.87; 95% CI: 0.7999–0.93) and hot-spot methods (0.84; 95% CI: 0.77–0.92) marginally failed to do so. The unweighted global assessment of Ki67 IHC analysis on core biopsies met the prespecified criterion of success for scoring reproducibility. A few cases still showed large scoring discrepancies. Establishment of external quality assessment schemes is likely to improve the agreement between laboratories further. Additional evaluations are needed to assess staining variability and clinical validity in appropriate cohorts of samples

    An international study to increase concordance in Ki67 scoring.

    No full text
    Although an important biomarker in breast cancer, Ki67 lacks scoring standardization, which has limited its clinical use. Our previous study found variability when laboratories used their own scoring methods on centrally stained tissue microarray slides. In this current study, 16 laboratories from eight countries calibrated to a specific Ki67 scoring method and then scored 50 centrally MIB-1 stained tissue microarray cases. Simple instructions prescribed scoring pattern and staining thresholds for determination of the percentage of stained tumor cells. To calibrate, laboratories scored 18 'training' and 'test' web-based images. Software tracked object selection and scoring. Success for the calibration was prespecified as Root Mean Square Error of scores compared with reference 0.70 but aiming for observed intraclass correlation ≄0.90. Laboratory performance showed non-significant but promising trends of improvement through the calibration exercise (mean Root Mean Square Error decreased from 0.6 to 0.4, Maximum Absolute Deviation from 1.6 to 0.9; paired t-test: P=0.07 for Root Mean Square Error, 0.06 for Maximum Absolute Deviation). For tissue microarray scoring, the intraclass correlation estimate was 0.94 (95% credible interval: 0.90-0.97), markedly and significantly >0.70, the prespecified minimum target for success. Some discrepancies persisted, including around clinically relevant cutoffs. After calibrating to a common scoring method via a web-based tool, laboratories can achieve high inter-laboratory reproducibility in Ki67 scoring on centrally stained tissue microarray slides. Although these data are potentially encouraging, suggesting that it may be possible to standardize scoring of Ki67 among pathology laboratories, clinically important discrepancies persist. Before this biomarker could be recommended for clinical use, future research will need to extend this approach to biopsies and whole sections, account for staining variability, and link to outcomes.Modern Pathology advance online publication, 20 February 2015; doi:10.1038/modpathol.2015.38

    Analytical validation of a standardized scoring protocol for Ki67 assessed on breast excision whole sections:An international multicenter collaboration

    No full text
    Aims: (i) Determine whether between-observer reproducibility for Ki67 when assessed on whole sections according to a standardized scoring protocol is adequate for clinical application. (ii) Compare between-observer reproducibility of Ki67 scores assessed on hot-spots to scores using a global method that averages across a tissue section.Background: The nuclear proliferation biomarker Ki67 has multiple potential roles in breast cancer, including aiding decisions based on prognosis, but unacceptable levels of between-laboratory variability have been observed. The International Ki67 in Breast Cancer Working Group has undertaken a systematic program to determine whether Ki67 measurement can be analytically validated and standardized across labs. In phase 1, variability in visual interpretation was identified as an important source of variability. Phases 2 and 3a showed that adherence to defined scoring methods substantially improved reproducibility in scoring tissue microarrays and core-cut biopsies. We now assess whether acceptable reproducibility can be achieved on whole sections.Methods: Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 to assemble 4 sets of 30 stained tumor sections, circulated around 23 labs in 12 countries. Ki67 was scored by 2 methods by all labs: (a) global: 4 fields of 100 tumor cells each were selected to reflect observed heterogeneity in nuclear staining (b) hot-spot: the field with highest Ki67 percentage of tumor cells with nuclear staining was selected and up to 500 cells scored. Ki67 scores were log2-transformed for statistical analyses and back-transformed for presentation. The primary objective was to assess whether either method could achieve an intraclass correlation coefficient (ICC) significantly greater than 0.8, considered substantial to almost-perfect reproducibility. Secondary objectives were to assess which method had highest observed ICC and to assess whether observers identified the same “hot-spots”.Results: ICC for the global method was 0.87 (95%CI: 0.799-0.93), marginally meeting the prespecified success criterion. The ICC for the hot-spot method was 0.83 (95%CI: 0.74-0.90) and had a CI extending below the success criterion. Across the 23 labs, geometric mean value of the 30 scores ranged from 8.5 to 19.6 for the global method and from 12.8 to 30.3 for the hot-spot method. The overall mean (95% CI) of these values was 12.9 (11.9-14.0) and 20.9 (19.1-22.8), respectively. Visually, between-laboratory agreement in location of selected hot-spot varies between cases. The median times for scoring were 9 and 6 minutes for global and hot-spot methods respectively.Conclusions: The global method marginally met the prespecified criterion of success; it should now be evaluated for clinical validity in appropriate cohorts of cases. The hot-spot method was observed to have slightly less reproducibility between labs. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between labs furthe

    Systematically higher Ki67 scores on core biopsy samples compared to corresponding resection specimen in breast cancer: a multi-operator and multi-institutional study

    Get PDF
    International audienceAbstract Ki67 has potential clinical importance in breast cancer but has yet to see broad acceptance due to inter-laboratory variability. Here we tested an open source and calibrated automated digital image analysis (DIA) platform to: (i) investigate the comparability of Ki67 measurement across corresponding core biopsy and resection specimen cases, and (ii) assess section to section differences in Ki67 scoring. Two sets of 60 previously stained slides containing 30 core-cut biopsy and 30 corresponding resection specimens from 30 estrogen receptor-positive breast cancer patients were sent to 17 participating labs for automated assessment of average Ki67 expression. The blocks were centrally cut and immunohistochemically (IHC) stained for Ki67 (MIB-1 antibody). The QuPath platform was used to evaluate tumoral Ki67 expression. Calibration of the DIA method was performed as in published studies. A guideline for building an automated Ki67 scoring algorithm was sent to participating labs. Very high correlation and no systematic error ( p = 0.08) was found between consecutive Ki67 IHC sections. Ki67 scores were higher for core biopsy slides compared to paired whole sections from resections ( p ≀ 0.001; median difference: 5.31%). The systematic discrepancy between core biopsy and corresponding whole sections was likely due to pre-analytical factors (tissue handling, fixation). Therefore, Ki67 IHC should be tested on core biopsy samples to best reflect the biological status of the tumor
    corecore