42 research outputs found
Deep Learning-Based Grading of Ductal Carcinoma In Situ in Breast Histopathology Images
Ductal carcinoma in situ (DCIS) is a non-invasive breast cancer that can
progress into invasive ductal carcinoma (IDC). Studies suggest DCIS is often
overtreated since a considerable part of DCIS lesions may never progress into
IDC. Lower grade lesions have a lower progression speed and risk, possibly
allowing treatment de-escalation. However, studies show significant
inter-observer variation in DCIS grading. Automated image analysis may provide
an objective solution to address high subjectivity of DCIS grading by
pathologists.
In this study, we developed a deep learning-based DCIS grading system. It was
developed using the consensus DCIS grade of three expert observers on a dataset
of 1186 DCIS lesions from 59 patients. The inter-observer agreement, measured
by quadratic weighted Cohen's kappa, was used to evaluate the system and
compare its performance to that of expert observers. We present an analysis of
the lesion-level and patient-level inter-observer agreement on an independent
test set of 1001 lesions from 50 patients.
The deep learning system (dl) achieved on average slightly higher
inter-observer agreement to the observers (o1, o2 and o3)
() than the
observers amongst each other () at the lesion-level. At the patient-level, the deep
learning system achieved similar agreement to the observers
() as the
observers amongst each other ().
In conclusion, we developed a deep learning-based DCIS grading system that
achieved a performance similar to expert observers. We believe this is the
first automated system that could assist pathologists by providing robust and
reproducible second opinions on DCIS grade
Alcohol consumption and breast tumor gene expression
Background Alcohol consumption is an established risk factor for breast cancer and the association generally appears stronger among estrogen receptor (ER)-positive tumors. However, the biological mechanisms underlying this association are not completely understood. Methods We analyzed messenger RNA (mRNA) microarray data from both invasive breast tumors (N = 602) and tumor-adjacent normal tissues (N = 508) from participants diagnosed with breast cancer in the Nurses’ Health Study (NHS) and NHSII. Multivariable linear regression, controlling for other known breast cancer risk factors, was used to identify differentially expressed genes by pre-diagnostic alcohol intake. For pathway analysis, we performed gene set enrichment analysis (GSEA). Differentially expressed genes or enriched pathway-defined gene sets with false discovery rate (FDR) \u3c0.1 identified in tumors were validated in RNA sequencing data of invasive breast tumors (N = 166) from The Cancer Genome Atlas. Results No individual genes were significantly differentially expressed by alcohol consumption in the NHS/NHSII. However, GSEA identified 33 and 68 pathway-defined gene sets at FDR \u3c0.1 among 471 ER+ and 127 ER- tumors, respectively, all of which were validated. Among ER+ tumors, consuming 10+ grams of alcohol per day (vs. 0) was associated with upregulation in RNA metabolism and transport, cell cycle regulation, and DNA repair, and downregulation in lipid metabolism. Among ER- tumors, in addition to upregulation in RNA processing and cell cycle, alcohol intake was linked to overexpression of genes involved in cytokine signaling, including interferon and transforming growth factor (TGF)-β signaling pathways, and translation and post-translational modifications. Lower lipid metabolism was observed in both ER+ tumors and ER+ tumor-adjacent normal samples. Most of the significantly enriched gene sets identified in ER- tumors showed a similar enrichment pattern among ER- tumor-adjacent normal tissues. Conclusions Our data suggest that moderate alcohol consumption (i.e. 10+ grams/day, equivalent to one or more drinks/day) is associated with several specific and reproducible biological processes and pathways, which adds potential new insight into alcohol-related breast carcinogenesis
Predicting breast tumor proliferation from whole-slide images : the TUPAC16 challenge
Tumor proliferation is an important biomarker indicative of the prognosis of breast cancer patients. Assessment of tumor proliferation in a clinical setting is a highly subjective and labor-intensive task. Previous efforts to automate tumor proliferation assessment by image analysis only focused on mitosis detection in predefined tumor regions. However, in a real-world scenario, automatic mitosis detection should be performed in whole-slide images (WSIs) and an automatic method should be able to produce a tumor proliferation score given a WSI as input. To address this, we organized the TUmor Proliferation Assessment Challenge 2016 (TUPAC16) on prediction of tumor proliferation scores from WSIs.
The challenge dataset consisted of 500 training and 321 testing breast cancer histopathology WSIs. In order to ensure fair and independent evaluation, only the ground truth for the training dataset was provided to the challenge participants. The first task of the challenge was to predict mitotic scores, i.e., to reproduce the manual method of assessing tumor proliferation by a pathologist. The second task was to predict the gene expression based PAM50 proliferation scores from the WSI.
The best performing automatic method for the first task achieved a quadratic-weighted Cohen's kappa score of κ = 0.567, 95% CI [0.464, 0.671] between the predicted scores and the ground truth. For the second task, the predictions of the top method had a Spearman's correlation coefficient of r = 0.617, 95% CI [0.581 0.651] with the ground truth.
This was the first comparison study that investigated tumor proliferation assessment from WSIs. The achieved results are promising given the difficulty of the tasks and weakly-labeled nature of the ground truth. However, further research is needed to improve the practical utility of image analysis methods for this task
The molecular basis of breast cancer pathological phenotypes
The histopathological evaluation of morphological features in breast tumours provides prognostic information to guide therapy. Adjunct molecular analyses provide further diagnostic, prognostic and predictive information. However, there is limited knowledge of the molecular basis of morphological phenotypes in invasive breast cancer. This study integrated genomic, transcriptomic and protein data to provide a comprehensive molecular profiling of morphological features in breast cancer. Fifteen pathologists assessed 850 invasive breast cancer cases from The Cancer Genome Atlas (TCGA). Morphological features were significantly associated with genomic alteration, DNA methylation subtype, PAM50 and microRNA subtypes, proliferation scores, gene expression and/or RPPA subtype. Marked nuclear pleomorphism, necrosis, inflammation and high mitotic count were associated with Basal-like subtype and have similar molecular basis. Omics-based signatures were constructed to predict morphological features. The association of morphology transcriptome signatures with overall survival in oestrogen receptor (ER)-positive and ER-negative breast cancer was first assessed using the METABRIC dataset; signatures that remained prognostic in the METABRIC multivariate analysis were further evaluated in five additional datasets. The transcriptomic signature of epithelial tubule formation was prognostic in ER-positive breast cancer. No signature was prognostic in ER-negative. This study provided new insights into the molecular basis of breast cancer morphological phenotypes. The integration of morphological with molecular data has potential to refine breast cancer classification, predict response to therapy, enhance our understanding of breast cancer biology and improve clinical management. This work is publicly accessible at www.dx.ai/tcga_breast
Automated clear cell renal carcinoma grade classification with prognostic significance.
We developed an automated 2-tiered Fuhrman's grading system for clear cell renal cell carcinoma (ccRCC). Whole slide images (WSI) and clinical data were retrieved for 395 The Cancer Genome Atlas (TCGA) ccRCC cases. Pathologist 1 reviewed and selected regions of interests (ROIs). Nuclear segmentation was performed. Quantitative morphological, intensity, and texture features (n = 72) were extracted. Features associated with grade were identified by constructing a Lasso model using data from cases with concordant 2-tiered Fuhrman's grades between TCGA and Pathologist 1 (training set n = 235; held-out test set n = 42). Discordant cases (n = 118) were additionally reviewed by Pathologist 2. Cox proportional hazard model evaluated the prognostic efficacy of the predicted grades in an extended test set which was created by combining the test set and discordant cases (n = 160). The Lasso model consisted of 26 features and predicted grade with 84.6% sensitivity and 81.3% specificity in the test set. In the extended test set, predicted grade was significantly associated with overall survival after adjusting for age and gender (Hazard Ratio 2.05; 95% CI 1.21-3.47); manual grades were not prognostic. Future work can adapt our computational system to predict WHO/ISUP grades, and validating this system on other ccRCC cohorts