11 research outputs found

    Beyond attention: deriving biologically interpretable insights from weakly-supervised multiple-instance learning models

    Recent advances in attention-based multiple instance learning (MIL) have improved our insights into the tissue regions that models rely on to make predictions in digital pathology. However, the interpretability of these approaches is still limited. In particular, they do not report whether high-attention regions are positively or negatively associated with the class labels or how well these regions correspond to previously established clinical and biological knowledge. We address this by introducing a post-training methodology to analyse MIL models. Firstly, we introduce prediction-attention-weighted (PAW) maps by combining tile-level attention and prediction scores produced by a refined encoder, allowing us to quantify the predictive contribution of high-attention regions. Secondly, we introduce a biological feature instantiation technique by integrating PAW maps with nuclei segmentation masks. This further improves interpretability by providing biologically meaningful features related to the cellular organisation of the tissue and facilitates comparisons with known clinical features. We illustrate the utility of our approach by comparing PAW maps obtained for prostate cancer diagnosis (i.e. samples containing malignant tissue, 381/516 tissue samples) and prognosis (i.e. samples from patients with biochemical recurrence following surgery, 98/663 tissue samples) in a cohort of patients from the International Cancer Genome Consortium (ICGC UK Prostate Group). Our approach reveals that regions that are predictive of adverse prognosis do not tend to co-locate with the tumour regions, indicating that non-cancer cells should also be studied when evaluating prognosis.
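The abstract above does not give the formula behind PAW maps, so the following is only a minimal sketch of one plausible way to combine tile-level attention with tile-level prediction scores into a signed map. The function name `paw_scores`, the min-max normalisation of attention, and the centring of predictions around 0.5 are all assumptions made for illustration, not the authors' method.

```python
from typing import List, Sequence


def paw_scores(attention: Sequence[float], prediction: Sequence[float]) -> List[float]:
    """Return one signed score per tile (hypothetical PAW-style combination).

    ASSUMPTION: attention is min-max normalised to [0, 1] and each prediction
    score in [0, 1] is centred to [-1, 1]; the product is the tile score.
    Positive scores mark attended tiles that support the positive class,
    negative scores mark attended tiles that argue against it.
    """
    if len(attention) != len(prediction):
        raise ValueError("attention and prediction must have the same length")
    if not attention:
        return []

    lowest, highest = min(attention), max(attention)
    span = highest - lowest
    # If every tile has the same attention, treat all tiles as fully attended.
    weights = [(a - lowest) / span if span > 0 else 1.0 for a in attention]

    # Centre predictions so that 0.5 (uninformative) maps to 0.
    return [w * (2.0 * p - 1.0) for w, p in zip(weights, prediction)]


if __name__ == "__main__":
    # Four made-up tiles: attention weights and positive-class probabilities.
    tile_attention = [0.1, 0.5, 0.3, 0.1]
    tile_prediction = [0.9, 0.2, 0.8, 0.5]
    for index, score in enumerate(paw_scores(tile_attention, tile_prediction)):
        print(f"tile {index}: {score:+.2f}")
```

In this toy input, the most attended tile receives a negative score because its prediction is below 0.5, which is the kind of sign information that attention alone does not convey.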

    Genomic evolution shapes prostate cancer disease type

    The development of cancer is an evolutionary process involving the sequential acquisition of genetic alterations that disrupt normal biological processes, enabling tumor cells to rapidly proliferate and eventually invade and metastasize to other tissues. We investigated the genomic evolution of prostate cancer through the application of three separate classification methods, each designed to investigate a different aspect of tumor evolution. Integrating the results revealed the existence of two distinct types of prostate cancer that arise from divergent evolutionary trajectories, designated as the Canonical and Alternative evolutionary disease types. We therefore propose the evotype model for prostate cancer evolution wherein Alternative-evotype tumors diverge from those of the Canonical-evotype through the stochastic accumulation of genetic alterations associated with disruptions to androgen receptor DNA binding. Our model unifies many previous molecular observations, providing a powerful new framework to investigate prostate cancer disease progression.
    H.R.F. was supported by a Cancer Research UK Programme Grant to Simon Tavaré (C14303/A17197), as, partially, was A.G.L. A.G.L. acknowledges the support of the University of St Andrews. A.G.L. and J.H.R.F. also acknowledge the support of the Cambridge Cancer Research Fund.

    The architecture of clonal expansions in morphologically normal tissue from cancerous and non-cancerous prostates

    Background: Up to 80% of cases of prostate cancer present with multifocal independent tumour lesions, leading to the concept of a field effect present in the normal prostate predisposing to cancer development. In the present study we applied Whole Genome DNA Sequencing (WGS) to a group of morphologically normal tissue samples (n = 51), including benign prostatic hyperplasia (BPH) and non-BPH samples, from men with and men without prostate cancer. We assess whether the observed genetic changes in morphologically normal tissue are linked to the development of cancer in the prostate.
    Results: Single nucleotide variants (P = 7.0 × 10⁻³, Wilcoxon rank sum test) and small insertions and deletions (indels, P = 8.7 × 10⁻⁶) were significantly higher in morphologically normal samples, including BPH, from men with prostate cancer compared to those without. The presence of subclonal expansions under selective pressure, supported by a high level of mutations, was significantly associated with samples from men with prostate cancer (P = 0.035, Fisher exact test). The clonal cell fraction of normal clones was always higher than the proportion of the prostate estimated as epithelial (P = 5.94 × 10⁻⁵, paired Wilcoxon signed rank test) which, along with analysis of primary fibroblasts prepared from BPH specimens, suggests a stromal origin. Constructed phylogenies revealed lineages associated with benign tissue that were completely distinct from adjacent tumour clones, but a common lineage between BPH and non-BPH morphologically normal tissues was often observed. Compared to tumours, normal samples have significantly fewer single nucleotide variants (P = 3.72 × 10⁻⁹, paired Wilcoxon signed rank test), very few rearrangements and a complete lack of copy number alterations.
    Conclusions: Cells within regions of morphologically normal tissue (both BPH and non-BPH) can expand under selective pressure by mechanisms that are distinct from those occurring in adjacent cancer, but that are allied to the presence of cancer. Expansions, which are probably stromal in origin, are characterised by lack of recurrent driver mutations, by almost complete absence of structural variants/copy number alterations, and by mutational processes similar to those in malignant tissue. Our findings have implications for treatment (focal therapy) and early detection approaches.

    Appraising the relevance of DNA copy number loss and gain in prostate cancer using whole genome DNA sequence data.

    A variety of models have been proposed to explain regions of recurrent somatic copy number alteration (SCNA) in human cancer. Our study employs Whole Genome DNA Sequence (WGS) data from tumor samples (n = 103) to comprehensively assess the role of the Knudson two-hit genetic model in SCNA generation in prostate cancer. 64 recurrent regions of loss and gain were detected, of which 28 were novel, including regions of loss with more than 15% frequency at Chr4p15.2-p15.1 (15.53%), Chr6q27 (16.50%) and Chr18q12.3 (17.48%). Comprehensive mutation screens of genes, lincRNA encoding sequences, control regions and conserved domains within SCNAs demonstrated that a two-hit genetic model was supported in only a minor proportion of recurrent SCNA losses examined (15/40). We found that recurrent breakpoints and regions of inversion often occur within Knudson model SCNAs, leading to the identification of ZNF292 as a target gene for the deletion at 6q14.3-q15 and NKX3.1 as a two-hit target at 8p21.3-p21.2. The importance of alterations of lincRNA sequences was illustrated by the identification of a novel mutational hotspot at the KCCAT42, FENDRR, CAT1886 and STCAT2 loci at the 16q23.1-q24.3 loss. Our data confirm that the burden of SCNAs is predictive of biochemical recurrence, define nine individual regions that are associated with relapse, and highlight the possible importance of ion channel and G-protein coupled-receptor (GPCR) pathways in cancer development. We concluded that a two-hit genetic model accounts for about one third of SCNA losses, indicating that other mechanisms, such as haploinsufficiency and epigenetic inactivation, account for the remaining SCNA losses.
    We acknowledge support from Cancer Research UK (C5047/A22530, C309/A11566, C368/A6743, A368/A7990, C14303/A17197) and the Dallaglio Foundation. We also acknowledge support from the National Institute of Health Research (NIHR) (The Biomedical Research Centre at The Institute of Cancer Research & The Royal Marsden NHS Foundation Trust and the project "Prostate Cancer: Mechanisms of Progression and Treatment (PROMPT)" [G0500966/75466]). We thank the Wellcome Trust, Bob Champion Cancer Trust, The Orchid Cancer appeal, The RoseTrees Trust, The North West Cancer Research Fund, Big C, The King family, and The Masonic Charitable Foundation for funding. This research is supported by the Francis Crick Institute which receives its core funding from Cancer Research UK (FC001202), the UK Medical Research Council (FC001202), and the Wellcome Trust (FC001202). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
