10 research outputs found
On measuring selection in cancer from subclonal mutation frequencies
Recently available cancer sequencing data have revealed a complex view of the cancer genome containing a multitude of mutations, including drivers responsible for cancer progression and neutral passengers. Measuring selection in cancer and distinguishing drivers from passengers have important implications for development of novel treatment strategies. It has recently been argued that a third of cancers are evolving neutrally, as their mutational frequency spectrum follows a 1/f power law expected from neutral evolution in a particular intermediate frequency range. We study a stochastic model of cancer evolution and derive a formula for the probability distribution of the cancer cell frequency of a subclonal driver, demonstrating that driver frequency is biased towards 0 and 1. We show that it is difficult to capture a driver mutation at an intermediate frequency, and thus the calling of neutrality due to a lack of such driver will significantly overestimate the number of neutrally evolving tumors. Our approach provides quantification of the validity of the 1/f statistic across the entire range of relevant parameter values. We also show that our conclusions remain valid for non-exponential models: spatial 3d model and sigmoidal growth, relevant for early- and late stages of cancer growth
Applications of Machine Learning in Drug Discovery II: Biomarker Discovery, Patient Stratification and Pharmacoeconomics
Cancer remains a leading cause of morbidity and mortality around the world. Despite significant advances in our understanding of the pathology of the disease, and the substantial public and private investment into treatment development, late-stage patients often exhaust therapeutic options. Indeed, in the US alone, there were >1.7 million new cancer diagnoses and >600,000 cancer-associated deaths in 2019. As biology in general and cancer research in particular become ever richer in data, we explore the role of machine learning (ML) in changing the cancer drug development landscape. In the first part of this analysis, we focussed on ML for target identification and drug design. We discussed the growing need for ML-based analysis as we enter an age of clinical -omic data and provided a primer to ML-based techniques for the non-statistician/mathematician. In this chapter, we will explore the problem of tumour heterogeneity together with the role of ML in the discovery and development of cancer biomarkers and for clinical trial design. We end with a brief consideration of the economics of personalised cancer treatment
Quantification of within-sample genetic heterogeneity from SNP-array data
Intra-tumour genetic heterogeneity (ITH) fosters drug resistance and is a critical hurdle to clinical treatment. ITH can be well-measured using multi-region sampling but this is costly and challenging to implement. There is therefore a need for tools to estimate ITH in individual samples, using standard genomic data such as SNP-arrays, that could be implemented routinely. We designed two novel scores S and R, respectively based on the Shannon diversity index and Ripley’s L statistic of spatial homogeneity, to quantify ITH in single SNP-array samples. We created in-silico and in-vitro mixtures of tumour clones, in which diversity was known for benchmarking purposes. We found significant but highly-variable associations of our scores with diversity in-silico (p < 0.001) and moderate associations in–vitro (p = 0.015 and p = 0.085). Our scores were also correlated to previous ITH estimates from sequencing data but heterogeneity in the fraction of tumour cells present across samples hampered accurate quantification. The prognostic potential of both scores was moderate but significantly predictive of survival in several tumour types (corrected p = 0.03). Our work thus shows how individual SNP-arrays reveal intra-sample clonal diversity with moderate accuracy
High-resolution deconstruction of evolution induced by chemotherapy treatments in breast cancer xenografts.
The processes by which tumors evolve are essential to the efficacy of treatment, but quantitative understanding of intratumoral dynamics has been limited. Although intratumoral heterogeneity is common, quantification of evolution is difficult from clinical samples because treatment replicates cannot be performed and because matched serial samples are infrequently available. To circumvent these problems we derived and assayed large sets of human triple-negative breast cancer xenografts and cell cultures from two patients, including 86 xenografts from cyclophosphamide, doxorubicin, cisplatin, docetaxel, or vehicle treatment cohorts as well as 45 related cell cultures. We assayed these samples via exome-seq and/or high-resolution droplet digital PCR, allowing us to distinguish complex therapy-induced selection and drift processes among endogenous cancer subclones with cellularity uncertaint
Modeling the Subclonal Evolution of Cancer Cell Populations
Increasing evidence shows that tumor clonal architectures are often the consequence of a complex branching process, yet little is known about the expected dynamics and extent to which these divergent subclonal expansions occur. Here, we develop and implement more than 88,000 instances of a stochastic evolutionary model simulating genetic drift and neoplastic progression. Under different combinations of population genetic parameter values, including those estimated for colorectal cancer and glioblastoma multiforme, the distribution of sizes of subclones carrying driver mutations had a heavy right tail at the time of tumor detection, with only 1 to 4 dominant clones present at >= 10% frequency. In contrast, the vast majority of subclones were present at = 10% frequency) in a tumor correlated strongly with the number of subclones (<10% of the tumor). Overall, these subclones were frequently below current standard detection thresholds, frequently harbored treatment-resistant mutations, and were more common in slow-growing tumors. Significance: The model presented in this paper addresses tumor heterogeneity by framing expectations for the number of resistant subclones in a tumor, with implications for future studies of the evolution of therapeutic resistance. (C) 2017 AACR
Bulk Genotyping of Biopsies Can Create Spurious Evidence for Hetereogeneity in Mutation Content
<div><p>When multiple samples are taken from the neoplastic tissues of a single patient, it is natural to compare their mutation content. This is often done by bulk genotyping of whole biopsies, but the chance that a mutation will be detected in bulk genotyping depends on its local frequency in the sample. When the underlying mutation count per cell is equal, homogenous biopsies will have more high-frequency mutations, and thus more detectable mutations, than heterogeneous ones. Using simulations, we show that bulk genotyping of data simulated under a neutral model of somatic evolution generates strong spurious evidence for non-neutrality, because the pattern of tissue growth systematically generates differences in biopsy heterogeneity. Any experiment which compares mutation content across bulk-genotyped biopsies may therefore suggest mutation rate or selection intensity variation even when these forces are absent. We discuss computational and experimental approaches for resolving this problem.</p></div
Classifying the evolutionary and ecological features of neoplasms
The consensus conference was supported by Wellcome Genome Campus Advanced Courses and Scientific Conferences. C.C.M. is supported in part by US NIH grants P01 CA91955, R01 CA149566, R01 CA170595, R01 CA185138 and R01 CA140657 as well as CDMRP Breast Cancer Research Program Award BC132057. M.J. is supported by NIH grant K99CA201606. K.S.A. is supported by NCI 5R21 CA196460. K. Polyak is supported by R35 CA197623, U01 CA195469, U54 CA193461, and the Breast Cancer Research Foundation. K.J.P. is supported by NIH grants CA143803, CA163124, CA093900 and CA143055. D.P. is supported by the European Research Council (ERC-617457- PHYLOCANCER), the Spanish Ministry of Economy and Competitiveness (BFU2015-63774-P) and the Education, Culture and University Development Department of the Galician Government. K.S.A. is supported in part by the Breast Cancer Research Foundation and NCI R21CA196460. C.S. is supported by the Royal Society, Cancer Research UK (FC001169), the UK Medical Research Council (FC001169), and the Wellcome Trust (FC001169), NovoNordisk Foundation (ID 16584), the Breast Cancer Research Foundation (BCRF), the European Research Council (THESEUS) and Marie Curie Network PloidyNet. T.A.G. is a Cancer Research UK fellow and a Wellcome Trust funded Investigator. E.S.H. is supported by R01 CA185138-01 and W81XWH-14-1-0473. M.Gerlinger is supported by Cancer Research UK and The Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre. M.Ge., M.Gr., Y.Y., and A.So. were also supported in part by the Wellcome Trust [105104/Z/14/Z]. J.D.S. holds the Edward B. Clark, MD Chair in Pediatric Research, and is supported by the Primary Children's Hospital (PCH) Pediatric Cancer Research Program, funded by the Intermountain Healthcare Foundation and the PCH Foundation. A.S. is supported by the Chris Rokos Fellowship in Evolution and Cancer. Y.Y. is a Cancer Research UK fellow and supported by The Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre. E.S.H. was supported in part by PCORI grants 1505–30497 and 1503–29572, NIH grants R01 CA185138, T32 CA093245, and U10 CA180857, CDMRP Breast Cancer Research Program Award BC132057, a CRUK Grand Challenge grant, and the Breast Cancer Research Foundation. A.R.A.A. was funded in part by NIH grant U01CA151924. A.R.A.A., R.G. and J.S.B. were funded in part by NIH grant U54CA193489
Artificial Intelligence in Oncology Drug Discovery and Development
There exists a profound conflict at the heart of oncology drug development. The efficiency of the drug development process is falling, leading to higher costs per approved drug, at the same time personalised medicine is limiting the target market of each new medicine. Even as the global economic burden of cancer increases, the current paradigm in drug development is unsustainable. In this book, we discuss the development of techniques in machine learning for improving the efficiency of oncology drug development and delivering cost-effective precision treatment. We consider how to structure data for drug repurposing and target identification, how to improve clinical trials and how patients may view artificial intelligence
Recommended from our members
Studying the Clonal Origins of Drug Resistance in Human Breast Cancers
Despite remarkable advances in our understanding of the drivers of human malignancies, new targeted therapies often fail to show sufficient efficacy in clinical trials. Indeed, the cost of bringing a new agent to market has risen substantially in the last several decades, fuelled partly by lack of efficacy in late phase clinical trials. Even in cases where a new agent is deemed 'successful', the development of resistance is often seen as inevitable and clinical responses can be fleeting. Typically, resistance to targeted therapies is thought to arise from pre-existing populations within the tumour, rather than from de novo evolution, yet few studies have experimentally tested this understanding. Indeed, recent reports in the literature have described epigenetically regulated drug tolerant populations within cancers, defined by cell-cycle regulation and/or quiescent repopulation dynamics, drug induced chromatin remodelling or differential transcription factor binding, that can be transient or permanent in nature. This thesis will outline experiments using high complexity molecular barcodes to trace the fate of individual cellular clones in the development of drug resistance. With this technique, cellular clones can be uncoupled from their genomic backgrounds, giving a new depth to our understanding of clonal selection in cancer. In particular, high complexity barcodes are used to identify a pre-existing tamoxifen resistant population in the MCF7 cell line. This resistance phenotype is then linked to the induction of embryonic transcription factor OCT4. Finally, we use our molecular barcoding technique to interrogate the repopulation dynamics of a breast cancer PDX model, supporting their use as complex model systems suitable for studying the origins and consequences of tumour heterogeneity.Cancer Research U