36 research outputs found

    CNETML: Maximum likelihood inference of phylogeny from copy number profiles of spatio-temporal samples

    Get PDF
    Phylogenetic trees based on copy number alterations (CNAs) for multi-region samples of a single cancer patient are helpful to understand the spatio-temporal evolution of cancers, especially in tumours driven by chromosomal instability. Due to the high cost of deep sequencing data, low-coverage data are more accessible in practice, which only allow the calling of (relative) total copy numbers due to the lower resolution. However, methods to reconstruct sample phylogenies from CNAs often use allele-specific copy numbers and those using total copy number are mostly distance matrix or maximum parsimony methods which do not handle temporal data or estimate mutation rates. In this work, we developed a new maximum likelihood method based on a novel evolutionary model of CNAs, CNETML, to infer phylogenies from spatio-temporal samples taken within a single patient. CNETML is the first program to jointly infer the tree topology, node ages, and mutation rates from total copy numbers when samples were taken at different time points. Our extensive simulations suggest CNETML performed well even on relative copy numbers with subclonal whole genome doubling events and under slight violation of model assumptions. The application of CNETML to real data from Barrett’s esophagus patients also generated consistent results with previous discoveries and novel early CNAs for further investigations

    CNETML: maximum likelihood inference of phylogeny from copy number profiles of multiple samples

    Get PDF
    Phylogenetic trees based on copy number profiles from multiple samples of a patient are helpful to understand cancer evolution. Here, we develop a new maximum likelihood method, CNETML, to infer phylogenies from such data. CNETML is the first program to jointly infer the tree topology, node ages, and mutation rates from total copy numbers of longitudinal samples. Our extensive simulations suggest CNETML performs well on copy numbers relative to ploidy and under slight violation of model assumptions. The application of CNETML to real data generates results consistent with previous discoveries and provides novel early copy number events for further investigation

    The mutational signatures of formalin fixation on the human genome

    Get PDF
    Clinical archives of patient material near-exclusively consist of formalin-fixed and paraffin-embedded (FFPE) blocks. The ability to precisely characterise mutational signatures from FFPE-derived DNA has tremendous translational potential. However, sequencing of DNA derived from FFPE material is known to be riddled with artefacts. Here we derive genome-wide mutational signatures caused by formalin fixation. We show that the FFPE-signature is highly similar to signature 30 (the signature of Base Excision Repair deficiency due to NTHL1 mutations), and chemical repair of DNA lesions leads to a signature highly similar to signature 1 (clock-like signature due to spontaneous deamination of methylcytosine). We demonstrate that using uncorrected mutational catalogues of FFPE samples leads to major mis-assignment of signature activities. To correct for this, we introduce FFPEsig, a computational algorithm to rectify the formalin-induced artefacts in the mutational catalogue. We demonstrate that FFPEsig enables accurate mutational signature analysis both in simulated and whole-genome sequenced FFPE cancer samples. FFPEsig thus provides an opportunity to unlock additional clinical potential of archival patient tissues.Peer reviewe

    Implications of Epigenetic Drift in Colorectal Neoplasia

    Get PDF
    NIH grants U01CA182940 (G.E. Luebeck, W.D. Hazelton, W.M. Grady, S.K. Madden, K. Curtius), U01CA199336 (G.E. Luebeck, W.D. Hazelton); Barts Charity grant 472-2300, London (K. Curtius) and UK Medical Research Council Rutherford fellowship (K. Curtius); and NIH grants (P30CA15704, U01CA152756, R01CA194663, R01CA220004, U54CA143862, P01CA077852),R.A.C.E. Charities, Cottrell Family Fund, R03CA165153, Listwin Family Foundation, Seattle Translational Tumor Research program, Fred Hutchinson Cancer Research Center (S.K. Madden, M. Yu, K.T. Carter, and W.M. Grady), R01CA189184 (C. Lee, C.M. Ulrich, S.K.Madden, M. Yu, K.T. Carter, and W.M. Grady), R01CA112516, R01CA114467, R01CA120523 (C.M. Ulrich, S.K. Madden, M. Yu, K.T. Carter, and W.M. Grady), Huntsman Cancer Foundation, U01 CA206110, R01CA189184 R01CA 207371 and P30CACA042014 (C.M. Ulrich). U24CA074794 (P.A. Newcomb, S.K. Madden, M. Yu, K.T. Carter, and W.M. Grady). This material is the result of work supported in part by resources from the VA Puget Sound Health Care System and the ColoCare Study

    Evolutionary history of human colitis-associated colorectal cancer

    Get PDF
    Objective: IBD confers an increased lifetime risk of developing colorectal cancer (CRC), and colitis-associated CRC (CA-CRC) is molecularly distinct from sporadic CRC (S-CRC). Here we have dissected the evolutionary history of CA-CRC using multiregion sequencing. Design: Exome sequencing was performed on fresh-frozen multiple regions of carcinoma, adjacent non-cancerous mucosa and blood from 12 patients with CA-CRC (n=55 exomes), and key variants were validated with orthogonal methods. Genome-wide copy number profiling was performed using single nucleotide polymorphism arrays and low-pass whole genome sequencing on archival non-dysplastic mucosa (n=9), low-grade dysplasia (LGD; n=30), high-grade dysplasia (HGD; n=13), mixed LGD/HGD (n=7) and CA-CRC (n=19). Phylogenetic trees were reconstructed, and evolutionary analysis used to reveal the temporal sequence of events leading to CA-CRC. Results: 10/12 tumours were microsatellite stable with a median mutation burden of 3.0 single nucleotide alterations (SNA) per Mb, ~20% higher than S-CRC (2.5 SNAs/Mb), and consistent with elevated ageing-associated mutational processes. Non-dysplastic mucosa had considerable mutation burden (median 47 SNAs), including mutations shared with the neighbouring CA-CRC, indicating a precancer mutational field. CA-CRCs were often near triploid (40%) or near tetraploid (20%) and phylogenetic analysis revealed that copy number alterations (CNAs) began to accrue in non-dysplastic bowel, but the LGD/HGD transition often involved a punctuated ‘catastrophic’ CNA increase. Conclusions: Evolutionary genomic analysis revealed precancer clones bearing extensive SNAs and CNAs, with progression to cancer involving a dramatic accrual of CNAs at HGD. Detection of the cancerised field is an encouraging prospect for surveillance, but punctuated evolution may limit the window for early detection

    Evolutionary history of human colitis-associated colorectal cancer

    Get PDF
    Objective IBD confers an increased lifetime risk of developing colorectal cancer (CRC), and colitis-associated CRC (CA-CRC) is molecularly distinct from sporadic CRC (S-CRC). Here we have dissected the evolutionary history of CA-CRC using multiregion sequencing. Design Exome sequencing was performed on fresh-frozen multiple regions of carcinoma, adjacent non-cancerous mucosa and blood from 12 patients with CA-CRC (n=55 exomes), and key variants were validated with orthogonal methods. Genome-wide copy number profiling was performed using single nucleotide polymorphism arrays and low-pass whole genome sequencing on archival non-dysplastic mucosa (n=9), low-grade dysplasia (LGD; n=30), high-grade dysplasia (HGD; n=13), mixed LGD/HGD (n=7) and CA-CRC (n=19). Phylogenetic trees were reconstructed, and evolutionary analysis used to reveal the temporal sequence of events leading to CA-CRC. Results 10/12 tumours were microsatellite stable with a median mutation burden of 3.0 single nucleotide alterations (SNA) per Mb, ~20% higher than S-CRC (2.5 SNAs/Mb), and consistent with elevated ageing-associated mutational processes. Non-dysplastic mucosa had considerable mutation burden (median 47 SNAs), including mutations shared with the neighbouring CA-CRC, indicating a precancer mutational field. CA-CRCs were often near triploid (40%) or near tetraploid (20%) and phylogenetic analysis revealed that copy number alterations (CNAs) began to accrue in non-dysplastic bowel, but the LGD/HGD transition often involved a punctuated ‘catastrophic’ CNA increase. Conclusions Evolutionary genomic analysis revealed precancer clones bearing extensive SNAs and CNAs, with progression to cancer involving a dramatic accrual of CNAs at HGD. Detection of the cancerised field is an encouraging prospect for surveillance, but punctuated evolution may limit the window for early detection
    corecore