2 research outputs found

    Sequence dependencies and mutation rates of localized mutational processes in cancer

    No full text
    Abstract Background Cancer mutations accumulate through replication errors and DNA damage coupled with incomplete repair. Individual mutational processes often show nucleotide sequence and functional region preferences. As a result, some sequence contexts mutate at much higher rates than others, with additional variation found between functional regions. Mutational hotspots, with recurrent mutations across cancer samples, represent genomic positions with elevated mutation rates, often caused by highly localized mutational processes. Methods We count the 11-mer genomic sequences across the genome, and using the PCAWG set of 2583 pan-cancer whole genomes, we associate 11-mers with mutational signatures, hotspots of single nucleotide variants, and specific genomic regions. We evaluate the mutation rates of individual and combined sets of 11-mers and derive mutational sequence motifs. Results We show that hotspots generally identify highly mutable sequence contexts. Using these, we show that some mutational signatures are enriched in hotspot sequence contexts, corresponding to well-defined sequence preferences for the underlying localized mutational processes. This includes signature 17b (of unknown etiology) and signatures 62 (POLE deficiency), 7a (UV), and 72 (linked to lymphomas). In some cases, the mutation rate and sequence preference increase further when focusing on certain genomic regions, such as signature 62 in transcribed regions, where the mutation rate is increased up to 9-folds over cancer type and mutational signature average. Conclusions We summarize our findings in a catalog of localized mutational processes, their sequence preferences, and their estimated mutation rates

    Additional file 1 of Sequence dependencies and mutation rates of localized mutational processes in cancer

    No full text
    Additional file 1: Fig. S1. Catalog of localized mutational processes and their mutation rates. Fig. S2. Catalog of sequence dependencies. Fig. S3. Cosine similarities between signatures. Fig. S4. Mutation rates of 11-mers with and without hotspots. Fig. S5. Background 11-mer sets derived from mutational signatures. Fig. S6. UV-signature sequence characteristics across mutation rates. Fig. S7. Characterization of GAAACTTCTTT-sequences in repetitive elements. Table S1. K-mer statistics
    corecore