An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer

Askenazi, Manor; Cao, Song; Carr, Steven A.; Chan, Daniel; Chen, Xian; Clauser, Karl R.; Davies, Sherri R.; Ding, Li; Ellis, Matthew J.; Erdmann-Gilmore, Petra; Fenyo, David; Grover, Himanshu; Gunawardena, Harsha P.; Hoadley, Katherine A.; Kinsinger, Christopher R.; Li, Shunqiang; Liebler, Daniel C.; Liu, Tao; Maher, Christopher A.; McLellan, Michael D.; Mertins, Philipp; Payne, Samuel; Perou, Charles M.; Rodland, Karen D.; Rodriguez, Henry; Ruggles, Kelly V.; Slebos, Robbert; Smith, Richard D.; Sun, Shisheng; Tabb, David L.; Tang, Zuojian; Teubl, Jennifer; Townsend, R. Reid; Wang, Xuya; Xie, Ling; Zhang, Hui; Zhang, Zhen; Zhou, Jian-Ying

An Analysis of the Sensitivity of Proteogenomic Mapping of Somatic Mutations and Novel Splicing Events in Cancer

Abstract

Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations, and splice variants identified in cancer cells are translated. Herein, we apply a proteogenomic data integration tool (QUILTS) to illustrate protein variant discovery using whole genome, whole transcriptome, and global proteome datasets generated from a pair of luminal and basal-like breast-cancer-patient-derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS sample process replicates defined here as an independent tandem MS experiment using identical sample material. Despite analysis of over 30 sample process replicates, only about 10% of SNVs (somatic and germline) detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNVs without a detectable mRNA transcript were also observed, suggesting that transcriptome coverage was incomplete (∼80%). In contrast to germline variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than in the luminal tumor, raising the possibility of differential translation or protein degradation effects. In conclusion, this large-scale proteogenomic integration allowed us to determine the degree to which mutations are translated and identify gaps in sequence coverage, thereby benchmarking current technology and progress toward whole cancer proteome and transcriptome analysis

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Carolina Digital Repository

cdr.lib.unc.edu:4f16c773t

Last time updated on 23/04/2020