33 research outputs found
Pharmacoproteomic characterisation of human colon and rectal cancer
Most molecular cancer therapies act on protein targets but data on the proteome status of patients and cellular models for proteome-guided pre-clinical drug sensitivity studies are only beginning to emerge. Here, we profiled the proteomes of 65 colorectal cancer (CRC) cell lines to a depth of > 10,000 proteins using mass spectrometry. Integration with proteomes of 90 CRC patients and matched transcriptomics data defined integrated CRC subtypes, highlighting cell lines representative of each tumour subtype. Modelling the responses of 52 CRC cell lines to 577 drugs as a function of proteome profiles enabled predicting drug sensitivity for cell lines and patients. Among many novel associations, MERTK was identified as a predictive marker for resistance towards MEK1/2 inhibitors and immunohistochemistry of 1,074 CRC tumours confirmed MERTK as a prognostic survival marker. We provide the proteomic and pharmacological data as a resource to the community to, for example, facilitate the design of innovative prospective clinical trials. © 2017 The Authors. Published under the terms of the CC BY 4.0 licens
The OTUD6B-LIN28B-MYC axis determines the proliferative state in multiple myeloma
Deubiquitylases (DUBs) are therapeutically amenable components of the ubiquitin machinery that stabilize substrate proteins. Their inhibition can destabilize oncoproteins that may otherwise be undruggable. Here, we screened for DUB vulnerabilities in multiple myeloma, an incurable malignancy with dependency on the ubiquitin proteasome system and identified OTUD6B as an oncogene that drives the G1/S-transition. LIN28B, a suppressor of microRNA biogenesis, is specified as a bona fide cell cycle-specific substrate of OTUD6B. Stabilization of LIN28B drives MYC expression at G1/S, which in turn allows for rapid S-phase entry. Silencing OTUD6B or LIN28B inhibits multiple myeloma outgrowth in vivo and high OTUD6B expression evolves in patients that progress to symptomatic multiple myeloma and results in an adverse outcome of the disease. Thus, we link proteolytic ubiquitylation with post-transcriptional regulation and nominate OTUD6B as a potential mediator of the MGUS-multiple myeloma transition, a central regulator of MYC, and an actionable vulnerability in multiple myeloma and other tumors with an activated OTUD6B-LIN28B axis.</p
A large peptidome dataset improves HLA class I epitope prediction across most of the human population
Published in final edited form as: Nat Biotechnol. 2020 February ; 38(2): 199–209. doi:10.1038/s41587-019-0322-9.Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.P01 CA229092 - NCI NIH HHS; P50 CA101942 - NCI NIH HHS; T32 HG002295 - NHGRI NIH HHS; T32 CA009172 - NCI NIH HHS; U24 CA224331 - NCI NIH HHS; R21 CA216772 - NCI NIH HHS; R01 CA155010 - NCI NIH HHS; U01 CA214125 - NCI NIH HHS; T32 CA207021 - NCI NIH HHS; R01 HL103532 - NHLBI NIH HHS; U24 CA210986 - NCI NIH HHSAccepted manuscrip
MScDB: A Mass Spectrometry-centric Protein Sequence Database for Proteomics
Protein sequence databases are indispensable
tools for life science
research including mass spectrometry (MS)-based proteomics. In current
database construction processes, sequence similarity clustering is
used to reduce redundancies in the source data. Albeit powerful, it
ignores the peptide-centric nature of proteomic data and the fact
that MS is able to distinguish similar sequences. Therefore, we introduce
an approach that structures the protein sequence space at the peptide
level using theoretical and empirical information from large-scale
proteomic data to generate a mass spectrometry-centric protein sequence
database (MScDB). The core modules of MScDB are an <i>in-silico</i> proteolytic digest and a peptide-centric clustering algorithm that
groups protein sequences that are indistinguishable by mass spectrometry.
Analysis of various MScDB uses cases against five complex human proteomes,
resulting in 69 peptide identifications not present in UniProtKB as
well as 79 putative single amino acid polymorphisms. MScDB retains
∼99% of the identifications in comparison to common databases
despite a 3–48% increase in the theoretical peptide search
space (but comparable protein sequence space). In addition, MScDB
enables cross-species applications such as human/mouse graft models,
and our results suggest that the uncertainty in protein assignments
to one species can be smaller than 20%
MScDB: A Mass Spectrometry-centric Protein Sequence Database for Proteomics
Protein sequence databases are indispensable
tools for life science
research including mass spectrometry (MS)-based proteomics. In current
database construction processes, sequence similarity clustering is
used to reduce redundancies in the source data. Albeit powerful, it
ignores the peptide-centric nature of proteomic data and the fact
that MS is able to distinguish similar sequences. Therefore, we introduce
an approach that structures the protein sequence space at the peptide
level using theoretical and empirical information from large-scale
proteomic data to generate a mass spectrometry-centric protein sequence
database (MScDB). The core modules of MScDB are an <i>in-silico</i> proteolytic digest and a peptide-centric clustering algorithm that
groups protein sequences that are indistinguishable by mass spectrometry.
Analysis of various MScDB uses cases against five complex human proteomes,
resulting in 69 peptide identifications not present in UniProtKB as
well as 79 putative single amino acid polymorphisms. MScDB retains
∼99% of the identifications in comparison to common databases
despite a 3–48% increase in the theoretical peptide search
space (but comparable protein sequence space). In addition, MScDB
enables cross-species applications such as human/mouse graft models,
and our results suggest that the uncertainty in protein assignments
to one species can be smaller than 20%
MScDB: A Mass Spectrometry-centric Protein Sequence Database for Proteomics
Protein sequence databases are indispensable
tools for life science
research including mass spectrometry (MS)-based proteomics. In current
database construction processes, sequence similarity clustering is
used to reduce redundancies in the source data. Albeit powerful, it
ignores the peptide-centric nature of proteomic data and the fact
that MS is able to distinguish similar sequences. Therefore, we introduce
an approach that structures the protein sequence space at the peptide
level using theoretical and empirical information from large-scale
proteomic data to generate a mass spectrometry-centric protein sequence
database (MScDB). The core modules of MScDB are an <i>in-silico</i> proteolytic digest and a peptide-centric clustering algorithm that
groups protein sequences that are indistinguishable by mass spectrometry.
Analysis of various MScDB uses cases against five complex human proteomes,
resulting in 69 peptide identifications not present in UniProtKB as
well as 79 putative single amino acid polymorphisms. MScDB retains
∼99% of the identifications in comparison to common databases
despite a 3–48% increase in the theoretical peptide search
space (but comparable protein sequence space). In addition, MScDB
enables cross-species applications such as human/mouse graft models,
and our results suggest that the uncertainty in protein assignments
to one species can be smaller than 20%
MScDB: A Mass Spectrometry-centric Protein Sequence Database for Proteomics
Protein sequence databases are indispensable
tools for life science
research including mass spectrometry (MS)-based proteomics. In current
database construction processes, sequence similarity clustering is
used to reduce redundancies in the source data. Albeit powerful, it
ignores the peptide-centric nature of proteomic data and the fact
that MS is able to distinguish similar sequences. Therefore, we introduce
an approach that structures the protein sequence space at the peptide
level using theoretical and empirical information from large-scale
proteomic data to generate a mass spectrometry-centric protein sequence
database (MScDB). The core modules of MScDB are an <i>in-silico</i> proteolytic digest and a peptide-centric clustering algorithm that
groups protein sequences that are indistinguishable by mass spectrometry.
Analysis of various MScDB uses cases against five complex human proteomes,
resulting in 69 peptide identifications not present in UniProtKB as
well as 79 putative single amino acid polymorphisms. MScDB retains
∼99% of the identifications in comparison to common databases
despite a 3–48% increase in the theoretical peptide search
space (but comparable protein sequence space). In addition, MScDB
enables cross-species applications such as human/mouse graft models,
and our results suggest that the uncertainty in protein assignments
to one species can be smaller than 20%
MScDB: A Mass Spectrometry-centric Protein Sequence Database for Proteomics
Protein sequence databases are indispensable
tools for life science
research including mass spectrometry (MS)-based proteomics. In current
database construction processes, sequence similarity clustering is
used to reduce redundancies in the source data. Albeit powerful, it
ignores the peptide-centric nature of proteomic data and the fact
that MS is able to distinguish similar sequences. Therefore, we introduce
an approach that structures the protein sequence space at the peptide
level using theoretical and empirical information from large-scale
proteomic data to generate a mass spectrometry-centric protein sequence
database (MScDB). The core modules of MScDB are an <i>in-silico</i> proteolytic digest and a peptide-centric clustering algorithm that
groups protein sequences that are indistinguishable by mass spectrometry.
Analysis of various MScDB uses cases against five complex human proteomes,
resulting in 69 peptide identifications not present in UniProtKB as
well as 79 putative single amino acid polymorphisms. MScDB retains
∼99% of the identifications in comparison to common databases
despite a 3–48% increase in the theoretical peptide search
space (but comparable protein sequence space). In addition, MScDB
enables cross-species applications such as human/mouse graft models,
and our results suggest that the uncertainty in protein assignments
to one species can be smaller than 20%
Phosphoproteome Profiling Reveals Molecular Mechanisms of Growth-Factor-Mediated Kinase Inhibitor Resistance in EGFR-Overexpressing Cancer Cells
Although
substantial progress has been made regarding the use of
molecularly targeted cancer therapies, resistance almost invariably
develops and presents a major clinical challenge. The tumor microenvironment
can rescue cancer cells from kinase inhibitors by growth-factor-mediated
induction of pro-survival pathways. Here we show that epidermal growth
factor receptor (EGFR) inhibition by Gefitinib is counteracted by
growth factors, notably FGF2, and we assessed the global molecular
consequences of this resistance at the proteome and phosphoproteome
level in A431 cells. Tandem mass tag peptide labeling and quantitative
mass spectrometry allowed the identification and quantification of
22 000 phosphopeptides and 8800 proteins in biological triplicates
without missing values. The data show that FGF2 protects the cells
from the antiproliferative effect of Gefitinib and largely prevents
reprogramming of the proteome and phosphoproteome. Simultaneous EGFR/FGFR
or EGFR/GSG2 (Haspin) inhibition overcomes this resistance, and the
phosphoproteomic experiments further prioritized the RAS/MEK/ERK as
well as the PI3K/mTOR axis for combination treatment. Consequently,
the MEK inhibitor Trametinib prevented FGF2-mediated survival of EGFR
inhibitor-resistant cells when used in combination with Gefitinib.
Surprisingly, the PI3K/mTOR inhibitor Omipalisib reversed resistance
mediated by all four growth factors tested, making it an interesting
candidate for mitigating the effects of the tumor microenvironment