11 research outputs found
Cassia grandis
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
The State of the Human Proteome in 2012 as Viewed through PeptideAtlas
The Human Proteome Project was launched in September
2010 with
the goal of characterizing at least one protein product from each
protein-coding gene. Here we assess how much of the proteome has been
detected to date via tandem mass spectrometry by analyzing PeptideAtlas,
a compendium of human derived LC–MS/MS proteomics data from
many laboratories around the world. All data sets are processed with
a consistent set of parameters using the Trans-Proteomic Pipeline
and subjected to a 1% protein FDR filter before inclusion in PeptideAtlas.
Therefore, PeptideAtlas contains only high confidence protein identifications.
To increase proteome coverage, we explored new comprehensive public
data sources for data likely to add new proteins to the Human PeptideAtlas.
We then folded these data into a Human PeptideAtlas 2012 build and
mapped it to Swiss-Prot, a protein sequence database curated to contain
one entry per human protein coding gene. We find that this latest
PeptideAtlas build includes at least one peptide for each of ∼12500
Swiss-Prot entries, leaving ∼7500 gene products yet to be confidently
cataloged. We characterize these “PA-unseen” proteins
in terms of tissue localization, transcript abundance, and Gene Ontology
enrichment, and propose reasons for their absence from PeptideAtlas
and strategies for detecting them in the future
State of the Human Proteome in 2013 as Viewed through PeptideAtlas: Comparing the Kidney, Urine, and Plasma Proteomes for the Biology- and Disease-Driven Human Proteome Project
The
kidney, urine, and plasma proteomes are intimately related:
proteins and metabolic waste products are filtered from the plasma
by the kidney and excreted via the urine, while kidney proteins may
be secreted into the circulation or released into the urine. Shotgun
proteomics data sets derived from human kidney, urine, and plasma
samples were collated and
processed using a uniform software pipeline, and relative protein
abundances were estimated by spectral counting. The resulting PeptideAtlas
builds yielded 4005, 2491, and 3553 nonredundant proteins at 1% FDR
for the kidney, urine, and plasma proteomes, respectively
for kidney and plasma, the largest high-confidence protein sets to
date. The same pipeline applied to all available human data yielded
a 2013 Human PeptideAtlas build containing 12 644 nonredundant
proteins and at least one peptide for each of ∼14 000
Swiss-Prot entries, an increase over 2012 of ∼7.5% of the predicted
human proteome. We demonstrate that abundances are correlated between
plasma and urine, examine the most abundant urine proteins not derived
from either plasma or kidney, and consider the biomarker potential
of proteins associated with renal decline. This analysis forms part
of the Biology and Disease-driven Human Proteome Project (B/D-HPP)
and is a contribution to the Chromosome-centric Human Proteome Project
(C-HPP) special issue
