Multi-spectra peptide sequencing and its applications to multistage mass spectrometry

By Nuno Bandeira, Jesper V. Olsen, Matthias Mann and Pavel A. Pevzner


Despite a recent surge of interest in database-independent peptide identification, accurate de novo peptide sequencing remains an elusive goal. While the recently introduced spectral network approach resulted in accurate peptide sequencing in low-complexity samples, its success depends on the chance presence of spectra from overlapping peptides. On the other hand, while multistage mass spectrometry (collecting multiple MS3 spectra from each MS2 spectrum) can be applied to all spectra in a complex sample, there are currently no software tools for de novo peptide sequencing by multistage mass spectrometry. We describe a rigorous probabilistic framework for analyzing spectra of overlapping peptides and show how to apply it to multistage mass spectrometry. Our software yields both accurate de novo peptide sequencing from multistage mass spectra (despite the inferior quality of MS3 spectra) and improved interpretation of spectral networks. We further study the problem of de novo peptide sequencing with accurate parent mass (but inaccurate fragment masses), a protocol that may soon become the dominant mode of spectral acquisition. Most existing peptide sequencing algorithms (based on the spectrum graph approach) do not track the accurate parent mass and are thus not equipped to solve this problem. We describe a de novo peptide sequencing algorithm aimed at this experimental protocol and show that it improves sequencing accuracy on both tandem and multistage mass spectrometry.

Topics: ISMB 2008 Conference Proceedings 19–23 July 2008, Toronto
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central
