Characterising phase variations in MALDI-TOF data and correcting them by peak alignment

By Simon M Lin, Richard P Haney, Michael J Campa, Michael C Fitzgerald and Edward F Patz

Abstract

The use of MALDI-TOF mass spectrometry as a means of analyzing the proteome has been evaluated extensively in recent years. One limitation of this technique that has impeded the development of robust data analysis algorithms is the variability in the location of protein ion signals along the x-axis. We studied technical variations of MALDI-TOF measurements in the context of proteomics profiling. By acquiring a benchmark data set with five replicates, we estimated that 76% to 85% of the total variance is due to phase variation. We devised a lobster plot, so named because of its resemblance to a lobster claw, to help detect phase variation in replicates. We also investigated a peak alignment algorithm to remove the phase variation. This operation is analogous to the normalization step in microarray data analysis. Only after this critical step can features of biological interest be clearly revealed. Using principal component analysis, we demonstrated that the differences among replicates are reduced after peak alignment. We compared this peak alignment approach with a model-based calibration approach in which information about peaks common to all spectra was known. Finally, we examined the potential value, at each point in an analysis pipeline, of having available a set of methods that includes parametric, semiparametric and nonparametric methods; among such methods are those that benefit from the use of prior information.
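To make the idea of phase variation concrete, the following is a minimal sketch and not the alignment algorithm investigated in the paper, which the abstract does not specify. It assumes synthetic spectra built from two Gaussian peaks, five replicates that differ only by an integer shift along the x-axis plus a little noise, and a simple shift-only alignment that maximises the dot product with a reference replicate. All function names (`make_replicates`, `best_shift`, `align`, `total_variance`) and all parameter values are hypothetical. The fraction of variance removed by alignment is only an illustrative proxy for the share of variance attributable to phase variation; it does not reproduce the 76% to 85% estimate.

```python
import numpy as np

def make_replicates(shifts=(-4, -2, 0, 2, 4), n_points=400, noise_sd=0.01, seed=0):
    """Synthetic replicates (an assumption, not real MALDI-TOF data):
    the same two-peak profile, shifted along x, plus Gaussian noise."""
    rng = np.random.default_rng(seed)
    x = np.arange(n_points)
    base = (np.exp(-0.5 * ((x - 150) / 5.0) ** 2)
            + 0.6 * np.exp(-0.5 * ((x - 260) / 7.0) ** 2))
    return np.array([np.roll(base, s) + rng.normal(0.0, noise_sd, n_points)
                     for s in shifts])

def best_shift(reference, spectrum, max_shift=10):
    """Integer shift (in samples) that maximises the dot product with the reference."""
    candidates = list(range(-max_shift, max_shift + 1))
    scores = [np.dot(reference, np.roll(spectrum, s)) for s in candidates]
    return candidates[int(np.argmax(scores))]

def align(spectra, max_shift=10):
    """Shift every replicate onto the first one (shift-only alignment, no warping)."""
    reference = spectra[0]
    return np.array([np.roll(s, best_shift(reference, s, max_shift)) for s in spectra])

def total_variance(spectra):
    """Across-replicate variance, summed over all x positions."""
    return float(np.var(spectra, axis=0).sum())

spectra = make_replicates()
aligned = align(spectra)

before = total_variance(spectra)
after = total_variance(aligned)
phase_fraction = 1.0 - after / before  # share of variance removed by alignment

print(f"variance before alignment: {before:.4f}")
print(f"variance after alignment:  {after:.4f}")
print(f"fraction removed:          {phase_fraction:.2%}")
```

A rigid shift is the simplest possible model of phase variation. Real spectra may need position-dependent corrections, which is where warping or model-based calibration methods of the kind the abstract mentions come in.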

Topics: Original Research
Publisher: Libertas Academica
OAI identifier: oai:pubmedcentral.nih.gov:2657651
Provided by: PubMed Central
