UvA-DARE (Digital Academic Repository) A classification model for the Leiden proteomics competition A Classification Model for the Leiden Proteomics Competition A Classification Model for the Leiden Proteomics Competition

A K ( Smilde; A K Smilde; Age K Smilde; Age K Smilde; H C J; H C J Hoefsloot; Huub C J Hoefsloot; Huub C J Hoefsloot; S; S Smit; Suzanne Smit; Suzanne Smit

UvA-DARE (Digital Academic Repository) A classification model for the Leiden proteomics competition A Classification Model for the Leiden Proteomics Competition A Classification Model for the Leiden Proteomics Competition

Authors: A K ( Smilde
A K Smilde
Age K Smilde
Age K Smilde
H C J
H C J Hoefsloot
Huub C J Hoefsloot
Huub C J Hoefsloot
S
S Smit
Suzanne Smit
Suzanne Smit
Publication date: 1 January 2008
Publisher

Abstract

Abstract A strategy is presented to build a discrimination model in proteomics studies. The model is built using cross-validation. This cross-validation step can simply be combined with a variable selection method, called rank products. The strategy is especially suitable for the low-samplesto-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, Principal Component Discriminant Analysis is used; however, the methodology can be used with any classifier. A data set containing serum samples from breast cancer patients and healthy controls is analysed. Double cross-validation shows that the sensitivity of the model is 82% and the specificity 86%. Potential putative biomarkers are identified using the variable selection method. In each cross-validation loop a classification model is built. The final classification uses a majority voting scheme from the ensemble classifier

Similar works

Full text

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.1044....

Last time updated on 07/12/2020