Validation of nonlinear PCA

A Herman; A Ilin; AN Gorban; B Chalmond; B Christiansen; B Efron; B Schölkopf; BW Lu; D DeMers; JB Tenenbaum; LK Saul; M Scholz; MA Kramer; Matthias Scholz; MR Hestenes; ND Lawrence; P Demartines; R Hecht-Nielsen; S Girard; S Harmeling; S Mika; ST Roweis; T Hastie; T Kohonen; WW Hsieh; WW Hsieh; WW Hsieh

research

Validation of nonlinear PCA

Authors: A Herman
A Ilin
AN Gorban
B Chalmond
B Christiansen
B Efron
B Schölkopf
BW Lu
D DeMers
JB Tenenbaum
LK Saul
M Scholz
MA Kramer
Matthias Scholz
MR Hestenes
ND Lawrence
P Demartines
R Hecht-Nielsen
S Girard
S Harmeling
S Mika
ST Roweis
T Hastie
T Kohonen
WW Hsieh
WW Hsieh
WW Hsieh
Publication date: 1 January 2012
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

Linear principal component analysis (PCA) can be extended to a nonlinear PCA by using artificial neural networks. But the benefit of curved components requires a careful control of the model complexity. Moreover, standard techniques for model selection, including cross-validation and more generally the use of an independent test set, fail when applied to nonlinear PCA because of its inherent unsupervised characteristics. This paper presents a new approach for validating the complexity of nonlinear PCA models by using the error in missing data estimation as a criterion for model selection. It is motivated by the idea that only the model of optimal complexity is able to predict missing values with the highest accuracy. While standard test set validation usually favours over-fitted nonlinear PCA models, the proposed model validation approach correctly selects the optimal model complexity.Comment: 12 pages, 5 figure

Similar works

Full text

Available Versions

Crossref

info:doi/10.1007%2Fs11063-012-...

Last time updated on 01/04/2019

Archivio istituzionale della ricerca - Fondazione Edmund Mach

oai:openpub.fmach.it:10449/212...

Last time updated on 22/03/2018