Noise, regularizers, and unrealizable scenarios in online learning from restricted training sets

A. Krogh; A.C.C. Coolen; A.C.C. Coolen; A.C.C. Coolen and; B. Lopez; C.M. Bishop; C.W.H. Mace; C.W.H. Mace; D. Saad; D. Saad; D. Saad; D. Saad; David Saad; H. Horner; H. Horner; J.A. Hertz; M. Biehl; M. Biehl; M. Rattray; M. Rattray; M. Rattray; M. Rattray; P. Sollich; S. Lee; W. Kinzel; Y. LeCun; Yuan-Sheng Xiong

Noise, regularizers, and unrealizable scenarios in online learning from restricted training sets

Authors: A. Krogh
A.C.C. Coolen
A.C.C. Coolen
A.C.C. Coolen and
B. Lopez
C.M. Bishop
C.W.H. Mace
C.W.H. Mace
D. Saad
D. Saad
D. Saad
D. Saad
David Saad
H. Horner
H. Horner
J.A. Hertz
M. Biehl
M. Biehl
M. Rattray
M. Rattray
M. Rattray
M. Rattray
P. Sollich
S. Lee
W. Kinzel
Y. LeCun
Yuan-Sheng Xiong
Publication date: 27 June 2001
Publisher: 'American Physical Society (APS)'
Doi

Abstract

We study the dynamics of on-line learning in multilayer neural networks where training examples are sampled with repetition and where the number of examples scales with the number of network weights. The analysis is carried out using the dynamical replica method aimed at obtaining a closed set of coupled equations for a set of macroscopic variables from which both training and generalization errors can be calculated. We focus on scenarios whereby training examples are corrupted by additive Gaussian output noise and regularizers are introduced to improve the network performance. The dependence of the dynamics on the noise level, with and without regularizers, is examined, as well as that of the asymptotic values obtained for both training and generalization errors. We also demonstrate the ability of the method to approximate the learning dynamics in structurally unrealizable scenarios. The theoretical results show good agreement with those obtained by computer simulations

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Aston Publications Explorer

oai:publications.aston.ac.uk:1...

Last time updated on 06/03/2017

Crossref

info:doi/10.1103%2Fphysreve.64...

Last time updated on 05/06/2019