Natural gradient descent is a principled method for adapting the parameters
of a statistical model on-line using an underlying Riemannian parameter space
to redefine the direction of steepest descent. The algorithm is examined via
methods of statistical physics which accurately characterize both transient and
asymptotic behavior. A solution of the learning dynamics is obtained for the
case of multilayer neural network training in the limit of large input
dimension. We find that natural gradient learning leads to optimal asymptotic
performance and outperforms standard gradient descent in the transient regime,
significantly shortening or even removing the plateaus in generalization
performance that typically hamper gradient descent training.
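For concreteness, a minimal sketch of a single natural gradient update in Python, assuming the Fisher information matrix is available and adding a small damping term for numerical stability (the function name, learning rate, and damping are illustrative choices, not details from the paper):

import numpy as np

def natural_gradient_step(w, grad, fisher, lr=0.05, damping=1e-4):
    # Natural gradient descent premultiplies the Euclidean gradient by the
    # inverse Fisher information matrix, yielding the direction of steepest
    # descent with respect to the Riemannian metric on parameter space.
    f = fisher + damping * np.eye(w.size)  # damping is a numerical safeguard (assumption)
    return w - lr * np.linalg.solve(f, grad)

# Illustrative usage on a quadratic loss L(w) = 0.5 * w @ A @ w,
# taking A as a stand-in Fisher matrix for the example:
A = np.array([[2.0, 0.3], [0.3, 1.0]])
w = np.array([1.0, -1.0])
w_new = natural_gradient_step(w, A @ w, fisher=A)

Solving the linear system rather than explicitly inverting the Fisher matrix is a standard numerical choice; in high dimensions, practical implementations typically approximate the inverse rather than form it exactly.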