Automatic Learning Rate Maximization by On-Line Estimation of the Hessian's Eigenvectors

Abstract

We propose a very simple and well-principled way of computing the optimal step size in gradient descent algorithms. The on-line version is computationally very efficient and is applicable to large backpropagation networks trained on large data sets. The main ingredient is a technique for estimating the principal eigenvalue(s) and eigenvector(s) of the objective function's second-derivative matrix (Hessian), which does not even require computing the Hessian. Several other applications of this technique are proposed, for speeding up learning or for eliminating useless parameters.
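The abstract does not spell out the estimation procedure, but a standard way to obtain the principal eigenpair of a Hessian without ever forming the matrix is power iteration on Hessian-vector products, where each product is approximated by a finite difference of two gradient evaluations. The sketch below is only an illustration of that general idea, not a reconstruction of the authors' on-line algorithm; the function names (`grad`, `estimate_top_eigenpair`) and all parameter values are hypothetical.

```python
import numpy as np

def estimate_top_eigenpair(grad, w, alpha=1e-4, n_iters=100, seed=0):
    """Power iteration on the Hessian of the objective at w.

    Relies on the finite-difference approximation
        H v ~= (grad(w + alpha * v) - grad(w)) / alpha,
    so the Hessian itself is never computed or stored.
    """
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(w.shape)
    v /= np.linalg.norm(v)           # start from a random unit vector
    g0 = grad(w)                     # gradient at the current weights
    lam = 0.0
    for _ in range(n_iters):
        hv = (grad(w + alpha * v) - g0) / alpha  # Hessian-vector product
        lam = np.linalg.norm(hv)                 # magnitude of top eigenvalue
        v = hv / lam                             # renormalize the iterate
    return lam, v

# Given the largest eigenvalue lam, a step size of roughly 1 / lam is the
# largest for which plain gradient descent remains stable along that direction.
```

Each iteration costs one extra gradient evaluation, which is what makes this kind of estimate cheap enough to run alongside backpropagation on large networks.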