Search CORE

6 research outputs found

Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks

Author: Kovachki Nikola B.
Stuart Andrew M.
Publication venue: 'IOP Publishing'
Publication date: 10/08/2018
Field of study

The standard probabilistic perspective on machine learning gives rise to empirical risk-minimization tasks that are frequently solved by stochastic gradient descent (SGD) and variants thereof. We present a formulation of these tasks as classical inverse or filtering problems and, furthermore, we propose an efficient, gradient-free algorithm for finding a solution to these problems using ensemble Kalman inversion (EKI). Applications of our approach include offline and online supervised learning with deep neural networks, as well as graph-based semi-supervised learning. The essence of the EKI procedure is an ensemble based approximate gradient descent in which derivatives are replaced by differences from within the ensemble. We suggest several modifications to the basic method, derived from empirically successful heuristics developed in the context of SGD. Numerical results demonstrate wide applicability and robustness of the proposed algorithm.Comment: 41 pages, 14 figure

arXiv.org e-Print Archive

Caltech Authors

Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks

Author: Kovachki Nikola B.
Stuart Andrew M.
Publication venue: 'AIP Publishing'
Publication date: 01/09/2019
Field of study

The standard probabilistic perspective on machine learning gives rise to empirical risk-minimization tasks that are frequently solved by stochastic gradient descent (SGD) and variants thereof. We present a formulation of these tasks as classical inverse or filtering problems and, furthermore, we propose an efficient, gradient-free algorithm for finding a solution to these problems using ensemble Kalman inversion (EKI). The method is inherently parallelizable and is applicable to problems with non-differentiable loss functions, for which back-propagation is not possible. Applications of our approach include offline and online supervised learning with deep neural networks, as well as graph-based semi-supervised learning. The essence of the EKI procedure is an ensemble based approximate gradient descent in which derivatives are replaced by differences from within the ensemble. We suggest several modifications to the basic method, derived from empirically successful heuristics developed in the context of SGD. Numerical results demonstrate wide applicability and robustness of the proposed algorithm

A parallel implementation of the ensemble Kalman filter based on modified Cholesky decomposition

Author: Anderson
Bickel
Blackford
Emerick
Evensen
Huang
Hunt
Kucharski
Lahabar
Liu
Miyoshi
Molteni
Nerger
Ott
Rao
Sakov
Terasaki
Zupanski
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref