We study the optimisation problem associated with Gaussian process regression
using squared loss. The most common approach to this problem is to apply an
exact solver, such as the method of conjugate gradients, either directly or to a
reduced-order version of the problem. Recently, driven by successes in deep
learning, stochastic gradient descent has gained traction as an alternative. In
this paper, we show that when done right, by which we mean using specific
insights from the optimisation and kernel communities, this approach is highly
effective. We thus
introduce a particular stochastic dual gradient descent algorithm that can be
implemented in a few lines of code using any deep learning framework. We
justify our design decisions with ablation studies that illustrate their
advantages over the alternatives, and show that the new method is highly
competitive. Our evaluations on standard regression benchmarks and a Bayesian
optimisation task set our approach apart from preconditioned conjugate
gradients, variational Gaussian process approximations, and a previous version
of stochastic gradient descent for Gaussian processes. On a molecular binding
affinity prediction task, our method places Gaussian process regression on par,
in terms of performance, with state-of-the-art graph neural networks.