In this paper, we use Prior-data Fitted Networks (PFNs) as a flexible
surrogate for Bayesian Optimization (BO). PFNs are neural processes that are
trained to approximate the posterior predictive distribution (PPD) for any
prior distribution from which one can sample efficiently. We describe how this
flexibility can be exploited for surrogate modeling in BO. We use PFNs to mimic
a naive Gaussian process (GP), an advanced GP, and a Bayesian neural network
(BNN). In addition, we show how to incorporate further information into the
prior, such as allowing hints about the position of optima (user priors),
ignoring irrelevant dimensions, and performing non-myopic BO by learning the
acquisition function. The flexibility underlying these extensions opens up vast
possibilities for using PFNs for BO. We demonstrate the usefulness of PFNs for
BO in a large-scale evaluation on artificial GP samples and three different
hyperparameter optimization testbeds: HPO-B, Bayesmark, and PD1. We publish
code alongside trained models at http://github.com/automl/PFNs4BO.
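To make the surrogate-modeling idea concrete, the following is a minimal sketch, not the released PFNs4BO implementation, of how a PFN-style surrogate slots into a standard expected-improvement BO loop. The function `pfn_predict` is a hypothetical stand-in for a single in-context forward pass of a pretrained PFN that conditions on the observed points and returns a predictive mean and standard deviation; here it is filled with a crude placeholder so the sketch runs end to end.

```python
# Minimal sketch, not the PFNs4BO implementation: a PFN-style surrogate
# inside a standard expected-improvement BO loop. `pfn_predict` is a
# hypothetical placeholder for one in-context forward pass of a trained PFN.
import numpy as np
from scipy.stats import norm

def pfn_predict(X_obs, y_obs, X_query):
    """Placeholder for a pretrained PFN: condition on (X_obs, y_obs)
    in-context and return a predictive mean/std per query point.
    Here: a crude distance-weighted estimate so the sketch executes."""
    d = np.linalg.norm(X_query[:, None, :] - X_obs[None, :, :], axis=-1)
    w = np.exp(-d) / np.exp(-d).sum(axis=1, keepdims=True)
    mu = w @ y_obs
    sigma = np.sqrt((w @ (y_obs - y_obs.mean()) ** 2) + 1e-3)
    return mu, sigma

def expected_improvement(mu, sigma, y_best):
    # Standard (myopic) EI for minimization.
    z = (y_best - mu) / np.clip(sigma, 1e-9, None)
    return sigma * (z * norm.cdf(z) + norm.pdf(z))

def bo_loop(objective, X_init, y_init, n_steps, rng):
    X, y = X_init, y_init
    for _ in range(n_steps):
        cand = rng.uniform(size=(256, X.shape[1]))   # random candidate set
        mu, sigma = pfn_predict(X, y, cand)          # one PFN forward pass
        x_next = cand[np.argmax(expected_improvement(mu, sigma, y.min()))]
        X = np.vstack([X, x_next])
        y = np.append(y, objective(x_next))
    return X, y

# Example usage on a toy 2-D objective.
rng = np.random.default_rng(0)
f = lambda x: np.sum((x - 0.3) ** 2)
X0 = rng.uniform(size=(5, 2))
X, y = bo_loop(f, X0, np.array([f(x) for x in X0]), n_steps=20, rng=rng)
print("best value found:", y.min())
```

Because the PFN call is a single forward pass with no per-task refitting, swapping the prior the surrogate mimics (naive GP, advanced GP, BNN, or a prior with user hints) amounts to loading a different pretrained model, while the surrounding loop stays unchanged.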