Bayesian Dropout

Herlau, Tue; Mørup, Morten; Schmidt, Mikkel N.


Authors: Tue Herlau, Morten Mørup, Mikkel N. Schmidt
Publication date: 12 August 2015

Abstract

Dropout has recently emerged as a powerful and simple method for training neural networks, preventing co-adaptation by stochastically omitting neurons. Dropout is currently not grounded in explicit modelling assumptions, which has so far precluded its adoption in Bayesian modelling. Using Bayesian entropic reasoning, we show that dropout can be interpreted as optimal inference under constraints. We demonstrate this on an analytically tractable regression model, providing a Bayesian interpretation of its mechanism for regularizing and preventing co-adaptation as well as its connection to other Bayesian techniques. We also discuss two approximate techniques for applying Bayesian dropout to general models, one based on an analytical approximation and the other on stochastic variational techniques. These techniques are then applied to a Bayesian logistic regression problem and are shown to improve performance as the model becomes more misspecified. Our framework roots dropout as a theoretically justified and practical tool for statistical modelling, allowing Bayesians to tap into the benefits of dropout training.

Comment: 21 pages, 3 figures. Manuscript prepared 2014 and awaiting submission

Available Versions

Online Research Database In Technology

oai:pure.atira.dk:publications...

Last time updated on 23/08/2022

arXiv.org e-Print Archive

oai:arXiv.org:1508.02905

Last time updated on 24/09/2015