GFlowNet-EM for learning compositional latent variable models

Bengio, Yoshua; Everett, Katie; Graikos, Alexandros; Hu, Edward J.; Jain, Moksh; Malkin, Nikolay

Publication date: 3 June 2023

Abstract

Latent variable models (LVMs) with discrete compositional latents are an important but challenging setting due to a combinatorially large number of possible configurations of the latents. A key tradeoff in modeling the posteriors over latents is between expressivity and tractable optimization. For algorithms based on expectation-maximization (EM), the E-step is often intractable without restrictive approximations to the posterior. We propose the use of GFlowNets, algorithms for sampling from an unnormalized density by learning a stochastic policy for sequential construction of samples, for this intractable E-step. By training GFlowNets to sample from the posterior over latents, we take advantage of their strengths as amortized variational inference algorithms for complex distributions over discrete structures. Our approach, GFlowNet-EM, enables the training of expressive LVMs with discrete compositional latents, as shown by experiments on non-context-free grammar induction and on images using discrete variational autoencoders (VAEs) without conditional independence enforced in the encoder.

Comment: ICML 2023; code: https://github.com/GFNOrg/GFlowNet-E
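
To make the intractability of the E-step concrete, the following is a minimal sketch of an exact E-step computed by brute-force enumeration on a toy model. It is not the authors' method and contains no GFlowNet: it only shows the quantity that a learned sampler would approximate, namely the posterior over z, which is proportional to the unnormalized density p_theta(x, z). The toy model (a uniform prior over binary vectors and a unit-variance Gaussian likelihood) and the names `log_joint`, `exact_posterior`, and `theta` are assumptions made purely for illustration.

```python
import itertools
import math


def log_joint(x, z, theta):
    """Toy log p_theta(x, z) (hypothetical model, for illustration only).

    z is a tuple of bits with a uniform prior; x is Gaussian with unit
    variance around the sum of the entries of theta selected by z.
    """
    mean = sum(theta[i] for i, bit in enumerate(z) if bit)
    log_prior = -len(z) * math.log(2.0)
    log_lik = -0.5 * (x - mean) ** 2 - 0.5 * math.log(2.0 * math.pi)
    return log_prior + log_lik


def exact_posterior(x, theta):
    """Exact E-step: normalize the joint over all 2**n latent configurations."""
    n = len(theta)
    configs = list(itertools.product((0, 1), repeat=n))
    log_r = [log_joint(x, z, theta) for z in configs]
    # Log-sum-exp for a numerically stable normalizing constant.
    m = max(log_r)
    log_norm = m + math.log(sum(math.exp(v - m) for v in log_r))
    return {z: math.exp(v - log_norm) for z, v in zip(configs, log_r)}


theta = [1.0, 2.0, 4.0]
posterior = exact_posterior(x=3.0, theta=theta)
best = max(posterior, key=posterior.get)
```

The enumeration visits 2**n configurations, so it is only feasible for very small n. This is the cost that motivates replacing the exact normalization with an amortized sampler trained on the unnormalized density.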

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2302.06576

Last time updated on 04/03/2023