Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence

Daunhawer, Imant; Sutter, Thomas M.; Vogt, Julia E.

Publication date: 2 November 2020

Abstract

Learning from different data types is a long-standing goal in machine learning research, as multiple information sources co-occur when describing natural phenomena. However, existing generative models that approximate a multimodal ELBO rely on difficult or inefficient training schemes to learn a joint distribution and the dependencies between modalities. In this work, we propose a novel, efficient objective function that utilizes the Jensen-Shannon divergence for multiple distributions. It simultaneously approximates the unimodal and joint multimodal posteriors directly via a dynamic prior. In addition, we theoretically prove that the new multimodal JS-divergence (mmJSD) objective optimizes an ELBO. In extensive experiments, we demonstrate the advantage of the proposed mmJSD model compared to previous work in unsupervised, generative learning tasks.

Comment: Accepted at NeurIPS 2020, camera-ready version
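The abstract names the Jensen-Shannon divergence for multiple distributions but does not spell out the objective. As a minimal sketch of that underlying quantity only, the snippet below computes the standard mixture-based JS divergence of several discrete distributions, i.e. the weighted sum of KL divergences from each distribution to their weighted mixture. It is not the mmJSD objective and does not model the dynamic prior. The function names `kl_divergence` and `js_divergence`, the use of uniform default weights, and the restriction to discrete distributions are assumptions made for illustration.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions on the same support."""
    # Terms with p_k == 0 contribute nothing, by the convention 0 * log 0 = 0.
    return sum(pk * math.log(pk / qk) for pk, qk in zip(p, q) if pk > 0)


def js_divergence(distributions, weights=None):
    """Mixture-based Jensen-Shannon divergence of several discrete distributions.

    Returns sum_i w_i * KL(p_i || m), where m = sum_i w_i * p_i.
    Uniform weights are assumed when none are given (illustrative choice).
    """
    n = len(distributions)
    if weights is None:
        weights = [1.0 / n] * n
    support_size = len(distributions[0])
    # Weighted mixture of all input distributions.
    mixture = [
        sum(w * p[k] for w, p in zip(weights, distributions))
        for k in range(support_size)
    ]
    # Zero-weight distributions are skipped: they do not enter the mixture.
    return sum(
        w * kl_divergence(p, mixture)
        for w, p in zip(weights, distributions)
        if w > 0
    )


if __name__ == "__main__":
    identical = [[0.2, 0.3, 0.5]] * 3
    disjoint = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    print(js_divergence(identical))  # zero up to floating-point error
    print(js_divergence(disjoint))   # log(3) up to floating-point error
```

With uniform weights the divergence is zero when all distributions coincide and reaches its maximum, the logarithm of the number of distributions, when their supports are disjoint.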

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2006.08242

Last time updated on 16/06/2020