Learning Robust Representations via Multi-View Information Bottleneck

Akata, Z; Dutta, A; Federici, M; Forré, P; Kushmann, N

Learning Robust Representations via Multi-View Information Bottleneck

Authors: Z Akata
A Dutta
M Federici
P Forré
N Kushmann
Publication date: 1 January 2020
Publisher: International Conference on Learning Representation

Abstract

This is the author accepted manuscript.The information bottleneck method (Tishby et al. 2000) provides an information theoretic method for representation learning, by training an encoder to retain all information which is relevant for predicting the label, while minimizing the amount of other, superfluous information in the representation. The original formulation, however, requires labeled data in order to identify which information is superfluous. In this work, we extend this ability to the multi-view unsupervised setting, in which two views of the same underlying entity are provided but the label in unknown. This enables us to identify superfluous information as that which is not shared by both views. A theoretical analysis leads to the definition of a new multi-view model which produces state-of-the-art results on the Sketchy dataset and on label-limited versions of the MIR-Flickr dataset. We also extend our theory to the single-view setting by taking advantage of standard data augmentation techniques, empirically showing better generalization capabilities when compared to traditional unsupervised approaches for representation learning.European Union Horizon 202

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

MPG.PuRe

oai:pure.mpg.de:item_3286404

Last time updated on 03/03/2021

Supporting member

Open Research Exeter

oai:ore.exeter.ac.uk:10871/402...

Last time updated on 04/01/2020