Deep convolutional inverse graphics network

Kohli, Pushmeet; Kulkarni, Tejas Dattatraya; Tenenbaum, Joshua B; Whitney, William F.

Publication date: 8 December 2017
Publisher: Neural Information Processing Systems Foundation, Inc

Abstract

This paper presents the Deep Convolutional Inverse Graphics Network (DC-IGN), a model that aims to learn an interpretable representation of images that is disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure that encourages neurons in the graphics code layer to represent a specific transformation (e.g., pose or lighting). Given a single input image, our model can generate new images of the same object with variations in pose and lighting. We present qualitative and quantitative tests of the model's efficacy at learning a 3D rendering engine for varied object classes, including faces and chairs.
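The two training ingredients named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The first two functions show the standard SGVB building blocks for a diagonal Gaussian code: a reparameterized sample and the KL term against a standard normal prior. The third, `clamp_inactive_units`, is a hypothetical helper showing one plausible way to tie a graphics-code unit to a single transformation: within a mini-batch where only one factor varies, every other unit is replaced by its batch mean. All function names, the code dimension, and the example values are assumptions made for illustration.

```python
import math
import random


def sample_latent(mu, log_var, rng):
    """Reparameterized draw z = mu + sigma * eps, with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(sigma^2)) || N(0, I)), summed over code units."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0
                     for m, lv in zip(mu, log_var))


def clamp_inactive_units(batch_codes, active_index):
    """Hypothetical helper: leave one unit free, set the rest to the batch mean.

    batch_codes is a list of equal-length code vectors from a mini-batch in
    which only one scene factor (e.g. pose) is assumed to vary.
    """
    n = len(batch_codes)
    dim = len(batch_codes[0])
    means = [sum(code[j] for code in batch_codes) / n for j in range(dim)]
    return [[code[j] if j == active_index else means[j] for j in range(dim)]
            for code in batch_codes]


# Illustrative values only: a 3-unit code and a 2-image mini-batch.
rng = random.Random(0)
z = sample_latent([0.0, 1.0, -1.0], [0.0, -2.0, -2.0], rng)
codes = [[0.1, 2.0, 5.0], [0.3, 4.0, 5.0]]
clamped = clamp_inactive_units(codes, active_index=0)
```

In this sketch, unit 0 plays the role of the varying factor, so only its values differ across the clamped batch; the decoder would then have to explain the differences between images through that unit alone.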

Available Versions

DSpace@MIT

oai:dspace.mit.edu:1721.1/1127...
