Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling

Freeman, William T.; Sun, Xingyuan; Tenenbaum, Joshua B.; Wu, Jiajun; Xue, Tianfan; Zhang, Chengkai; Zhang, Xiuming; Zhang, Zhoutong

Authors: William T. Freeman, Xingyuan Sun, Joshua B. Tenenbaum, Jiajun Wu, Tianfan Xue, Chengkai Zhang, Xiuming Zhang, Zhoutong Zhang
Publication date: 12 April 2018

Abstract

We study 3D shape modeling from a single image and make contributions to it in three aspects. First, we present Pix3D, a large-scale benchmark of diverse image-shape pairs with pixel-level 2D-3D alignment. Pix3D has wide applications in shape-related tasks including reconstruction, retrieval, viewpoint estimation, etc. Building such a large-scale dataset, however, is highly challenging; existing datasets either contain only synthetic data, or lack precise alignment between 2D images and 3D shapes, or only have a small number of images. Second, we calibrate the evaluation criteria for 3D shape reconstruction through behavioral studies, and use them to objectively and systematically benchmark cutting-edge reconstruction algorithms on Pix3D. Third, we design a novel model that simultaneously performs 3D reconstruction and pose estimation; our multi-task learning approach achieves state-of-the-art performance on both tasks.

Comment: CVPR 2018. The first two authors contributed equally to this work. Project page: http://pix3d.csail.mit.ed
