Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene

Authors: Efros, Alexei A.; Fouhey, David; Gupta, Saurabh; Malik, Jitendra; Tulsiani, Shubham
Publication date: 24/04/2018

The goal of this paper is to take a single 2D image of a scene and recover the 3D structure in terms of a small set of factors: a layout representing the enclosing surfaces as well as a set of objects represented in terms of shape and pose. We propose a convolutional neural network-based approach to predict this representation and benchmark it on a large dataset of indoor scenes. Our experiments evaluate a number of practical design questions, demonstrate that we can infer this representation, and quantitatively and qualitatively demonstrate its merits compared to alternate representations.

Comment: Project url with code: https://shubhtuls.github.io/factored3

arXiv.org e-Print Archive

Crossref

Learning Single-Image Depth from Videos using Quality Assessment Networks

Authors: Chen, Weifeng; Deng, Jia; Qian, Shengyi
Publication date: 01/01/2019

Depth estimation from a single image in the wild remains a challenging problem. One main obstacle is the lack of high-quality training data for images in the wild. In this paper we propose a method to automatically generate such data through Structure-from-Motion (SfM) on Internet videos. The core of this method is a Quality Assessment Network that identifies high-quality reconstructions obtained from SfM. Using this method, we collect single-view depth training data from a large number of YouTube videos and construct a new dataset called YouTube3D. Experiments show that YouTube3D is useful in training depth estimation networks and advances the state of the art of single-view depth estimation in the wild.

arXiv.org e-Print Archive

Princeton University Open Access Repository