Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors
The impressive performance of deep convolutional neural networks in single-view 3D reconstruction suggests that these models perform non-trivial reasoning about the 3D structure of the output space. However, recent work has challenged this belief, showing that complex encoder-decoder architectures perform similarly to nearest-neighbor baselines or simple linear decoder models that exploit large amounts of per-category data in standard benchmarks. On the other hand, settings where 3D shape must be inferred for new categories from only a few examples are more natural and require models that generalize across shapes. In this work we demonstrate experimentally that naive baselines fail when the goal is to learn to reconstruct novel objects from very few examples, and that in a few-shot learning setting the network must learn concepts that transfer to new categories rather than rely on rote memorization. To address the deficiencies of existing approaches to this problem, we propose three approaches that efficiently integrate a class prior into a 3D reconstruction model, allowing it to account for intra-class variability and imposing an implicit compositional structure that the model should learn. Experiments on the popular ShapeNet database demonstrate that our methods significantly outperform existing baselines on this task in the few-shot setting.
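As a rough illustration of the class-prior idea this abstract describes, the sketch below conditions a voxel decoder on a prior embedding computed from a few support shapes. This is a minimal sketch, not the authors' architecture: the module names, dimensions, and the mean-pooled prior are all assumptions.

```python
# Minimal sketch (assumed architecture, not the paper's): a voxel decoder
# conditioned on a class prior derived from a handful of support examples.
import torch
import torch.nn as nn

class PriorConditionedDecoder(nn.Module):
    def __init__(self, feat_dim=256, prior_dim=128):
        super().__init__()
        self.fc = nn.Linear(feat_dim + prior_dim, 512 * 4 * 4 * 4)
        self.deconv = nn.Sequential(
            nn.ConvTranspose3d(512, 128, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose3d(128, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose3d(32, 1, 4, stride=2, padding=1),  # 32^3 occupancy logits
        )

    def forward(self, img_feat, class_prior):
        # Broadcast a single class prior over the batch if needed.
        if class_prior.size(0) == 1:
            class_prior = class_prior.expand(img_feat.size(0), -1)
        # Fuse the per-image feature with the class prior before decoding,
        # so category-level shape structure guides the reconstruction.
        h = self.fc(torch.cat([img_feat, class_prior], dim=-1))
        return self.deconv(h.view(-1, 512, 4, 4, 4))

# One plausible prior (an assumption): the mean embedding of K support shapes.
# class_prior = support_embeddings.mean(dim=0, keepdim=True)
```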
BodyNet: Volumetric Inference of 3D Human Body Shapes
Human shape estimation is an important task for video editing, animation, and the fashion industry. Predicting 3D human body shape from natural images, however, is highly challenging due to factors such as variation in human bodies, clothing, and viewpoint. Prior methods addressing this problem typically attempt to fit parametric body models with certain priors on pose and shape. In this work we argue for an alternative representation and propose BodyNet, a neural network for direct inference of volumetric body shape from a single image. BodyNet is an end-to-end trainable network that benefits from (i) a volumetric 3D loss, (ii) a multi-view re-projection loss, and (iii) intermediate supervision of 2D pose, 2D body part segmentation, and 3D pose. Each of these yields a performance improvement, as demonstrated by our experiments. To evaluate the method, we fit the SMPL model to our network output and show state-of-the-art results on the SURREAL and Unite the People datasets, outperforming recent approaches. Besides achieving state-of-the-art performance, our method also enables volumetric body-part segmentation.
Appears in: European Conference on Computer Vision 2018 (ECCV 2018). 27 pages.
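To make the three loss terms listed in the abstract concrete, here is a hedged sketch of how such a combined objective could look. The loss weights, tensor layouts, and the orthographic projection operator are assumptions for illustration, not BodyNet's actual code.

```python
# Illustrative combination of (i) a volumetric 3D loss, (ii) a multi-view
# re-projection loss, and (iii) intermediate supervision; all weights and
# shapes are assumptions. Voxels are (N, 1, D, H, W) occupancy logits.
import torch
import torch.nn.functional as F

def reproject(occupancy, dim):
    # Orthographic silhouette: max occupancy along one axis of the grid.
    return occupancy.max(dim=dim).values

def combined_loss(pred_vox, gt_vox, gt_front_sil, gt_side_sil,
                  pred_pose2d, gt_pose2d, pred_seg, gt_seg,
                  pred_pose3d, gt_pose3d,
                  w_vox=1.0, w_proj=0.1, w_aux=0.1):
    # (i) volumetric 3D loss on predicted occupancies
    l_vox = F.binary_cross_entropy_with_logits(pred_vox, gt_vox)
    occ = torch.sigmoid(pred_vox)
    # (ii) re-projection loss against front- and side-view 2D silhouettes
    l_proj = (F.binary_cross_entropy(reproject(occ, 2), gt_front_sil) +
              F.binary_cross_entropy(reproject(occ, 4), gt_side_sil))
    # (iii) intermediate supervision of 2D pose, part segmentation, 3D pose
    l_aux = (F.mse_loss(pred_pose2d, gt_pose2d) +
             F.cross_entropy(pred_seg, gt_seg) +
             F.mse_loss(pred_pose3d, gt_pose3d))
    return w_vox * l_vox + w_proj * l_proj + w_aux * l_aux
```

The re-projection term ties the 3D prediction back to easily obtained 2D silhouettes, which is why a multi-view loss can help even when full 3D supervision is available.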
Three-Dimensional Object Registration Using Wavelet Features
Recent developments in shape-based modeling and data acquisition have brought three-dimensional models to the forefront of computer graphics and visualization research. New data acquisition methods are producing large numbers of models in a variety of fields. Three-dimensional registration (alignment) is key to the useful application of such models in areas from automated surface inspection to cancer detection and surgery. The algorithms developed in this research accomplish automatic registration of three-dimensional voxelized models. We employ features in a wavelet transform domain to accomplish registration. The features are extracted in a multi-resolution format, thus delineating features at various scales for robust and rapid matching. Registration is achieved by using a voting scheme to select peaks in sets of rotation quaternions, then separately identifying translation. The method is robust to occlusion, clutter, and noise. The efficacy of the algorithm is demonstrated through examples from solid modeling and medical imaging applications.
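The rotation-voting step described in this abstract can be sketched as follows: candidate quaternions obtained from feature matches are binned, the peak bin determines the rotation, and translation is then estimated separately. The binning granularity, the upstream wavelet feature matching, and the median-based translation estimate are assumptions, not the authors' exact procedure.

```python
# Hedged sketch of quaternion voting for rigid registration; the feature
# matching that produces candidate_quats is assumed to happen upstream.
import numpy as np

def quaternion_vote(candidate_quats, bins=20):
    # Canonicalize sign (q and -q encode the same rotation), then histogram
    # the four components jointly and take the most-voted cell as the peak.
    q = np.asarray(candidate_quats, dtype=float)
    q[q[:, 0] < 0] *= -1.0
    hist, edges = np.histogramdd(q, bins=bins, range=[(-1.0, 1.0)] * 4)
    peak = np.unravel_index(hist.argmax(), hist.shape)
    # Return the center of the winning cell, renormalized to unit length.
    center = np.array([(edges[i][peak[i]] + edges[i][peak[i] + 1]) / 2.0
                       for i in range(4)])
    return center / np.linalg.norm(center)

def estimate_translation(src_pts, dst_pts, R):
    # With rotation fixed, translation is identified separately, here as the
    # componentwise median of residuals between rotated source points and
    # their matched destination points (robust to outlier matches).
    return np.median(dst_pts - src_pts @ R.T, axis=0)
```

Voting in quaternion space first, then solving translation, decouples the two halves of the rigid transform, which is what gives the scheme its robustness to clutter and partial overlap.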