907 research outputs found

It's all Relative: Monocular 3D Human Pose Estimation from Weakly Supervised Data

Authors: Eng, Robert; Mac Aodha, Oisin; Perona, Pietro; Ronchi, Matteo Ruggero
Publication date: 17/05/2018

We address the problem of 3D human pose estimation from 2D input images using only weakly supervised training data. Despite showing considerable success for 2D pose estimation, the application of supervised machine learning to 3D pose estimation in real world images is currently hampered by the lack of varied training images with corresponding 3D poses. Most existing 3D pose estimation algorithms train on data that has either been collected in carefully controlled studio settings or has been generated synthetically. Instead, we take a different approach, and propose a 3D human pose estimation algorithm that only requires relative estimates of depth at training time. Such training signal, although noisy, can be easily collected from crowd annotators, and is of sufficient quality for enabling successful training and evaluation of 3D pose algorithms. Our results are competitive with fully supervised regression based approaches on the Human3.6M dataset, despite using significantly weaker training data. Our proposed algorithm opens the door to using existing widespread 2D datasets for 3D pose estimation by allowing fine-tuning with noisy relative constraints, resulting in more accurate 3D poses.
Comment: BMVC 2018. Project page available at http://www.vision.caltech.edu/~mronchi/projects/RelativePos

arXiv.org e-Print Archive

Caltech Authors

Double Refinement Network for Efficient Indoor Monocular Depth Estimation

Authors: Bogomolov, Pavel; Bubnova, Valeriya; Durasov, Nikita; Konushin, Anton; Romanov, Mikhail
Publication date: 04/04/2019

Monocular depth estimation is the task of obtaining a measure of distance for each pixel using a single image. It is an important problem in computer vision and is usually solved using neural networks. Though recent works in this area have shown significant improvement in accuracy, the state-of-the-art methods tend to require massive amounts of memory and time to process an image. The main purpose of this work is to improve the performance of the latest solutions with no decrease in accuracy. To this end, we introduce the Double Refinement Network architecture. The proposed method achieves state-of-the-art results on the standard benchmark RGB-D dataset NYU Depth v2, while its frames per second rate is significantly higher (up to 18 times speedup per image at batch size 1) and the RAM usage per image is lower.

arXiv.org e-Print Archive

In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Authors: Habibie, Ikhsanul; Mehta, Dushyant; Pons-Moll, Gerard; Theobalt, Christian; Xu, Weipeng
Publication date: 01/01/2019

Convolutional Neural Network based approaches for monocular 3D human pose estimation usually require a large amount of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations to such in-the-wild corpora is hardly feasible in practice. Most existing 3D labelled data sets are either synthetically created or feature in-studio images. 3D pose estimation algorithms trained on such data often have limited ability to generalize to real world scene diversity. We therefore propose a new deep learning based method for monocular 3D human pose estimation that shows high accuracy and generalizes better to in-the-wild scenes. It has a network architecture that comprises a new disentangled hidden space encoding of explicit 2D and 3D features, and uses supervision by a new learned projection model from predicted 3D pose. Our algorithm can be jointly trained on image data with 3D labels and image data with only 2D labels. It achieves state-of-the-art accuracy on challenging in-the-wild data.
Comment: Accepted to CVPR 201

arXiv.org e-Print Archive

MPG.PuRe

In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Authors: Habibie, I.; Mehta, D.; Pons-Moll, G.; Theobalt, C.; Xu, W.
Publication date: 01/01/2019

MPG.PuRe