Real-time marker-less multi-person 3D pose estimation in RGB-Depth camera networks
This paper proposes a novel system to estimate and track the 3D poses of
multiple persons in calibrated RGB-Depth camera networks. The multi-view 3D
pose of each person is computed by a central node which receives the
single-view outcomes from each camera of the network. Each single-view outcome
is computed by using a CNN for 2D pose estimation and extending the resulting
skeletons to 3D by means of the sensor depth. The proposed system is
marker-less and multi-person, independent of the background, and makes no
assumptions about people's appearance or initial pose. The system provides real-time
outcomes, thus being perfectly suited for applications requiring user
interaction. Experimental results show the effectiveness of this work with
respect to a baseline multi-view approach in different scenarios. To foster
research and applications based on this work, we released the source code in
OpenPTrack, an open source project for RGB-D people tracking.
Comment: Submitted to the 2018 IEEE International Conference on Robotics and Automation
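The per-camera 3D lifting described above (2D skeleton joints extended to 3D via the sensor depth) can be sketched with a standard pinhole back-projection. This is a minimal illustration, not the paper's code; the intrinsics (fx, fy, cx, cy), joint coordinates, and depth values are assumed example numbers.

```python
import numpy as np

def lift_to_3d(u, v, depth, fx, fy, cx, cy):
    """Back-project a 2D pixel (u, v) with a depth reading into a 3D point
    in the camera frame, assuming a calibrated pinhole camera."""
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.array([x, y, z])

# Hypothetical 2D skeleton joints (pixels) with their depth readings (meters).
joints_2d = [(320.0, 240.0), (420.0, 240.0)]
depths = [2.0, 2.0]

# Assumed intrinsics for a typical VGA RGB-D sensor.
skeleton_3d = [lift_to_3d(u, v, d, fx=525.0, fy=525.0, cx=320.0, cy=240.0)
               for (u, v), d in zip(joints_2d, depths)]
```

Each camera would send such a 3D skeleton to the central node, which fuses the single-view outcomes across the calibrated network.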
T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects
We introduce T-LESS, a new public dataset for estimating the 6D pose, i.e.
translation and rotation, of texture-less rigid objects. The dataset features
thirty industry-relevant objects with no significant texture and no
discriminative color or reflectance properties. The objects exhibit symmetries
and mutual similarities in shape and/or size. Compared to other datasets, a
unique property is that some of the objects are parts of others. The dataset
includes training and test images that were captured with three synchronized
sensors, specifically a structured-light and a time-of-flight RGB-D sensor and
a high-resolution RGB camera. There are approximately 39K training and 10K test
images from each sensor. Additionally, two types of 3D models are provided for
each object, i.e. a manually created CAD model and a semi-automatically
reconstructed one. Training images depict individual objects against a black
background. Test images originate from twenty test scenes having varying
complexity, which increases from simple scenes with several isolated objects to
very challenging ones with multiple instances of several objects and with a
high amount of clutter and occlusion. The images were captured from a
systematically sampled view sphere around the object/scene, and are annotated
with accurate ground truth 6D poses of all modeled objects. Initial evaluation
results indicate that the state of the art in 6D object pose estimation has
ample room for improvement, especially in difficult cases with significant
occlusion. The T-LESS dataset is available online at cmp.felk.cvut.cz/t-less.
Comment: WACV 2017
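The 6D pose the dataset annotates, i.e. a rotation and a translation, maps object-model points into the camera frame. A minimal sketch of that transform (not the dataset's evaluation code; the rotation, translation, and model point below are invented example values):

```python
import numpy as np

def apply_pose(points, R, t):
    """Transform Nx3 object-model points into the camera frame using a
    6D pose given as a 3x3 rotation matrix R and a translation vector t."""
    return points @ R.T + t

# Hypothetical pose: 90-degree rotation about the z-axis, plus a translation.
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
t = np.array([0.1, 0.0, 0.5])

# A single model vertex on the object's x-axis.
model = np.array([[1.0, 0.0, 0.0]])
transformed = apply_pose(model, R, t)
```

Pose-estimation methods evaluated on T-LESS predict (R, t) per object instance; error metrics then compare model points transformed by the estimated and ground-truth poses.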