1,479 research outputs found

Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

Author: Aponte Pablo
Ballan Luca
Gall Juergen
Pollefeys Marc
Srikantha Abhilash
Tzionas Dimitrios
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/03/2016

Hand motion capture is a popular research field, recently gaining more attention due to the ubiquity of RGB-D sensors. However, even most recent approaches focus on the case of a single isolated hand. In this work, we focus on hands that interact with other hands or objects and present a framework that successfully captures motion in such interaction scenarios for both rigid and articulated objects. Our framework combines a generative model with discriminatively trained salient points to achieve a low tracking error and with collision detection and physics simulation to achieve physically plausible estimates even in case of occlusions and missing visual data. Since all components are unified in a single objective function which is almost everywhere differentiable, it can be optimized with standard optimization techniques. Our approach works for monocular RGB-D sequences as well as setups with multiple synchronized RGB cameras. For a qualitative and quantitative evaluation, we captured 29 sequences with a large variety of interactions and up to 150 degrees of freedom.

Comment: Accepted for publication by the International Journal of Computer Vision (IJCV) on 16.02.2016 (submitted on 17.10.14). A combination into a single framework of an ECCV'12 multicamera-RGB and a monocular-RGBD GCPR'14 hand tracking paper with several extensions, additional experiments and detail

arXiv.org e-Print Archive

MPG.PuRe

Co-Fusion: Real-time Segmentation, Tracking and Fusion of Multiple Objects

Author: Agapito Lourdes
Rünz Martin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/06/2017

In this paper we introduce Co-Fusion, a dense SLAM system that takes a live stream of RGB-D images as input and segments the scene into different objects (using either motion or semantic cues) while simultaneously tracking and reconstructing their 3D shape in real time. We use a multiple model fitting approach where each object can move independently from the background and still be effectively tracked and its shape fused over time using only the information from pixels associated with that object label. Previous attempts to deal with dynamic scenes have typically considered moving regions as outliers, and consequently do not model their shape or track their motion over time. In contrast, we enable the robot to maintain 3D models for each of the segmented objects and to improve them over time through fusion. As a result, our system can enable a robot to maintain a scene description at the object level which has the potential to allow interactions with its working environment; even in the case of dynamic scenes.

Comment: International Conference on Robotics and Automation (ICRA) 2017, http://visual.cs.ucl.ac.uk/pubs/cofusion, https://github.com/martinruenz/co-fusio

arXiv.org e-Print Archive

Crossref

UCL Discovery

Unsupervised learning of human motion

Author: Goncalves Luis
Perona Pietro
Song Yang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2003

An unsupervised learning algorithm that can obtain a probabilistic model of an object composed of a collection of parts (a moving human body in our examples) automatically from unlabeled training data is presented. The training data include both useful "foreground" features as well as features that arise from irrelevant background clutter - the correspondence between parts and detected features is unknown. The joint probability density function of the parts is represented by a mixture of decomposable triangulated graphs which allow for fast detection. To learn the model structure as well as model parameters, an EM-like algorithm is developed where the labeling of the data (part assignments) is treated as hidden variables. The unsupervised learning technique is not limited to decomposable triangulated graphs. The efficiency and effectiveness of our algorithm are demonstrated by applying it to generate models of human motion automatically from unlabeled image sequences, and testing the learned models on a variety of sequences

CiteSeerX

Caltech Authors

Model-Based High-Dimensional Pose Estimation with Application to Hand Tracking

Author: Mohr Daniel
Publication date: 01/01/2012

This thesis presents novel techniques for computer vision based full-DOF human hand motion estimation. Our main contributions are: A robust skin color estimation approach; A novel resolution-independent and memory efficient representation of hand pose silhouettes, which allows us to compute area-based similarity measures in near-constant time; A set of new segmentation-based similarity measures; A new class of similarity measures that work for nearly arbitrary input modalities; A novel edge-based similarity measure that avoids any problematic thresholding or discretizations and can be computed very efficiently in Fourier space; A template hierarchy to minimize the number of similarity computations needed for finding the most likely hand pose observed; And finally, a novel image space search method, which we naturally combine with our hierarchy. Consequently, matching can efficiently be formulated as a simultaneous template tree traversal and function maximization

E-LIB Dokumentserver - Staats und Universitätsbibliothek Bremen

Cognitive Robotics in Industrial Environments

Author: Heinz Wörn
Jürgen Graf
Stephan Puls
Publication venue: 'IntechOpen'
Publication date: 01/01/2012

IntechOpen

Crossref

KITopen

Computer-based training system for cataract surgery

Author: Allard Jérémie
Comas Olivier
Cotin Stéphane
Courtecuisse Hadrien
Dequidt Jérémie
Dumortier Elodie
Duriez Christian
Rouland Jean-Francois
Wavreille Olivier
Publication venue: 'SAGE Publications'
Publication date: 01/01/2013

This paper describes a single simulation framework to perform interactive cataract surgery simulations. Contributions include advanced bio-mechanical models and intensive use of modern graphics hardware to provide fast computation times. Surgical devices are replicated and located in real time thanks to infra-red tracking. Combining a high-fidelity simulation with actual surgical tools can improve surgeon immersion while training. Preliminary tests have been performed by experienced ophthalmologists to qualitatively assess the face-validity of the simulator and the faithfulness of the behavior of the anatomical structures as well as the interactions with the surgical tools

HAL - Lille 3

INRIA a CCSD electronic archive server

HAL Descartes

Finite Element Based Tracking of Deforming Surfaces

Author: Lang Jochen
Shu Chang
Tekieh Motahareh
Wuhrer Stefanie
Publication venue: 'Elsevier BV'
Publication date: 28/10/2014

We present an approach to robustly track the geometry of an object that deforms over time from a set of input point clouds captured from a single viewpoint. The deformations we consider are caused by applying forces to known locations on the object's surface. Our method combines the use of prior information on the geometry of the object modeled by a smooth template and the use of a linear finite element method to predict the deformation. This allows the accurate reconstruction of both the observed and the unobserved sides of the object. We present tracking results for noisy low-quality point clouds acquired by either a stereo camera or a depth camera, and simulations with point clouds corrupted by different error terms. We show that our method is also applicable to large non-linear deformations.

Comment: additional experiment

arXiv.org e-Print Archive

CiteSeerX

NRC Publications Archive