Search CORE

834 research outputs found

Hand Keypoint Detection in Single Images using Multiview Bootstrapping

Author: Joo Hanbyul
Matthews Iain
Sheikh Yaser
Simon Tomas
Publication venue
Publication date: 25/04/2017
Field of study

We present an approach that uses a multi-camera system to train fine-grained detectors for keypoints that are prone to occlusion, such as the joints of a hand. We call this procedure multiview bootstrapping: first, an initial keypoint detector is used to produce noisy labels in multiple views of the hand. The noisy detections are then triangulated in 3D using multiview geometry or marked as outliers. Finally, the reprojected triangulations are used as new labeled training data to improve the detector. We repeat this process, generating more labeled data in each iteration. We derive a result analytically relating the minimum number of views to achieve target true and false positive rates for a given detector. The method is used to train a hand keypoint detector for single images. The resulting keypoint detector runs in realtime on RGB images and has accuracy comparable to methods that use depth sensors. The single view detector, triangulated over multiple views, enables 3D markerless hand motion capture with complex object interactions.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

Markerless Motion Capture in the Crowd

Author: Bregler Christoph
Huston Thomas
Spiro Ian
Publication venue
Publication date: 01/01/2012
Field of study

This work uses crowdsourcing to obtain motion capture data from video recordings. The data is obtained by information workers who click repeatedly to indicate body configurations in the frames of a video, resulting in a model of 2D structure over time. We discuss techniques to optimize the tracking task and strategies for maximizing accuracy and efficiency. We show visualizations of a variety of motions captured with our pipeline then apply reconstruction techniques to derive 3D structure.Comment: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991

arXiv.org e-Print Archive

CiteSeerX

EventCap: Monocular 3D Capture of High-Speed Human Motions using an Event Camera

Author: Fang L.
Golyanik V.
Habermann M.
Theobalt C.
Xu L.
Xu W.
Publication venue
Publication date: 01/01/2019
Field of study

The high frame rate is a critical requirement for capturing fast human motions. In this setting, existing markerless image-based methods are constrained by the lighting requirement, the high data bandwidth and the consequent high computation overhead. In this paper, we propose EventCap --- the first approach for 3D capturing of high-speed human motions using a single event camera. Our method combines model-based optimization and CNN-based human pose detection to capture high-frequency motion details and to reduce the drifting in the tracking. As a result, we can capture fast motions at millisecond resolution with significantly higher data efficiency than using high frame rate videos. Experiments on our new event-based fast human motion dataset demonstrate the effectiveness and accuracy of our method, as well as its robustness to challenging lighting conditions

MPG.PuRe

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

Author: Joo Hanbyul
Sheikh Yaser
Simon Tomas
Publication venue
Publication date: 04/01/2018
Field of study

We present a unified deformation model for the markerless capture of multiple scales of human movement, including facial expressions, body motion, and hand gestures. An initial model is generated by locally stitching together models of the individual parts of the human body, which we refer to as the "Frankenstein" model. This model enables the full expression of part movements, including face and hands by a single seamless model. Using a large-scale capture of people wearing everyday clothes, we optimize the Frankenstein model to create "Adam". Adam is a calibrated model that shares the same skeleton hierarchy as the initial model but can express hair and clothing geometry, making it directly usable for fitting people as they normally appear in everyday life. Finally, we demonstrate the use of these models for total motion tracking, simultaneously capturing the large-scale body movements and the subtle face and hand motion of a social group of people

arXiv.org e-Print Archive

Crossref

Multi-frame scene-flow estimation using a patch model and smooth motion prior

Author: Bhalerao Abhir
Popham T. J.
Wilson Roland
Publication venue: BMVA Press
Publication date: 01/01/2010
Field of study

This paper addresses the problem of estimating the dense 3D motion of a scene over several frames using a set of calibrated cameras. Most current 3D motion estimation techniques are limited to estimating the motion over a single frame, unless a strong prior model of the scene (such as a skeleton) is introduced. Estimating the 3D motion of a general scene is difficult due to untextured surfaces, complex movements and occlusions. In this paper, we show that it is possible to track the surfaces of a scene over several frames, by introducing an effective prior on the scene motion. Experimental results show that the proposed method estimates the dense scene-flow over multiple frames, without the need for multiple-view reconstructions at every frame. Furthermore, the accuracy of the proposed method is demonstrated by comparing the estimated motion against a ground truth

CiteSeerX

Warwick Research Archives Portal Repository

{EventCap}: {M}onocular {3D} Capture of High-Speed Human Motions Using an Event Camera

Author: Fang L.
Golyanik V.
Habermann M.
Theobalt C.
Xu L.
Xu W.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Crossref

MPG.PuRe