Search CORE

3,986 research outputs found

Capturing Hands in Action using Discriminative Salient Points and Physics Simulation

Author: Aponte Pablo
Ballan Luca
Gall Juergen
Pollefeys Marc
Srikantha Abhilash
Tzionas Dimitrios
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/03/2016
Field of study

Hand motion capture is a popular research field, recently gaining more attention due to the ubiquity of RGB-D sensors. However, even most recent approaches focus on the case of a single isolated hand. In this work, we focus on hands that interact with other hands or objects and present a framework that successfully captures motion in such interaction scenarios for both rigid and articulated objects. Our framework combines a generative model with discriminatively trained salient points to achieve a low tracking error and with collision detection and physics simulation to achieve physically plausible estimates even in case of occlusions and missing visual data. Since all components are unified in a single objective function which is almost everywhere differentiable, it can be optimized with standard optimization techniques. Our approach works for monocular RGB-D sequences as well as setups with multiple synchronized RGB cameras. For a qualitative and quantitative evaluation, we captured 29 sequences with a large variety of interactions and up to 150 degrees of freedom.Comment: Accepted for publication by the International Journal of Computer Vision (IJCV) on 16.02.2016 (submitted on 17.10.14). A combination into a single framework of an ECCV'12 multicamera-RGB and a monocular-RGBD GCPR'14 hand tracking paper with several extensions, additional experiments and detail

arXiv.org e-Print Archive

MPG.PuRe

Reconstruction of Patient-Specific Bone Models from X-Ray Radiography

Author: El Dakhakhni Hatem Amin Abdel Fattah
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/12/2013
Field of study

The availability of a patient‐specific bone model has become an increasingly invaluable addition to orthopedic case evaluation and planning [1]. Utilized within a wide range of specialized visualization and analysis tools, such models provide unprecedented wealth of bone shape information previously unattainable using traditional radiographic imaging [2]. In this work, a novel bone reconstruction method from two or more x‐ray images is described. This method is superior to previous attempts in terms of accuracy and repeatability. The new technique accurately models the radiological scene in a way that eliminates the need for expensive multi‐planar radiographic imaging systems. It is also flexible enough to allow for both short and long film imaging using standard radiological protocols, which makes the technology easily utilized in standard clinical setups

University of Tennessee, Knoxville: Trace

XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

Author: Elgharib Mohamed
Fua Pascal
Mehta Dushyant
Mueller Franziska
Pons-Moll Gerard
Rhodin Helge
Seidel Hans-Peter
Sotnychenko Oleksandr
Theobalt Christian
Xu Weipeng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2020
Field of study

We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates successfully in generic scenes which may contain occlusions by objects and by other people. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals.We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy. In the second stage, a fully connected neural network turns the possibly partial (on account of occlusion) 2Dpose and 3Dpose features for each subject into a complete 3Dpose estimate per individual. The third stage applies space-time skeletal model fitting to the predicted 2D and 3D pose per subject to further reconcile the 2D and 3D pose, and enforce temporal coherence. Our method returns the full skeletal pose in joint angles for each subject. This is a further key distinction from previous work that do not produce joint angle results of a coherent skeleton in real time for multi-person scenes. The proposed system runs on consumer hardware at a previously unseen speed of more than 30 fps given 512x320 images as input while achieving state-of-the-art accuracy, which we will demonstrate on a range of challenging real-world scenes.Comment: To appear in ACM Transactions on Graphics (SIGGRAPH) 202

arXiv.org e-Print Archive

MPG.PuRe

XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera

Author: Elgharib M.
Fua P.
Mehta D.
Mueller F.
Pons-Moll G.
Rhodin H.
Seidel H.
Sotnychenko O.
Theobalt C.
Xu W.
Publication venue
Publication date: 01/01/2019
Field of study

We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates in generic scenes and is robust to difficult occlusions both by other people and objects. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible joints of all individuals. We contribute a new architecture for this CNN, called SelecSLS Net, that uses novel selective long and short range skip connections to improve the information flow allowing for a drastically faster network without compromising accuracy. In the second stage, a fully-connected neural network turns the possibly partial (on account of occlusion) 2D pose and 3D pose features for each subject into a complete 3D pose estimate per individual. The third stage applies space-time skeletal model fitting to the predicted 2D and 3D pose per subject to further reconcile the 2D and 3D pose, and enforce temporal coherence. Our method returns the full skeletal pose in joint angles for each subject. This is a further key distinction from previous work that neither extracted global body positions nor joint angle results of a coherent skeleton in real time for multi-person scenes. The proposed system runs on consumer hardware at a previously unseen speed of more than 30 fps given 512x320 images as input while achieving state-of-the-art accuracy, which we will demonstrate on a range of challenging real-world scenes

arXiv.org e-Print Archive

MPG.PuRe

Hand Keypoint Detection in Single Images using Multiview Bootstrapping

Author: Joo Hanbyul
Matthews Iain
Sheikh Yaser
Simon Tomas
Publication venue
Publication date: 25/04/2017
Field of study

We present an approach that uses a multi-camera system to train fine-grained detectors for keypoints that are prone to occlusion, such as the joints of a hand. We call this procedure multiview bootstrapping: first, an initial keypoint detector is used to produce noisy labels in multiple views of the hand. The noisy detections are then triangulated in 3D using multiview geometry or marked as outliers. Finally, the reprojected triangulations are used as new labeled training data to improve the detector. We repeat this process, generating more labeled data in each iteration. We derive a result analytically relating the minimum number of views to achieve target true and false positive rates for a given detector. The method is used to train a hand keypoint detector for single images. The resulting keypoint detector runs in realtime on RGB images and has accuracy comparable to methods that use depth sensors. The single view detector, triangulated over multiple views, enables 3D markerless hand motion capture with complex object interactions.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

3D Elbow Kinematics with Monoplanar Fluoroscopy: In Silico Evaluation

Author: L.Tersi
R.Stagni
S. Fantozzi
Publication venue
Publication date: 01/01/2010
Field of study

An in-silico assessment of the performance of 3D video-fluoroscopy for the analysis of the kinematics of long bones is proposed. A reliable knowledge of in-vivo joints kinematics in physiological conditions is fundamental in the clinical field. 3D video-fluoroscopy theoretically permits a mm/deg level of accuracy in joint motion analysis, but the optimization algorithm for the pose estimation is highly dependent on the geometry of the bone segment analyzed. An automated technique based on distance maps and tangency condition was applied to the elbow bones. The convergence domain was explored to quantify and optimize measurement accuracy in terms of bias and precision. By conditioning the optimization algorithm using simple image features, the estimation error had small dispersion (interquartile range within 0.5 and 0.025 mm/deg for out-of-plane and in-plane pose parameters, resp.), but with occasional bias and outliers. 3D video-fluoroscopy produced promising results for the elbow joint, but further in-vitro validation studies should be carried out

Springer - Publisher Connector

Directory of Open Access Journals

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Open Access Repository

Registration of 3D models to 2D X-ray images using fast X-ray simulation and global optimisation algorithms

Author: Wen Tianci
Publication venue
Publication date: 31/07/2023
Field of study

Bangor University Research Portal