7 research outputs found

    Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

    Full text link
    To facilitate the analysis of human actions, interactions and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified, 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X directly from images is challenging without paired images and 3D ground truth. Consequently, we follow the approach of SMPLify, which estimates 2D features and then optimizes model parameters to fit the features. We improve on SMPLify in several significant ways: (1) we detect 2D features corresponding to the face, hands, and feet and fit the full SMPL-X model to these; (2) we train a new neural network pose prior using a large MoCap dataset; (3) we define a new interpenetration penalty that is both fast and accurate; (4) we automatically detect gender and the appropriate body models (male, female, or neutral); (5) our PyTorch implementation achieves a speedup of more than 8x over Chumpy. We use the new method, SMPLify-X, to fit SMPL-X to both controlled images and images in the wild. We evaluate 3D accuracy on a new curated dataset comprising 100 images with pseudo ground-truth. This is a step towards automatic expressive human capture from monocular RGB data. The models, code, and data are available for research purposes at https://smpl-x.is.tue.mpg.de.Comment: To appear in CVPR 201

    Alleviating Human-level Shift : A Robust Domain Adaptation Method for Multi-person Pose Estimation

    Full text link
    Human pose estimation has been widely studied with much focus on supervised learning requiring sufficient annotations. However, in real applications, a pretrained pose estimation model usually need be adapted to a novel domain with no labels or sparse labels. Such domain adaptation for 2D pose estimation hasn't been explored. The main reason is that a pose, by nature, has typical topological structure and needs fine-grained features in local keypoints. While existing adaptation methods do not consider topological structure of object-of-interest and they align the whole images coarsely. Therefore, we propose a novel domain adaptation method for multi-person pose estimation to conduct the human-level topological structure alignment and fine-grained feature alignment. Our method consists of three modules: Cross-Attentive Feature Alignment (CAFA), Intra-domain Structure Adaptation (ISA) and Inter-domain Human-Topology Alignment (IHTA) module. The CAFA adopts a bidirectional spatial attention module (BSAM)that focuses on fine-grained local feature correlation between two humans to adaptively aggregate consistent features for adaptation. We adopt ISA only in semi-supervised domain adaptation (SSDA) to exploit the corresponding keypoint semantic relationship for reducing the intra-domain bias. Most importantly, we propose an IHTA to learn more domain-invariant human topological representation for reducing the inter-domain discrepancy. We model the human topological structure via the graph convolution network (GCN), by passing messages on which, high-order relations can be considered. This structure preserving alignment based on GCN is beneficial to the occluded or extreme pose inference. Extensive experiments are conducted on two popular benchmarks and results demonstrate the competency of our method compared with existing supervised approaches.Comment: Accepted By ACM MM'202

    PoTion: Pose MoTion Representation for Action Recognition

    Get PDF
    International audienceMost state-of-the-art methods for action recognition rely on a two-stream architecture that processes appearance and motion independently. In this paper, we claim that considering them jointly offers rich information for action recognition. We introduce a novel representation that gracefully encodes the movement of some semantic keypoints. We use the human joints as these keypoints and term our Pose moTion representation PoTion. Specifically, we first run a state-of-the-art human pose estimator [4] and extract heatmaps for the human joints in each frame. We obtain our PoTion representation by temporally aggregating these probability maps. This is achieved by 'colorizing' each of them depending on the relative time of the frames in the video clip and summing them. This fixed-size representation for an entire video clip is suitable to classify actions using a shallow convolutional neural network. Our experimental evaluation shows that PoTion outper-forms other state-of-the-art pose representations [6, 48]. Furthermore, it is complementary to standard appearance and motion streams. When combining PoTion with the recent two-stream I3D approach [5], we obtain state-of-the-art performance on the JHMDB, HMDB and UCF101 datasets

    Accurate 3D Body Shape Regression using Metric and Semantic Attributes

    Full text link
    While methods that regress 3D human meshes from images have progressed rapidly, the estimated body shapes often do not capture the true human shape. This is problematic since, for many applications, accurate body shape is as important as pose. The key reason that body shape accuracy lags pose accuracy is the lack of data. While humans can label 2D joints, and these constrain 3D pose, it is not so easy to "label" 3D body shape. Since paired data with images and 3D body shape are rare, we exploit two sources of information: (1) we collect internet images of diverse "fashion" models together with a small set of anthropometric measurements; (2) we collect linguistic shape attributes for a wide range of 3D body meshes and the model images. Taken together, these datasets provide sufficient constraints to infer dense 3D shape. We exploit the anthropometric measurements and linguistic shape attributes in several novel ways to train a neural network, called SHAPY, that regresses 3D human pose and shape from an RGB image. We evaluate SHAPY on public benchmarks, but note that they either lack significant body shape variation, ground-truth shape, or clothing variation. Thus, we collect a new dataset for evaluating 3D human shape estimation, called HBW, containing photos of "Human Bodies in the Wild" for which we have ground-truth 3D body scans. On this new benchmark, SHAPY significantly outperforms state-of-the-art methods on the task of 3D body shape estimation. This is the first demonstration that 3D body shape regression from images can be trained from easy-to-obtain anthropometric measurements and linguistic shape attributes. Our model and data are available at: shapy.is.tue.mpg.deComment: First two authors contributed equall

    Is prolonged infusion of piperacillin/tazobactam and meropenem in critically ill patients associated with improved pharmacokinetic/pharmacodynamic and patient outcomes? An observation from the Defining Antibiotic Levels in Intensive care unit patients (DALI) cohort

    No full text
    Objectives: We utilized the database of the Defining Antibiotic Levels in Intensive care unit patients (DALI) study to statistically compare the pharmacokinetic/pharmacodynamic and clinical outcomes between prolonged- infusion and intermittent-bolus dosing of piperacillin/tazobactam and meropenem in critically ill patients using inclusion criteria similar to those used in previous prospective studies. Methods: This was a post hoc analysis of a prospective, multicentre pharmacokinetic point-prevalence study (DALI), which recruited a large cohort of critically ill patients from 68 ICUs across 10 countries. Results: Of the 211 patients receiving piperacillin/tazobactam and meropenem in the DALI study, 182 met inclusion criteria. Overall, 89.0% (162/182) of patients achieved the most conservative target of 50% fT 65MIC (time over which unbound or free drug concentration remains above the MIC). Decreasing creatinine clearance and the use of prolonged infusion significantly increased the PTA for most pharmacokinetic/pharmacodynamic targets. In the subgroup of patients who had respiratory infection, patients receiving \u3b2-lactams via prolonged infusion demonstrated significantly better 30 day survival when compared with intermittent-bolus patients [86.2% (25/29) versus 56.7% (17/30); P=0.012]. Additionally, in patients with a SOFA score of 65 9, administration by prolonged infusion compared with intermittent-bolus dosing demonstrated significantly better clinical cure [73.3% (11/15) versus 35.0% (7/20); P=0.035] and survival rates [73.3% (11/15) versus 25.0% (5/20); P=0.025]. Conclusions: Analysis of this large dataset has provided additional data on the niche benefits of administration of piperacillin/tazobactam and meropenem by prolonged infusion in critically ill patients, particularly for patients with respiratory infection
    corecore