D&D: Learning Human Dynamics from Dynamic Camera

Bian, Siyuan; Li, Jiefeng; Liu, Gang; Lu, Cewu; Xu, Chao; Yu, Gang

D&D: Learning Human Dynamics from Dynamic Camera

Authors: Siyuan Bian
Jiefeng Li
Gang Liu
Cewu Lu
Chao Xu
Gang Yu
Publication date: 19 September 2022
Publisher

Abstract

3D human pose estimation from a monocular video has recently seen significant improvements. However, most state-of-the-art methods are kinematics-based, which are prone to physically implausible motions with pronounced artifacts. Current dynamics-based methods can predict physically plausible motion but are restricted to simple scenarios with static camera view. In this work, we present D&D (Learning Human Dynamics from Dynamic Camera), which leverages the laws of physics to reconstruct 3D human motion from the in-the-wild videos with a moving camera. D&D introduces inertial force control (IFC) to explain the 3D human motion in the non-inertial local frame by considering the inertial forces of the dynamic camera. To learn the ground contact with limited annotations, we develop probabilistic contact torque (PCT), which is computed by differentiable sampling from contact probabilities and used to generate motions. The contact state can be weakly supervised by encouraging the model to generate correct motions. Furthermore, we propose an attentive PD controller that adjusts target pose states using temporal information to obtain smooth and accurate pose control. Our approach is entirely neural-based and runs without offline optimization or simulation in physics engines. Experiments on large-scale 3D human motion benchmarks demonstrate the effectiveness of D&D, where we exhibit superior performance against both state-of-the-art kinematics-based and dynamics-based methods. Code is available at https://github.com/Jeffsjtu/DnDComment: ECCV 2022 (Oral

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2209.08790

Last time updated on 14/11/2022