Search CORE

991 research outputs found

Flowing ConvNets for Human Pose Estimation in Videos

Author: Charles James
Pfister Tomas
Zisserman Andrew
Publication venue
Publication date: 08/11/2015
Field of study

The objective of this work is human pose estimation in videos, where multiple frames are available. We investigate a ConvNet architecture that is able to benefit from temporal context by combining information across the multiple frames using optical flow. To this end we propose a network architecture with the following novelties: (i) a deeper network than previously investigated for regressing heatmaps; (ii) spatial fusion layers that learn an implicit spatial model; (iii) optical flow is used to align heatmap predictions from neighbouring frames; and (iv) a final parametric pooling layer which learns to combine the aligned heatmaps into a pooled confidence map. We show that this architecture outperforms a number of others, including one that uses optical flow solely at the input layers, one that regresses joint coordinates directly, and one that predicts heatmaps without spatial fusion. The new architecture outperforms the state of the art by a large margin on three video pose estimation datasets, including the very challenging Poses in the Wild dataset, and outperforms other deep methods that don't use a graphical model on the single-image FLIC benchmark (and also Chen & Yuille and Tompson et al. in the high precision region).Comment: ICCV'1

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive

Recommended from our members

Segmentation of Exercise Repetitions Enabling Real-Time Patient Analysis and Feedback Using a Single Exemplar

Author: Brown David
Langensiepen Caroline
Lewis James
Logan Pip
Sarsfield Joe
Selwood Louise
Sherkat Nasser
Standen Penny
Taheri Mohammad
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/04/2019
Field of study

We present a segmentation algorithm capable of segmenting exercise repetitions in real-time. This approach uses subsequence dynamic time warping and requires only a single exemplar repetition of an exercise to correctly segment repetitions from other subjects, including those with limited mobility. This approach is invariant to low range of motion, instability in movements and sensor noise while remaining selective to different exercises. This algorithm enables responsive feedback for technology-assisted physical rehabilitation systems. We evaluated the algorithm against a publicly available dataset (CMU) and against a healthy population and stroke patient population performing rehabilitation exercises captured on a consumer-level depth sensor. We show the algorithm can consistently achieve correct segmentation in real-time

Nottingham Trent Institutional Repository (IRep)

Sheffield Hallam University Research Archive

3D Point Capsule Networks

Author: Birdal Tolga
Deng Haowen
Tombari Federico
Zhao Yongheng
Publication venue
Publication date: 01/01/2018
Field of study

In this paper, we propose 3D point-capsule networks, an auto-encoder designed to process sparse 3D point clouds while preserving spatial arrangements of the input data. 3D capsule networks arise as a direct consequence of our novel unified 3D auto-encoder formulation. Their dynamic routing scheme and the peculiar 2D latent space deployed by our approach bring in improvements for several common point cloud-related tasks, such as object classification, object reconstruction and part segmentation as substantiated by our extensive evaluations. Moreover, it enables new applications such as part interpolation and replacement.Comment: As published in CVPR 2019 (camera ready version), with supplementary materia

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Padova

3D Point Capsule Networks

Author: Birdal Tolga
Deng Haowen
Tombari Federico
ZHAO YONGHENG
Publication venue
Publication date: 01/01/2018
Field of study

Archivio istituzionale della ricerca - Università di Padova