Deep representation learning for human motion prediction and classification
Generative models of 3D human motion are often restricted to a small number
of activities and can therefore not generalize well to novel movements or
applications. In this work we propose a deep learning framework for human
motion capture data that learns a generic representation from a large corpus of
motion capture data and generalizes well to new, unseen, motions. Using an
encoding-decoding network that learns to predict future 3D poses from the most
recent past, we extract a feature representation of human motion. Most work on
deep learning for sequence prediction focuses on video and speech. Since
skeletal data has a different structure, we present and evaluate different
network architectures that make different assumptions about time dependencies
and limb correlations. To quantify the learned features, we use the output of
different layers for action classification and visualize the receptive fields
of the network units. Our method outperforms the recent state of the art in
skeletal motion prediction even though these methods use action-specific training data.
Our results show that deep feedforward networks, trained from a generic mocap
database, can successfully be used for feature extraction from human motion
data and that this representation can be used as a foundation for
classification and prediction.
Comment: This paper is published at the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
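The encode-predict idea described above can be illustrated with a toy sketch (hypothetical names, pure Python; the paper's actual model is a learned deep encoding-decoding network, not this hand-coded stand-in): a window of recent past poses is compressed into a feature vector, and a decoder extrapolates the next pose from it.

```python
# Toy illustration of predicting the next 3D pose from the recent past.
# The names and the linear rules here are illustrative assumptions only.

def encode(past_poses):
    """Toy encoder: average the past poses into one feature vector."""
    n = len(past_poses)
    dim = len(past_poses[0])
    return [sum(p[j] for p in past_poses) / n for j in range(dim)]

def decode(feature, last_pose):
    """Toy decoder: extrapolate beyond the last pose, away from the
    window mean (equivalent to linear extrapolation for this encoder)."""
    return [2 * last_pose[j] - feature[j] for j in range(len(feature))]

def predict_next(past_poses):
    """Predict the next pose from a window of past poses."""
    return decode(encode(past_poses), past_poses[-1])
```

For a joint moving at constant velocity, e.g. `[[0, 0], [1, 1], [2, 2]]`, this sketch continues the motion to `[3, 3]`; a trained network would instead learn such regularities from a large mocap corpus.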
Discovery and recognition of motion primitives in human activities
We present a novel framework for the automatic discovery and recognition of
motion primitives in videos of human activities. Given the 3D pose of a human
in a video, human motion primitives are discovered by optimizing the `motion
flux', a quantity which captures the motion variation of a group of skeletal
joints. A normalization of the primitives is proposed in order to make them
invariant with respect to a subject anatomical variations and data sampling
rate. The discovered primitives are unknown and unlabeled and are
unsupervisedly collected into classes via a hierarchical non-parametric Bayes
mixture model. Once classes are determined and labeled they are further
analyzed for establishing models for recognizing discovered primitives. Each
primitive model is defined by a set of learned parameters.
Given new video data and the estimated pose of the subject appearing in the
video, the motion is segmented into primitives, which are recognized with a
probability computed from the parameters of the learned models.
Using our framework we build a publicly available dataset of human motion
primitives, using sequences taken from well-known motion capture datasets. We
expect that our framework, by providing an objective way for discovering and
categorizing human motion, will be a useful tool in numerous research fields
including video analysis, human inspired motion generation, learning by
demonstration, intuitive human-robot interaction, and human behavior analysis.
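The abstract's "motion flux" can be approximated by a toy proxy (a simplification I am assuming for illustration, not the paper's exact definition): the accumulated displacement magnitude of a group of skeletal joints across consecutive frames, which is large where motion varies and near zero for a held pose.

```python
import math

def motion_flux(trajectory, joint_group):
    """Toy proxy for 'motion flux': total displacement magnitude of a
    group of joints across consecutive frames. `trajectory` is a list of
    dicts mapping joint name -> coordinate tuple (illustrative format)."""
    flux = 0.0
    for t in range(1, len(trajectory)):
        for joint in joint_group:
            prev, cur = trajectory[t - 1][joint], trajectory[t][joint]
            flux += math.dist(prev, cur)
    return flux
```

Primitive boundaries could then be placed where this quantity peaks or vanishes over a sliding window; the paper optimizes its own flux criterion rather than thresholding a sum like this.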
Single camera pose estimation using Bayesian filtering and Kinect motion priors
Traditional approaches to upper body pose estimation using monocular vision
rely on complex body models and a large variety of geometric constraints. We
argue that this is not ideal and somewhat inelegant as it results in large
processing burdens, and instead attempt to incorporate these constraints
through priors obtained directly from training data. A prior distribution
covering the probability of a human pose occurring is used to incorporate
likely human poses. This distribution is obtained offline, by fitting a
Gaussian mixture model to a large dataset of recorded human body poses, tracked
using a Kinect sensor. We combine this prior information with a random walk
transition model to obtain an upper body model, suitable for use within a
recursive Bayesian filtering framework. Our model can be viewed as a mixture of
discrete Ornstein-Uhlenbeck processes, in that states behave as random walks,
but drift towards a set of typically observed poses. This model is combined
with measurements of the human head and hand positions, using recursive
Bayesian estimation to incorporate temporal information. Measurements are
obtained using face detection and a simple skin colour hand detector, trained
using the detected face. The suggested model is designed with analytical
tractability in mind and we show that the pose tracking can be
Rao-Blackwellised using the mixture Kalman filter, allowing for computational
efficiency while still incorporating bio-mechanical properties of the upper
body. In addition, the use of the proposed upper body model allows reliable
three-dimensional pose estimates to be obtained indirectly for a number of
joints that are often difficult to detect using traditional object recognition
strategies. Comparisons with Kinect sensor results and the state of the art in
2D pose estimation highlight the efficacy of the proposed approach.
Comment: 25 pages, Technical report, related to Burke and Lasenby, AMDO 2014 conference paper. Code sample: https://github.com/mgb45/SignerBodyPose Video: https://www.youtube.com/watch?v=dJMTSo7-uF
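The "random walks that drift towards a set of typically observed poses" can be sketched as a discrete Ornstein-Uhlenbeck-style update (a minimal sketch with assumed names; the paper's model additionally mixes several attractors from a Gaussian mixture and adds process noise):

```python
def ou_step(state, attractor, alpha, noise=0.0):
    """One discrete Ornstein-Uhlenbeck-style transition: the state takes a
    step toward a typically observed pose (the 'attractor', e.g. a Gaussian
    mixture component mean), plus an optional random-walk perturbation.
    `alpha` in (0, 1] controls the drift strength."""
    return [x + alpha * (m - x) + noise for x, m in zip(state, attractor)]
```

Iterating this update with zero noise converges geometrically to the attractor, which is the "drift towards typical poses" behaviour; in the full model the attractor choice is a discrete latent variable, enabling the Rao-Blackwellised mixture Kalman filter mentioned in the abstract.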
Skeleton-based action recognition using translation-scale invariant image mapping and multi-scale deep CNN
This paper presents an image classification based approach to the
skeleton-based video action recognition problem. First, a dataset-independent
translation-scale invariant image mapping method is proposed, which transforms
the skeleton videos into colour images, named skeleton-images. Second, a
multi-scale deep convolutional neural network (CNN) architecture is proposed,
which can be built on and fine-tuned from powerful pre-trained CNNs, e.g.,
AlexNet, VGGNet, and ResNet. Even though the skeleton-images are very
different from natural images, the fine-tuning strategy still works well.
Finally, we show that our method also works well on 2D skeleton video data.
We achieve state-of-the-art results on the popular benchmark datasets, e.g.,
NTU RGB+D, UTD-MHAD, MSRC-12, and G3D. In particular, on the large and
challenging NTU RGB+D, UTD-MHAD, and MSRC-12 datasets, our method outperforms
other methods by a large margin, which demonstrates the efficacy of the
proposed method.
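A translation-scale invariant skeleton-to-image mapping can be sketched as follows (a simplified illustration under assumed conventions, not the paper's exact encoding): min-max normalise the joint coordinates of a clip into [0, 255], so each frame becomes a row of "pixels" whose three channels are the normalised x, y, z of each joint.

```python
def skeleton_to_image(frames):
    """Toy translation-scale invariant mapping: normalise all joint
    coordinates of a clip into [0, 255] and lay them out as a
    (frames x joints x 3) 'colour image'. `frames` is a list of frames,
    each a list of (x, y, z) joint tuples (illustrative format)."""
    vals = [v for frame in frames for joint in frame for v in joint]
    lo, hi = min(vals), max(vals)
    scale = 255.0 / (hi - lo) if hi > lo else 0.0
    return [[[round((v - lo) * scale) for v in joint] for joint in frame]
            for frame in frames]
```

Because the normalisation subtracts the clip minimum and divides by the clip range, shifting or uniformly scaling the whole skeleton leaves the image unchanged, which is the invariance the abstract claims; the resulting images can then be fed to a pre-trained CNN for fine-tuning.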
P-CNN: Pose-based CNN Features for Action Recognition
This work targets human action recognition in video. While recent methods
typically represent actions by statistics of local video features, here we
argue for the importance of a representation derived from human pose. To this
end we propose a new Pose-based Convolutional Neural Network descriptor (P-CNN)
for action recognition. The descriptor aggregates motion and appearance
information along tracks of human body parts. We investigate different schemes
of temporal aggregation and experiment with P-CNN features obtained both for
automatically estimated and manually annotated human poses. We evaluate our
method on the recent and challenging JHMDB and MPII Cooking datasets. For both
datasets our method shows consistent improvement over the state of the art.
Comment: ICCV, December 2015, Santiago, Chile
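The temporal aggregation schemes investigated for P-CNN can be illustrated with a minimal sketch (assumed names; the actual descriptor aggregates CNN motion and appearance features along body-part tracks, here reduced to plain per-frame vectors): pool each feature dimension over time with, e.g., a max or min operator.

```python
def aggregate(per_frame_features, scheme="max"):
    """Toy temporal aggregation of per-frame part descriptors: pool each
    feature dimension over all frames with 'max' or 'min'. Concatenating
    several such pooled vectors yields a fixed-length video descriptor."""
    agg = min if scheme == "min" else max
    dim = len(per_frame_features[0])
    return [agg(f[j] for f in per_frame_features) for j in range(dim)]
```

Max pooling keeps the strongest response of each feature across the clip regardless of when it fires, which is why such schemes tolerate videos of different lengths.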
A flexible sensor technology for the distributed measurement of interaction pressure
We present a sensor technology for the measurement of physical human-robot interaction pressure, developed in recent years at Scuola Superiore Sant'Anna. The system is composed of flexible matrices of opto-electronic sensors covered by a soft silicone cover. This sensory system is completely modular and scalable, allowing one to cover areas of any size and shape and to measure different pressure ranges. In this work we present the main application areas for this technology. A first generation of the system was used to monitor human-robot interaction in upper-limb (NEUROExos; Scuola Superiore Sant'Anna) and lower-limb (LOPES; University of Twente) exoskeletons for rehabilitation. A second generation, with increased resolution and wireless connection, was used to develop a pressure-sensitive foot insole and an improved human-robot interaction measurement system. The experimental characterization of the latter system, along with its validation on three healthy subjects, is presented here for the first time. A perspective on future uses and development of the technology is finally outlined.
Sensing with the Motor Cortex
The primary motor cortex is a critical node in the network of brain regions responsible for voluntary motor behavior. It has been less appreciated, however, that the motor cortex exhibits sensory responses in a variety of modalities, including vision and somatosensation. We review current work that emphasizes the heterogeneity of sensorimotor responses in the motor cortex and focus on its implications for cortical control of movement as well as for brain-machine interface development.