
    Activity Representation from Video Using Statistical Models on Shape Manifolds

    Activity recognition from video data is a key computer vision problem with applications in surveillance, elderly care, etc. This problem is associated with modeling a representative shape that contains significant information about the underlying activity. In this dissertation, we present several approaches for view-invariant activity recognition by modeling shapes on various shape spaces and Riemannian manifolds. The first two parts of this dissertation deal with activity modeling and recognition using tracks of landmark feature points. The motion trajectories of points extracted from objects involved in the activity are used to build deformation shape models for each activity, and these models are used for classification and detection of unusual activities. In the first part of the dissertation, these models are represented by the recovered 3D deformation basis shapes corresponding to the activity, using a non-rigid structure from motion formulation. We use a theory for estimating the amount of deformation for these models from the visual data. We study the special case of ground plane activities in detail because of its importance in video surveillance applications. In the second part of the dissertation, we propose to model the activity by learning an affine-invariant deformation subspace representation that captures the space of possible body poses associated with the activity. These subspaces can be viewed as points on a Grassmann manifold. We propose several statistical classification models on the Grassmann manifold that capture the statistical variations of the shape data while following the intrinsic Riemannian geometry of these manifolds. The last part of this dissertation addresses the problem of recognizing human gestures from silhouette images. We represent a human gesture as a temporal sequence of human poses, each characterized by a contour of the associated human silhouette. The shape of a contour is viewed as a point on the shape space of closed curves and, hence, each gesture is characterized and modeled as a trajectory on this shape space. We utilize the Riemannian geometry of this space to propose a template-based approach and a graphical-model-based approach for modeling these trajectories. The two models are designed to account for the different invariance requirements in gesture recognition, and also to capture the statistical variations associated with the contour data.
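    The core geometric operation behind these Grassmann-manifold models is comparing deformation subspaces by their principal angles. The sketch below shows that operation under simplifying assumptions: the SVD-based subspace extraction, the function names, and the nearest-subspace classifier are illustrative stand-ins, not the dissertation's actual statistical models on the manifold.

```python
import numpy as np

def subspace_basis(trajectories, k):
    """Orthonormal basis of a top-k deformation subspace.

    trajectories: (n_points, n_frames) array of landmark coordinates
    (a hypothetical layout; the dissertation's exact shape features differ).
    """
    centered = trajectories - trajectories.mean(axis=1, keepdims=True)
    U, _, _ = np.linalg.svd(centered, full_matrices=False)
    return U[:, :k]

def grassmann_distance(U1, U2):
    """Geodesic distance on the Grassmann manifold via principal angles."""
    sigma = np.linalg.svd(U1.T @ U2, compute_uv=False)
    theta = np.arccos(np.clip(sigma, -1.0, 1.0))
    return np.sqrt(np.sum(theta ** 2))

def classify(query_basis, class_bases):
    """Nearest-class-subspace rule under the geodesic metric
    (an illustrative baseline, not the intrinsic statistical models)."""
    dists = {label: grassmann_distance(query_basis, B)
             for label, B in class_bases.items()}
    return min(dists, key=dists.get)
```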

    Gesture Modeling by Hanklet-based Hidden Markov Model

    In this paper, we propose a novel approach for gesture modeling. We aim at decomposing a gesture into sub-trajectories that are the output of a sequence of atomic linear time invariant (LTI) systems, and we use a Hidden Markov Model to model the transitions from one LTI system to another. For this purpose, we represent the human body motion in a temporal window as a set of body joint trajectories that we assume are the output of an LTI system. We describe the set of trajectories in a temporal window by the corresponding Hankel matrix (Hanklet), which embeds the observability matrix of the LTI system that produced it. We train a set of HMMs (one for each gesture class) with a discriminative approach. To account for the sharing of body motion templates, we allow the HMMs to share the same state space. We demonstrate by means of experiments on two publicly available datasets that, even when considering only the trajectories of the 3D joints, our method achieves state-of-the-art accuracy, competing well with methods that employ more complex models and feature representations.
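    The Hanklet itself is straightforward to build from a trajectory window. The sketch below assumes the window is an array of stacked joint coordinates; the Frobenius normalization and the dissimilarity shown are common choices in the Hankel-matrix action-recognition literature and are purely illustrative, since the paper feeds Hanklets into discriminatively trained HMMs rather than a nearest-neighbour metric.

```python
import numpy as np

def hanklet(window, num_block_rows):
    """Block Hankel matrix (Hanklet) of a joint-trajectory window.

    window: (T, d) array, one row per frame of stacked joint coordinates
    (d = 3 * number of joints for 3D skeletons).
    num_block_rows: number of time-shifted block rows; it bounds the order
    of the LTI system assumed to have generated the window.
    """
    T, d = window.shape
    cols = T - num_block_rows + 1
    H = np.vstack([window[i:i + cols].T for i in range(num_block_rows)])
    return H / np.linalg.norm(H)  # Frobenius normalization

def hanklet_dissimilarity(H1, H2):
    """An illustrative dissimilarity between normalized Hanklets of the
    same block size; not the paper's HMM-based gesture model."""
    return 2.0 - np.linalg.norm(H1 @ H1.T + H2 @ H2.T)
```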

    Comparison of Activity Recognition Using 2D and 3D Skeletal Joint Data


    Arbitrary view action recognition via transfer dictionary learning on synthetic training data

    Human action recognition is an important problem in robotic vision. Traditional recognition algorithms usually require knowledge of the viewing angle, which is not always available in robotic applications such as active vision. In this paper, we propose a new framework to recognize actions from arbitrary views. A main feature of our algorithm is that view-invariance is learned from synthetic 2D and 3D training data using transfer dictionary learning. This guarantees the availability of training data and removes the hassle of obtaining real-world video at specific viewing angles. The result of the process is a dictionary that can project real-world 2D video into a view-invariant sparse representation. This facilitates the training of a view-invariant classifier. Experimental results on the IXMAS and N-UCLA datasets show significant improvements over existing algorithms.
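    The projection step can be illustrated with off-the-shelf dictionary learning. The sketch below uses scikit-learn's DictionaryLearning as a stand-in: it learns a single dictionary on hypothetical synthetic descriptors and classifies real 2D descriptors by their sparse codes, whereas the paper's transfer formulation couples dictionaries learned from synthetic 2D and 3D data.

```python
import numpy as np
from sklearn.decomposition import DictionaryLearning
from sklearn.svm import LinearSVC

# Hypothetical feature matrices: one row per clip descriptor.
# X_synth : synthetic multi-view training descriptors, shape (n_clips, n_features)
# y_synth : their action labels
# X_real  : real-world 2D video descriptors to classify

def train_view_invariant_classifier(X_synth, y_synth, n_atoms=256, alpha=1.0):
    """Learn a dictionary on synthetic data, then train a classifier on the
    resulting sparse codes. Plain dictionary learning only; the paper's
    transfer formulation that couples 2D and 3D dictionaries is omitted."""
    dico = DictionaryLearning(n_components=n_atoms, alpha=alpha,
                              transform_algorithm='lasso_lars')
    codes = dico.fit_transform(X_synth)
    clf = LinearSVC().fit(codes, y_synth)
    return dico, clf

def predict(dico, clf, X_real):
    """Project real 2D descriptors into the sparse-code space and classify."""
    return clf.predict(dico.transform(X_real))
```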