Search CORE

4,711 research outputs found

STV-based Video Feature Processing for Action Recognition

Author: Wang Jing
Xu Zhijie
Publication venue: 'Elsevier BV'
Publication date: 01/08/2012
Field of study

In comparison to still image-based processes, video features can provide rich and intuitive information about dynamic events occurred over a period of time, such as human actions, crowd behaviours, and other subject pattern changes. Although substantial progresses have been made in the last decade on image processing and seen its successful applications in face matching and object recognition, video-based event detection still remains one of the most difficult challenges in computer vision research due to its complex continuous or discrete input signals, arbitrary dynamic feature definitions, and the often ambiguous analytical methods. In this paper, a Spatio-Temporal Volume (STV) and region intersection (RI) based 3D shape-matching method has been proposed to facilitate the definition and recognition of human actions recorded in videos. The distinctive characteristics and the performance gain of the devised approach stemmed from a coefficient factor-boosted 3D region intersection and matching mechanism developed in this research. This paper also reported the investigation into techniques for efficient STV data filtering to reduce the amount of voxels (volumetric-pixels) that need to be processed in each operational cycle in the implemented system. The encouraging features and improvements on the operational performance registered in the experiments have been discussed at the end

University of Huddersfield Repository

Huddersfield Research Portal

Dance-the-music : an educational platform for the modeling, recognition and audiovisual monitoring of dance steps using spatiotemporal motion templates

Author: Amelynck Denis
Leman Marc
Maes Pieter-Jan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

In this article, a computational platform is presented, entitled “Dance-the-Music”, that can be used in a dance educational context to explore and learn the basics of dance steps. By introducing a method based on spatiotemporal motion templates, the platform facilitates to train basic step models from sequentially repeated dance figures performed by a dance teacher. Movements are captured with an optical motion capture system. The teachers’ models can be visualized from a first-person perspective to instruct students how to perform the specific dance steps in the correct manner. Moreover, recognition algorithms-based on a template matching method can determine the quality of a student’s performance in real time by means of multimodal monitoring techniques. The results of an evaluation study suggest that the Dance-the-Music is effective in helping dance students to master the basics of dance figures

Springer - Publisher Connector

Ghent University Academic Bibliography

Dynamic gesture recognition using PCA with multi-scale theory and HMM

Author: Sutherland Alistair
Wu Hai
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/10/2001
Field of study

In this paper, a dynamic gesture recognition system is presented which requires no special hardware other than a Webcam. The system is based on a novel method combining Principal Component Analysis (PCA) with hierarchical multi-scale theory and Discrete Hidden Markov Models (DHMM). We use a hierarchical decision tree based on multiscale theory. Firstly we convolve all members of the training data with a Gaussian kernel, which blurs differences between images and reduces their separation in feature space. This reduces the number of eigenvectors needed to describe the data. A principal component space is computed from the convolved data. We divide the data in this space into two clusters using the k-means algorithm. Then the level of blurring is reduced and PCA is applied to each of the clusters separately. A new principal component space is formed from each cluster. Each of these spaces is then divided into two and the process is repeated. We thus produce a binary tree of principal component spaces where each level of the tree represents a different degree of blurring. The search time is then proportional to the depth of the tree, which makes it possible to search hundreds of gestures in real time. The output of the decision tree is then input into DHMM to recognize temporal information

Crossref

Irish Universities

DCU Online Research Access Service

Sensitivity analysis of hand movement classification technique using motion templates

Author: Kumar D
Kumar S
Sharma A
Publication venue: IEEE (Piscataway, USA)
Publication date: 01/01/2004
Field of study

This paper presents the sensitivity analysis of a new technique for automated classification of human hand gestures based on Hu moments for robotics applications. It uses view-based approach for representation, and statistical technique for classification. This approach uses a cumulative image-difference technique where the time between the sequences of images is implicitly captured in the representation of action. This results in the construction of temporal history templates (THTs). These THTs are used to compute the 7 Hu image moments that are invariant to scale, rotation, and translation. The recognition criterion is established using K-nearest neighbor (K-NN) Mahalanobis distance. The preliminary experiments show that such a system can classify human hand gestures with a classification accuracy of 92%. This research has been conducted for medical and robotics framework. The overall goal of our research is to test for accuracy of the recognition of hand gestures using this computationally inexpensive way of dimensionality-reduced representation of gestures for its suitability for medical and robotic application

RMIT Research Repository

Action Recognition in Videos: from Motion Capture Labs to the Web

Author: Ana Paula Br
Arnaldo Albuquerque De Araújo
De Almeida
Eduardo Alves
Jussara Marques
Publication venue
Publication date: 17/06/2010
Field of study

This paper presents a survey of human action recognition approaches based on visual data recorded from a single video camera. We propose an organizing framework which puts in evidence the evolution of the area, with techniques moving from heavily constrained motion capture scenarios towards more challenging, realistic, "in the wild" videos. The proposed organization is based on the representation used as input for the recognition task, emphasizing the hypothesis assumed and thus, the constraints imposed on the type of video that each technique is able to address. Expliciting the hypothesis and constraints makes the framework particularly useful to select a method, given an application. Another advantage of the proposed organization is that it allows categorizing newest approaches seamlessly with traditional ones, while providing an insightful perspective of the evolution of the action recognition task up to now. That perspective is the basis for the discussion in the end of the paper, where we also present the main open issues in the area.Comment: Preprint submitted to CVIU, survey paper, 46 pages, 2 figures, 4 table

arXiv.org e-Print Archive

CiteSeerX

Descriptive temporal template features for visual motion recognition

Author: Aggarwal
Bobick
Bradski
Cristianini
Davis
Farmer
Green
Hongying Meng
Meng
Moeslund
Nick Pears
Ogata
Stauffer
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

In this paper, a human action recognition system is proposed. The system is based on new, descriptive `temporal template' features in order to achieve high-speed recognition in real-time, embedded applications. The limitations of the well known `Motion History Image' (MHI) temporal template are addressed and a new `Motion History Histogram' (MHH) feature is proposed to capture more motion information in the video. MHH not only provides rich motion information, but also remains computationally inexpensive. To further improve classification performance, we combine both MHI and MHH into a low dimensional feature vector which is processed by a support vector machine (SVM). Experimental results show that our new representation can achieve a significant improvement in the performance of human action recognition over existing comparable methods, which use 2D temporal template based representations

University of Lincoln Institutional Repository

Crossref

Brunel University Research Archive