Human action classification based on sequential bag-of-words model
Recently, approaches utilizing spatio-temporal features have achieved great success in human action classification. However, they typically rely on the bag-of-words (BoW) model and ignore the spatial and temporal structure of visual words, which introduces ambiguity among similar actions. In this paper, we present a novel approach called sequential BoWs for efficient human action classification. It captures temporal sequential structure by segmenting the entire action into sub-actions, each covering a tiny movement within a narrow range of the action. The sequential BoWs are then created, in which each sub-action is assigned a weight and salience to highlight the most distinguishing sections. The weight and salience are computed in advance according to each sub-action's discriminative power as evaluated on training data. Finally, the sub-actions are classified separately and vote for a unified result. Experiments are conducted on the UT-Interaction and Rochester datasets. The results show higher robustness and accuracy than most state-of-the-art classification approaches. © 2014 IEEE.
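The pipeline described in this abstract (segment the action into sub-actions, build one BoW histogram per segment, and combine weighted per-segment votes) can be sketched as below. The segment count, the L1 distance, and the nearest-neighbour voting rule are illustrative assumptions, not details taken from the paper:

```python
import numpy as np

def sequential_bow(word_ids, n_words, n_segments):
    """Split a sequence of per-frame visual-word ids into equal-length
    sub-actions and build one normalised BoW histogram per segment."""
    segments = np.array_split(np.asarray(word_ids), n_segments)
    hists = []
    for seg in segments:
        h = np.bincount(seg, minlength=n_words).astype(float)
        h /= max(h.sum(), 1.0)            # L1-normalise each sub-action histogram
        hists.append(h)
    return np.stack(hists)                # shape: (n_segments, n_words)

def classify(query, train_set, weights):
    """Weighted voting: each sub-action votes for its nearest training class
    (L1 distance), and the votes are combined with per-segment weights."""
    votes = {}
    for s, w in enumerate(weights):
        best_label, best_dist = None, np.inf
        for label, hists in train_set.items():
            d = np.abs(hists[s] - query[s]).sum()
            if d < best_dist:
                best_label, best_dist = label, d
        votes[best_label] = votes.get(best_label, 0.0) + w
    return max(votes, key=votes.get)
```

In this sketch the per-segment weights stand in for the paper's learned salience values; in the paper they are estimated from the training data rather than fixed by hand.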
Mining Mid-level Features for Action Recognition Based on Effective Skeleton Representation
Recently, mid-level features have shown promising performance in computer vision. Mid-level features learned by incorporating class-level information are potentially more discriminative than traditional low-level local features. In this paper, an effective method is proposed to extract mid-level features from Kinect skeletons for 3D human action recognition. Firstly, the orientations of limbs connected by two skeleton joints are computed, and each orientation is encoded into one of 27 states indicating the spatial relationship of the joints. Secondly, limbs are combined into parts, and the limbs' states are mapped into part states. Finally, frequent pattern mining is employed to mine the most frequent and relevant (discriminative, representative and non-redundant) states of parts over several consecutive frames. These parts are referred to as Frequent Local Parts, or FLPs. The FLPs allow us to build a powerful bag-of-FLP-based action representation. This new representation yields state-of-the-art results on MSR DailyActivity3D and MSR ActionPairs3D
- …
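One natural way to realize the 27-state limb encoding above is to quantise each coordinate of the limb vector to -1, 0 or +1 and read the resulting triple as a base-3 number, since 3^3 = 27. The sign threshold `eps` and the base-3 packing are assumptions for illustration; the paper may define the states differently:

```python
import numpy as np

def limb_state(joint_a, joint_b, eps=0.01):
    """Encode the orientation of a limb (the vector from joint_a to
    joint_b) as one of 27 discrete states.  Each of dx, dy, dz is
    quantised to -1, 0 or +1 (magnitudes below eps count as 0), and the
    three quantised digits are packed as a base-3 number in 0..26."""
    v = np.asarray(joint_b, dtype=float) - np.asarray(joint_a, dtype=float)
    digits = [0 if abs(c) < eps else (1 if c > 0 else -1) for c in v]
    return sum((d + 1) * 3**i for i, d in enumerate(digits))
```

A per-frame skeleton then maps to one state per limb, and sequences of these states are what the frequent-pattern-mining step would consume.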