Search CORE

466 research outputs found

Efficient and effective human action recognition in video through motion boundary description with a compact set of trajectories

Author: De Neve Wesley
Kim Hyung-Il
Ro Yong Man
Seo Jeong-Jik
Son Jisoo
Publication venue
Publication date: 01/01/2015
Field of study

Human action recognition (HAR) is at the core of human-computer interaction and video scene understanding. However, achieving effective HAR in an unconstrained environment is still a challenging task. To that end, trajectory-based video representations are currently widely used. Despite the promising levels of effectiveness achieved by these approaches, problems regarding computational complexity and the presence of redundant trajectories still need to be addressed in a satisfactory way. In this paper, we propose a method for trajectory rejection, reducing the number of redundant trajectories without degrading the effectiveness of HAR. Furthermore, to realize efficient optical flow estimation prior to trajectory extraction, we integrate a method for dynamic frame skipping. Experiments with four publicly available human action datasets show that the proposed approach outperforms state-of-the-art HAR approaches in terms of effectiveness, while simultaneously mitigating the computational complexity

Ghent University Academic Bibliography

A robust and efficient video representation for action recognition

Author: Oneata Dan
Schmid Cordelia
Verbeek Jakob
Wang Heng
Publication venue
Publication date: 21/04/2015
Field of study

This paper introduces a state-of-the-art video representation and applies it to efficient action recognition and detection. We first propose to improve the popular dense trajectory features by explicit camera motion estimation. More specifically, we extract feature point matches between frames using SURF descriptors and dense optical flow. The matches are used to estimate a homography with RANSAC. To improve the robustness of homography estimation, a human detector is employed to remove outlier matches from the human body as human motion is not constrained by the camera. Trajectories consistent with the homography are considered as due to camera motion, and thus removed. We also use the homography to cancel out camera motion from the optical flow. This results in significant improvement on motion-based HOF and MBH descriptors. We further explore the recent Fisher vector as an alternative feature encoding approach to the standard bag-of-words histogram, and consider different ways to include spatial layout information in these encodings. We present a large and varied set of evaluations, considering (i) classification of short basic actions on six datasets, (ii) localization of such actions in feature-length movies, and (iii) large-scale recognition of complex events. We find that our improved trajectory features significantly outperform previous dense trajectories, and that Fisher vectors are superior to bag-of-words encodings for video recognition tasks. In all three tasks, we show substantial improvements over the state-of-the-art results

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Action Recognition with Improved Trajectories

Author: Schmid Cordelia
Wang Heng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2013
Field of study

International audienceRecently dense trajectories were shown to be an efficient video representation for action recognition and achieved state-of-the-art results on a variety of datasets. This paper improves their performance by taking into account camera motion to correct them. To estimate camera motion, we match feature points between frames using SURF descriptors and dense optical flow, which are shown to be complementary. These matches are, then, used to robustly estimate a homography with RANSAC. Human motion is in general different from camera motion and generates inconsistent matches. To improve the estimation, a human detector is employed to remove these matches. Given the estimated camera motion, we remove trajectories consistent with it. We also use this estimation to cancel out camera motion from the optical flow. This significantly improves motion-based descriptors, such as HOF and MBH. Experimental results on four challenging action datasets (i.e., Hollywood2, HMDB51, Olympic Sports and UCF50) significantly outperform the current state of the art

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Capturing the relative distribution of features for action recognition

Author: Bowden R
Gilbert A
Oshin O
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/03/2011
Field of study

This paper presents an approach to the categorisation of spatio-temporal activity in video, which is based solely on the relative distribution of feature points. Introducing a Relative Motion Descriptor for actions in video, we show that the spatio-temporal distribution of features alone (without explicit appearance information) effectively describes actions, and demonstrate performance consistent with state-of-the-art. Furthermore, we propose that for actions where noisy examples exist, it is not optimal to group all action examples as a single class. Therefore, rather than engineering features that attempt to generalise over noisy examples, our method follows a different approach: We make use of Random Sampling Consensus (RANSAC) to automatically discover and reject outlier examples within classes. We evaluate the Relative Motion Descriptor and outlier rejection approaches on four action datasets, and show that outlier rejection using RANSAC provides a consistent and notable increase in performance, and demonstrate superior performance to more complex multiple-feature based approaches

Crossref

University of Surrey

Surrey Research Insight

Dense trajectories and motion boundary descriptors for action recognition

Author: Kläser Alexander
Liu Cheng-Lin
Schmid Cordelia
Wang Heng
Publication venue: HAL CCSD
Publication date: 01/01/2012
Field of study

This paper introduces a video representation based on dense trajectories and motion boundary descriptors. Trajectories capture the local motion information of the video. A dense representation guarantees a good coverage of foreground motion as well as of the surrounding context. A state-of-the-art optical flow algorithm enables a robust and efficient extraction of the dense trajectories. As descriptors we extract features aligned with the trajectories to characterize shape (point coordinates), appearance (histograms of oriented gradients) and motion (histograms of optical flow). Additionally, we introduce a descriptor based on motion boundary histograms (MBH) which rely on differential optical flow. The MBH descriptor shows to consistently outperform other state-of-the-art descriptors, in particular on real-world videos that contain a significant amount of camera motion. We evaluate our video representation in the context of action classification on eight datasets, namely KTH, YouTube, Hollywood2, UCF sports, IXMAS, UIUC, Olympic Sports and UCF50. On all datasets our approach outperforms current state-of-the-art results

CiteSeerX

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-CIRAD

Professional-Scientific Education: Rethinking the Concept of Knowledge: a Cultural-historical ‘Recontextualization’ Perspective

Author: Guile David
Publication venue: 'Verlag Barbara Budrich GmbH'
Publication date: 01/05/2022
Field of study

Throughout the second half of the twentieth century the mode of knowledge production diversified and now embraces disciplinary and inter- and trans-disciplinary knowledge. This chapter addresses the implications of these changes for the concept of professional-scientific education by firstly creating a conversation between two different perspectives – ‘reflective practice’ and the ‘trinary’ – on professional scientific education. Secondly, critically appraising these perspectives in relation to ongoing changes in knowledge production. Thirdly, offering a new perspective on professional-scientific knowledge – ‘continuous recontextualisation’ – which incorporates the insights of the reflective and trinary positions, anticipates future changes in knowledge production and, importantly, relates both to work practice. professional-scientific knowledge, reflective practice, the trinary, recontextualization, machine learning

UCL Discovery