Search CORE

22,983 research outputs found

Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling

Author: Do Minh N.
Mac Khoi-Nguyen C.
Vo Minh P.
Publication venue
Publication date: 14/07/2022
Field of study

Adaptive sampling that exploits the spatiotemporal redundancy in videos is critical for always-on action recognition on wearable devices with limited computing and battery resources. The commonly used fixed sampling strategy is not context-aware and may under-sample the visual content, and thus adversely impacts both computation efficiency and accuracy. Inspired by the concepts of foveal vision and pre-attentive processing from the human visual perception mechanism, we introduce a novel adaptive spatiotemporal sampling scheme for efficient action recognition. Our system pre-scans the global scene context at low-resolution and decides to skip or request high-resolution features at salient regions for further processing. We validate the system on EPIC-KITCHENS and UCF-101 datasets for action recognition, and show that our proposed approach can greatly speed up inference with a tolerable loss of accuracy compared with those from state-of-the-art baselines. Source code is available in https://github.com/knmac/adaptive_spatiotemporal

arXiv.org e-Print Archive

Machine Analysis of Facial Expressions

Author: Bartlett M.S.
Pantic M.
Publication venue: I-Tech Education and Publishing
Publication date: 01/01/2007
Field of study

No abstract

IntechOpen

CiteSeerX

Crossref

University of Twente Research Information

Selected extended abstracts from the 7th Annual Postgraduate Conference

Author: Battersby Stuart
Pachoud Samuel
Publication venue
Publication date: 30/12/2013
Field of study

Queen Mary Research Online

On the Integration of Optical Flow and Action Recognition

Author: D Sun
DJ Butler
G Johansson
G Varol
J Koenderink
JL Barron
L Wang
S Baker
S Roth
T Brox
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Crossref

Edinburgh Research Explorer

MPG.PuRe

Deep Learning Approaches for Seizure Video Analysis: A Review

Author: Ahmedt-Aristizabal David
Armin Mohammad Ali
Denman Simon
Fookes Clinton
Garcia-Cairasco Norberto
Hayder Zeeshan
McGonigal Aileen
Petersson Lars
Publication venue
Publication date: 04/03/2024
Field of study

Seizure events can manifest as transient disruptions in the control of movements which may be organized in distinct behavioral sequences, accompanied or not by other observable features such as altered facial expressions. The analysis of these clinical signs, referred to as semiology, is subject to observer variations when specialists evaluate video-recorded events in the clinical setting. To enhance the accuracy and consistency of evaluations, computer-aided video analysis of seizures has emerged as a natural avenue. In the field of medical applications, deep learning and computer vision approaches have driven substantial advancements. Historically, these approaches have been used for disease detection, classification, and prediction using diagnostic data; however, there has been limited exploration of their application in evaluating video-based motion detection in the clinical epileptology setting. While vision-based technologies do not aim to replace clinical expertise, they can significantly contribute to medical decision-making and patient care by providing quantitative evidence and decision support. Behavior monitoring tools offer several advantages such as providing objective information, detecting challenging-to-observe events, reducing documentation efforts, and extending assessment capabilities to areas with limited expertise. The main applications of these could be (1) improved seizure detection methods; (2) refined semiology analysis for predicting seizure type and cerebral localization. In this paper, we detail the foundation technologies used in vision-based systems in the analysis of seizure videos, highlighting their success in semiology detection and analysis, focusing on work published in the last 7 years. Additionally, we illustrate how existing technologies can be interconnected through an integrated system for video-based semiology analysis.Comment: Accepted in Epilepsy & Behavio

arXiv.org e-Print Archive