Search CORE

768 research outputs found

Hierarchical eyelid and face tracking

Author: Gonzàlez Jordi
Orozco Francisco J.
Rius Ignasi
Roca Francesc Xavier
Publication venue: Springer Verlag
Publication date: 01/01/2007
Field of study

Most applications on Human Computer Interaction (HCI) require to extract the movements of user faces, while avoiding high memory and time expenses. Moreover, HCI systems usually use low-cost cameras, while current face tracking techniques strongly depend on the image resolution. In this paper, we tackle the problem of eyelid tracking by using Appearance-Based Models, thus achieving accurate estimations of the movements of the eyelids, while avoiding cues, which require high-resolution faces, such as edge detectors or colour information. Consequently, we can track the fast and spontaneous movements of the eyelids, a very hard task due to the small resolution of the eye regions. Subsequently, we combine the results of eyelid tracking with the estimations of other facial features, such as the eyebrows and the lips. As a result, a hierarchical tracking framework is obtained: we demonstrate that combining two appearance-based trackers allows to get accurate estimates for the eyelid, eyebrows, lips and also the 3D head pose by using low-cost video cameras and in real-time. Therefore, our approach is shown suitable to be used for further facial-expression analysis.Peer Reviewe

Learning Grimaces by Watching TV

Author: Albanie Samuel
Vedaldi Andrea
Publication venue
Publication date: 01/01/2016
Field of study

Differently from computer vision systems which require explicit supervision, humans can learn facial expressions by observing people in their environment. In this paper, we look at how similar capabilities could be developed in machine vision. As a starting point, we consider the problem of relating facial expressions to objectively measurable events occurring in videos. In particular, we consider a gameshow in which contestants play to win significant sums of money. We extract events affecting the game and corresponding facial expressions objectively and automatically from the videos, obtaining large quantities of labelled data for our study. We also develop, using benchmarks such as FER and SFEW 2.0, state-of-the-art deep neural networks for facial expression recognition, showing that pre-training on face verification data can be highly beneficial for this task. Then, we extend these models to use facial expressions to predict events in videos and learn nameable expressions from them. The dataset and emotion recognition models are available at http://www.robots.ox.ac.uk/~vgg/data/facevalueComment: British Machine Vision Conference (BMVC) 201

arXiv.org e-Print Archive

Oxford University Research Archive

Accountable, Explainable Artificial Intelligence Incorporation Framework for a Real-Time Affective State Assessment Module

Author: Vice Jordan Joshua
Publication venue: Curtin University
Publication date: 01/01/2022
Field of study

The rapid growth of artificial intelligence (AI) and machine learning (ML) solutions has seen it adopted across various industries. However, the concern of ‘black-box’ approaches has led to an increase in the demand for high accuracy, transparency, accountability, and explainability in AI/ML approaches. This work contributes through an accountable, explainable AI (AXAI) framework for delineating and assessing AI systems. This framework has been incorporated into the development of a real-time, multimodal affective state assessment system

espace@Curtin

User Experience and Digitally Transformed/Converted Emotions

Author: Muhammad Rafaat Mohammad Odeh
محمد رأفت محمد عوده
Publication venue: جامعة القدس
Publication date
Field of study

Al-Quds University Digital Repository

Eye center localization and gaze gesture recognition for human-computer interaction

Author: Abdul Farooq
Adams
Baek
Barbosa
Daugman
Daugman
Georghiades
Hamouz
Jaimes
Kapoor
Kim
Koenderink
Leo
Lyndon N. Smith
Melvyn L. Smith
Morgante
Pieters
Rozado
Valenti
Vidal
Villanueva
Viola
Wang
Wenhao Zhang
Zhu
Publication venue: 'The Optical Society'
Publication date: 01/03/2016
Field of study

© 2016 Optical Society of America. This paper introduces an unsupervised modular approach for accurate and real-time eye center localization in images and videos, thus allowing a coarse-to-fine, global-to-regional scheme. The trajectories of eye centers in consecutive frames, i.e., gaze gestures, are further analyzed, recognized, and employed to boost the human-computer interaction (HCI) experience. This modular approach makes use of isophote and gradient features to estimate the eye center locations. A selective oriented gradient filter has been specifically designed to remove strong gradients from eyebrows, eye corners, and shadows, which sabotage most eye center localization methods. A real-world implementation utilizing these algorithms has been designed in the form of an interactive advertising billboard to demonstrate the effectiveness of our method for HCI. The eye center localization algorithm has been compared with 10 other algorithms on the BioID database and six other algorithms on the GI4E database. It outperforms all the other algorithms in comparison in terms of localization accuracy. Further tests on the extended Yale Face Database b and self-collected data have proved this algorithm to be robust against moderate head poses and poor illumination conditions. The interactive advertising billboard has manifested outstanding usability and effectiveness in our tests and shows great potential for benefiting a wide range of real-world HCI applications