Search CORE

363,887 research outputs found

A Differential Approach for Gaze Estimation

Author: Liu Gang
Mora Kenneth A. Funes
Odobez Jean-Marc
Yu Yu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Non-invasive gaze estimation methods usually regress gaze directions directly from a single face or eye image. However, due to important variabilities in eye shapes and inner eye structures amongst individuals, universal models obtain limited accuracies and their output usually exhibit high variance as well as biases which are subject dependent. Therefore, increasing accuracy is usually done through calibration, allowing gaze predictions for a subject to be mapped to his/her actual gaze. In this paper, we introduce a novel image differential method for gaze estimation. We propose to directly train a differential convolutional neural network to predict the gaze differences between two eye input images of the same subject. Then, given a set of subject specific calibration images, we can use the inferred differences to predict the gaze direction of a novel eye sample. The assumption is that by allowing the comparison between two eye images, annoyance factors (alignment, eyelid closing, illumination perturbations) which usually plague single image prediction methods can be much reduced, allowing better prediction altogether. Experiments on 3 public datasets validate our approach which constantly outperforms state-of-the-art methods even when using only one calibration sample or when the latter methods are followed by subject specific gaze adaptation.Comment: Extension to our paper A differential approach for gaze estimation with calibration (BMVC 2018) Submitted to PAMI on Aug. 7th, 2018 Accepted by PAMI short on Dec. 2019, in IEEE Transactions on Pattern Analysis and Machine Intelligenc

arXiv.org e-Print Archive

Crossref

The Profiling Potential of Computer Vision and the Challenge of Computational Empiricism

Author: Alley Thomas
Andrejevic Mark
Apter Emily
Citron Danielle Keats
Clemens Justin
Gandy Oscar H
Humphries Paul
Jäger Jens
Krizhevsky Alex
Mann Monique
Marien Mary Warner
Selinger Evan
Todorov Alexander
Vanian Jonathan
Weatherby Leif
Willis J
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 22/04/2019
Field of study

Computer vision and other biometrics data science applications have commenced a new project of profiling people. Rather than using 'transaction generated information', these systems measure the 'real world' and produce an assessment of the 'world state' - in this case an assessment of some individual trait. Instead of using proxies or scores to evaluate people, they increasingly deploy a logic of revealing the truth about reality and the people within it. While these profiling knowledge claims are sometimes tentative, they increasingly suggest that only through computation can these excesses of reality be captured and understood. This article explores the bases of those claims in the systems of measurement, representation, and classification deployed in computer vision. It asks if there is something new in this type of knowledge claim, sketches an account of a new form of computational empiricism being operationalised, and questions what kind of human subject is being constructed by these technological systems and practices. Finally, the article explores legal mechanisms for contesting the emergence of computational empiricism as the dominant knowledge platform for understanding the world and the people within it

arXiv.org e-Print Archive

Crossref

Learning Grimaces by Watching TV

Author: Albanie Samuel
Vedaldi Andrea
Publication venue
Publication date: 01/01/2016
Field of study

Differently from computer vision systems which require explicit supervision, humans can learn facial expressions by observing people in their environment. In this paper, we look at how similar capabilities could be developed in machine vision. As a starting point, we consider the problem of relating facial expressions to objectively measurable events occurring in videos. In particular, we consider a gameshow in which contestants play to win significant sums of money. We extract events affecting the game and corresponding facial expressions objectively and automatically from the videos, obtaining large quantities of labelled data for our study. We also develop, using benchmarks such as FER and SFEW 2.0, state-of-the-art deep neural networks for facial expression recognition, showing that pre-training on face verification data can be highly beneficial for this task. Then, we extend these models to use facial expressions to predict events in videos and learn nameable expressions from them. The dataset and emotion recognition models are available at http://www.robots.ox.ac.uk/~vgg/data/facevalueComment: British Machine Vision Conference (BMVC) 201

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Deep Impression: Audiovisual Deep Residual Networks for Multimodal Apparent Personality Trait Recognition

Author: A Todorov
A Vinciarelli
A Vinciarelli
AG Wright
Arulkumar Subramaniam
B Schuller
CY Olivola
F Mairesse
GL Lorenzo
J Schmidhuber
J Willis
JI Biel
Kaiming He
L Teijeiro-Mosquera
LP Naumann
N Srivastava
P Borkenau
RJW Vernon
S Hochreiter
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/09/2016
Field of study

Here, we develop an audiovisual deep residual network for multimodal apparent personality trait recognition. The network is trained end-to-end for predicting the Big Five personality traits of people from their videos. That is, the network does not require any feature engineering or visual analysis such as face detection, face landmark alignment or facial expression recognition. Recently, the network won the third place in the ChaLearn First Impressions Challenge with a test accuracy of 0.9109

arXiv.org e-Print Archive

Crossref