1,489 research outputs found

Facial Expression Recognition

Author: Matuszewski Bogdan
Quan Wei
Shark Lik
Publication venue: IntechOpen
Publication date: 04/04/2011

IntechOpen

CLoK

Crossref

Interpretable Transformations with Encoder-Decoder Networks

Author: Brostow Gabriel J.
Garbin Stephan J.
Turmukhambetov Daniyar
Worrall Daniel E.
Publication date: 19/10/2017

Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the relative feature-space relationship between two rotated images? What is decoded when we interpolate in feature space? Ideally, we want to disentangle confounding factors, such as pose, appearance, and illumination, from object identity. Disentangling these is difficult because they interact in very nonlinear ways. We propose a simple method to construct a deep feature space, with explicitly disentangled representations of several known transformations. A person or algorithm can then manipulate the disentangled representation, for example, to re-render an image with explicit control over parameterized degrees of freedom. The feature space is constructed using a transforming encoder-decoder network with a custom feature transform layer, acting on the hidden representations. We demonstrate the advantages of explicit disentangling on a variety of datasets and transformations, and as an aid for traditional tasks, such as classification. Comment: Accepted at ICCV 201
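The abstract does not spell out how the feature transform layer acts on the hidden representation. As a minimal sketch of one possible reading, assume the encoded feature vector is split into consecutive 2D pairs and each pair is rotated by a known angle. The function name `feature_transform`, the pairing scheme, and the use of a plain 2D rotation are illustrative assumptions, not details confirmed by this listing.

```python
import math


def feature_transform(features, theta):
    """Rotate consecutive pairs of feature values by theta (radians).

    Assumption for illustration: the hidden representation is treated as a
    stack of 2D sub-vectors, and a known input transformation (such as an
    in-plane rotation) is applied to each sub-vector as a 2D rotation.
    """
    if len(features) % 2 != 0:
        raise ValueError("expected an even number of feature values")
    c, s = math.cos(theta), math.sin(theta)
    out = []
    for i in range(0, len(features), 2):
        x, y = features[i], features[i + 1]
        # Standard 2D rotation applied to the pair (x, y).
        out.extend([c * x - s * y, s * x + c * y])
    return out


# Hypothetical encoded features for one image.
z = [1.0, 0.0, 0.0, 2.0]

# Transforming in feature space by 90 degrees.
z_rot = feature_transform(z, math.pi / 2)

# Two successive transforms compose like a single transform by the summed angle.
z_twice = feature_transform(feature_transform(z, 0.3), 0.4)
z_once = feature_transform(z, 0.7)
```

Under this assumption, the transformation parameter is an explicit, controllable quantity: a decoder fed `z_rot` would be asked to render the transformed image, and interpolating `theta` gives intermediate feature vectors of unchanged norm.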

arXiv.org e-Print Archive

Crossref

Distinguishing Posed and Spontaneous Smiles by Facial Dynamics

Author: B Mandal
C Cortes
EG Krumhuber
G Farnebäck
H Dibeklioğlu
HY Wu
J Cohn
J Hadwin
K Schmidt
M Hoque
P Ekman
PF Felzenszwalb
Q Xu
V Ojansivu
X Yu
Z Ambadar
Z Zeng
Publication date: 17/02/2017

A smile is one of the key elements in identifying the emotions and present state of mind of an individual. In this work, we propose a cluster of approaches to classify posed and spontaneous smiles using deep convolutional neural network (CNN) face features, local phase quantization (LPQ), dense optical flow and histogram of gradient (HOG). Eulerian Video Magnification (EVM) is used for micro-expression smile amplification, along with three normalization procedures for distinguishing posed and spontaneous smiles. Although the deep CNN face model is trained with a large number of face images, HOG features outperform this model on the overall face smile classification task. Using EVM to amplify micro-expressions did not have a significant impact on classification accuracy, while normalizing facial features improved classification accuracy. Unlike many manual or semi-automatic methodologies, our approach aims to automatically classify all smiles into either 'spontaneous' or 'posed' categories, by using support vector machines (SVM). Experimental results on the large UvA-NEMO smile database show promising results as compared to other relevant methods. Comment: 16 pages, 8 figures, ACCV 2016, Second Workshop on Spontaneous Facial Behavior Analysi

arXiv.org e-Print Archive

Crossref

Machine Analysis of Facial Expressions

Author: Bartlett M.S.
Pantic M.
Publication venue: I-Tech Education and Publishing
Publication date: 01/01/2007

No abstract

IntechOpen

CiteSeerX

Crossref

University of Twente Research Information

UR-FUNNY: A Multimodal Language Dataset for Understanding Humor

Author: Hasan Md Kamrul
Hoque Mohammed
Morency Louis-Philippe
Rahman Wasifur
Tanveer Md Iftekhar
Zadeh Amir
Zhong Jianyuan
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2019

Humor is a unique and creative communicative behavior displayed during social interactions. It is produced in a multimodal manner, through the usage of words (text), gestures (vision) and prosodic cues (acoustic). Understanding humor from these three modalities falls within the boundaries of multimodal language, a recent research trend in natural language processing that models natural language as it happens in face-to-face communication. Although humor detection is an established research area in NLP, it remains understudied in a multimodal context. This paper presents a diverse multimodal dataset, called UR-FUNNY, to open the door to understanding multimodal language used in expressing humor. The dataset and accompanying studies present a framework in multimodal humor detection for the natural language processing community. UR-FUNNY is publicly available for research.

arXiv.org e-Print Archive

Crossref