Every Smile is Unique: Landmark-Guided Diverse Smile Generation
Every smile is unique: the same person smiles in different ways (e.g.,
closing or opening the eyes or mouth). Given one input image of a neutral face,
can we generate multiple smile videos with distinctive characteristics? To
tackle this one-to-many video generation problem, we propose a novel deep
learning architecture named Conditional Multi-Mode Network (CMM-Net). To better
encode the dynamics of facial expressions, CMM-Net explicitly exploits facial
landmarks for generating smile sequences. Specifically, a variational
auto-encoder is used to learn a facial landmark embedding. This single
embedding is then exploited by a conditional recurrent network which generates
a landmark embedding sequence conditioned on a specific expression (e.g.,
spontaneous smile). Next, the generated landmark embeddings are fed into a
multi-mode recurrent landmark generator, producing a set of landmark sequences
still associated with the given smile class but clearly distinct from each other.
Finally, these landmark sequences are translated into face videos. Our
experimental results demonstrate the effectiveness of our CMM-Net in generating
realistic videos of multiple smile expressions.

Comment: Accepted as a poster at the Conference on Computer Vision and Pattern
Recognition (CVPR), 201
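As a rough illustration of the three-stage pipeline the abstract describes (VAE landmark embedding, conditional recurrent generation, multi-mode diversification), the data flow could be sketched in NumPy. This is not the authors' implementation: all dimensions, weight matrices, and function names below are hypothetical stand-ins for trained networks.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Hypothetical dimensions (not from the paper) ---
N_LANDMARKS = 68          # standard 68-point facial landmarks
EMB_DIM = 16              # landmark embedding size
SEQ_LEN = 8               # frames per generated smile sequence
N_MODES = 3               # number of distinct smile variations

# Stage 1: a VAE-style encoder maps 2-D landmarks to one embedding.
# A single random linear map stands in for the trained encoder here.
W_enc = rng.standard_normal((EMB_DIM, N_LANDMARKS * 2)) * 0.1

def encode_landmarks(landmarks):
    """Flatten (68, 2) landmarks and project to the embedding space."""
    return W_enc @ landmarks.reshape(-1)

# Stage 2: a conditional recurrent generator unrolls the embedding
# into a sequence, conditioned on a smile class (one-hot).
W_rec = rng.standard_normal((EMB_DIM, EMB_DIM)) * 0.1
W_cond = rng.standard_normal((EMB_DIM, 2)) * 0.1   # e.g. posed vs. spontaneous

def generate_sequence(z0, smile_class, noise):
    """Unroll one landmark-embedding sequence for one mode."""
    cond = np.eye(2)[smile_class]
    z, seq = z0, []
    for _ in range(SEQ_LEN):
        z = np.tanh(W_rec @ z + W_cond @ cond + noise)
        seq.append(z)
    return np.stack(seq)

# Stage 3: the multi-mode generator perturbs each mode differently,
# yielding sequences of the same class that differ from each other.
neutral = rng.standard_normal((N_LANDMARKS, 2))
z0 = encode_landmarks(neutral)
modes = [generate_sequence(z0, smile_class=1,
                           noise=rng.standard_normal(EMB_DIM) * 0.05)
         for _ in range(N_MODES)]

print([m.shape for m in modes])          # N_MODES sequences of (SEQ_LEN, EMB_DIM)
print(np.allclose(modes[0], modes[1]))   # distinct modes -> False
```

In the actual CMM-Net each of these stages is a trained deep network and the final landmark sequences are further translated into face videos; the sketch only shows how one embedding fans out into several class-consistent but distinct sequences.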
Distinguishing Posed and Spontaneous Smiles by Facial Dynamics
A smile is one of the key elements in identifying the emotions and present state
of mind of an individual. In this work, we propose a family of approaches to
classify posed and spontaneous smiles using deep convolutional neural network
(CNN) face features, local phase quantization (LPQ), dense optical flow and
histogram of gradient (HOG). Eulerian Video Magnification (EVM) is used for
micro-expression smile amplification along with three normalization procedures
for distinguishing posed and spontaneous smiles. Although the deep CNN face
model is trained with a large number of face images, HOG features outperform
this model on the overall face smile classification task. Using EVM to amplify
micro-expressions did not have a significant impact on classification accuracy,
while normalizing facial features improved classification accuracy. Unlike
many manual or semi-automatic methodologies, our approach aims to automatically
classify all smiles into either `spontaneous' or `posed' categories, by using
support vector machines (SVM). Experimental results on the large UvA-NEMO smile
database are promising compared to other relevant methods.

Comment: 16 pages, 8 figures, ACCV 2016, Second Workshop on Spontaneous Facial
Behavior Analysi
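The HOG descriptor at the core of this pipeline reduces to per-cell histograms of gradient orientations, concatenated into one feature vector that an SVM would then classify. A minimal NumPy sketch (cell size, bin count, and image size are illustrative choices, not the paper's settings) looks like this:

```python
import numpy as np

def hog_cell_histogram(patch, n_bins=9):
    """Orientation histogram for one cell: the core of the HOG descriptor."""
    gy, gx = np.gradient(patch.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.degrees(np.arctan2(gy, gx)) % 180.0   # unsigned orientation
    bins = np.minimum((ang / (180.0 / n_bins)).astype(int), n_bins - 1)
    hist = np.zeros(n_bins)
    np.add.at(hist, bins.ravel(), mag.ravel())     # magnitude-weighted votes
    return hist

def hog_descriptor(image, cell=8, n_bins=9):
    """Concatenate per-cell histograms into one face descriptor."""
    h, w = image.shape
    feats = [hog_cell_histogram(image[i:i + cell, j:j + cell], n_bins)
             for i in range(0, h - cell + 1, cell)
             for j in range(0, w - cell + 1, cell)]
    return np.concatenate(feats)

# A 32x32 toy "face" crop gives a (32/8)^2 * 9 = 144-dim descriptor,
# which would then be fed to an SVM for posed/spontaneous classification.
rng = np.random.default_rng(1)
face = rng.random((32, 32))
desc = hog_descriptor(face)
print(desc.shape)   # (144,)
```

Production HOG implementations add block normalization and gradient interpolation between bins, which this sketch omits for brevity.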
Enriched Long-term Recurrent Convolutional Network for Facial Micro-Expression Recognition
Facial micro-expression (ME) recognition poses a major challenge to
researchers due to the subtlety of its motion and the limited databases. Recently,
handcrafted techniques have achieved superior performance in micro-expression
recognition but at the cost of domain specificity and cumbersome parametric
tunings. In this paper, we propose an Enriched Long-term Recurrent
Convolutional Network (ELRCN) that first encodes each micro-expression frame
into a feature vector through CNN module(s), then predicts the micro-expression
by passing the feature vector through a Long Short-term Memory (LSTM) module.
The framework contains two different network variants: (1) Channel-wise
stacking of input data for spatial enrichment, (2) Feature-wise stacking of
features for temporal enrichment. We demonstrate that the proposed approach is
able to achieve reasonably good performance, without data augmentation. In
addition, we also present ablation studies conducted on the framework and
visualizations of what the CNN "sees" when predicting micro-expression classes.

Comment: Published in the Micro-Expression Grand Challenge 2018, Workshop of the 13th
IEEE Facial & Gesture 201
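The CNN-then-LSTM structure described above (per-frame features, recurrent aggregation, classification from the final state) can be sketched end to end in NumPy. This is an untrained, toy-scale stand-in, not ELRCN itself: the random projection replacing the CNN module, and every dimension, are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

FEAT_DIM = 32      # per-frame CNN feature size (hypothetical)
HID_DIM = 16       # LSTM hidden size (hypothetical)
N_CLASSES = 5      # number of micro-expression classes (hypothetical)
SEQ_LEN = 10       # frames per micro-expression clip

# Stand-in for the CNN module: a fixed random projection of each frame.
W_cnn = rng.standard_normal((FEAT_DIM, 64)) * 0.1

def cnn_features(frame):
    """Encode one (8, 8) frame into a feature vector."""
    return np.tanh(W_cnn @ frame.ravel())

# Minimal LSTM cell: all four gates computed from [h, x] in one matrix.
Wg = rng.standard_normal((4 * HID_DIM, HID_DIM + FEAT_DIM)) * 0.1
W_out = rng.standard_normal((N_CLASSES, HID_DIM)) * 0.1

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_classify(frames):
    """Run CNN features through an LSTM, classify from the last state."""
    h = np.zeros(HID_DIM)
    c = np.zeros(HID_DIM)
    for frame in frames:
        x = cnn_features(frame)
        g = Wg @ np.concatenate([h, x])
        i, f, o = (sigmoid(g[k * HID_DIM:(k + 1) * HID_DIM]) for k in range(3))
        g_cand = np.tanh(g[3 * HID_DIM:])
        c = f * c + i * g_cand          # update cell state
        h = o * np.tanh(c)              # update hidden state
    logits = W_out @ h
    return int(np.argmax(logits))

clip = rng.random((SEQ_LEN, 8, 8))
print(lstm_classify(clip))   # a class index in [0, N_CLASSES)
```

ELRCN's two enrichment variants would change only the front end of this loop: channel-wise stacking widens each frame's input, while feature-wise stacking concatenates multiple feature vectors before the recurrent step.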