Search CORE

2,071 research outputs found

ICface: Interpretable and Controllable Face Reenactment Using GANs

Author: Kannala Juho
Rahtu Esa
Tripathy Soumya
Publication venue
Publication date: 17/01/2020
Field of study

This paper presents a generic face animator that is able to control the pose and expressions of a given face image. The animation is driven by human interpretable control signals consisting of head pose angles and the Action Unit (AU) values. The control information can be obtained from multiple sources including external driving videos and manual controls. Due to the interpretable nature of the driving signal, one can easily mix the information between multiple sources (e.g. pose from one image and expression from another) and apply selective post-production editing. The proposed face animator is implemented as a two-stage neural network model that is learned in a self-supervised manner using a large video collection. The proposed Interpretable and Controllable face reenactment network (ICface) is compared to the state-of-the-art neural network-based face animation techniques in multiple tasks. The results indicate that ICface produces better visual quality while being more versatile than most of the comparison methods. The introduced model could provide a lightweight and easy to use tool for a multitude of advanced image and video editing tasks.Comment: Accepted in WACV-202

arXiv.org e-Print Archive

Crossref

Trepo - Institutional Repository of Tampere University

Inner Space Preserving Generative Pose Machine

Author: A Dosovitskiy
A Newell
B Hariharan
C Farabet
D Anguelov
D Yoo
F Ning
G Larsson
GE Hinton
J Walker
LC Chen
M Bergtholdt
M Eitz
M Loper
MM Loper
O Ronneberger
PY Laffont
R Zhang
S Iizuka
V Badrinarayanan
V Jampani
X Yan
Y Shih
Y Yang
Publication venue
Publication date: 06/08/2018
Field of study

Image-based generative methods, such as generative adversarial networks (GANs) have already been able to generate realistic images with much context control, specially when they are conditioned. However, most successful frameworks share a common procedure which performs an image-to-image translation with pose of figures in the image untouched. When the objective is reposing a figure in an image while preserving the rest of the image, the state-of-the-art mainly assumes a single rigid body with simple background and limited pose shift, which can hardly be extended to the images under normal settings. In this paper, we introduce an image "inner space" preserving model that assigns an interpretable low-dimensional pose descriptor (LDPD) to an articulated figure in the image. Figure reposing is then generated by passing the LDPD and the original image through multi-stage augmented hourglass networks in a conditional GAN structure, called inner space preserving generative pose machine (ISP-GPM). We evaluated ISP-GPM on reposing human figures, which are highly articulated with versatile variations. Test of a state-of-the-art pose estimator on our reposed dataset gave an accuracy over 80% on PCK0.5 metric. The results also elucidated that our ISP-GPM is able to preserve the background with high accuracy while reasonably recovering the area blocked by the figure to be reposed.Comment: http://www.northeastern.edu/ostadabbas/2018/07/23/inner-space-preserving-generative-pose-machine

arXiv.org e-Print Archive

Crossref