    ICface: Interpretable and Controllable Face Reenactment Using GANs

    This paper presents a generic face animator that is able to control the pose and expressions of a given face image. The animation is driven by human-interpretable control signals consisting of head pose angles and Action Unit (AU) values. The control information can be obtained from multiple sources, including external driving videos and manual controls. Due to the interpretable nature of the driving signal, one can easily mix information between multiple sources (e.g. pose from one image and expression from another) and apply selective post-production editing. The proposed face animator is implemented as a two-stage neural network model that is learned in a self-supervised manner using a large video collection. The proposed Interpretable and Controllable face reenactment network (ICface) is compared to state-of-the-art neural network-based face animation techniques in multiple tasks. The results indicate that ICface produces better visual quality while being more versatile than most of the comparison methods. The introduced model could provide a lightweight and easy-to-use tool for a multitude of advanced image and video editing tasks. Comment: Accepted in WACV-2020
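
A minimal sketch of the interpretable driving signal, assuming PyTorch and hypothetical module names (the actual ICface model is a two-stage network trained adversarially on video data): the control vector is simply a concatenation of head-pose angles and AU intensities, which is what makes mixing sources and manual editing straightforward.

```python
# Illustrative sketch only; module names and dimensions are assumptions,
# not ICface's actual two-stage (neutralizer + animator) architecture.
import torch
import torch.nn as nn

NUM_AUS = 17   # assumption: 17 Action Unit intensities, as in common AU sets
POSE_DIMS = 3  # head pose: yaw, pitch, roll

class ConditionalAnimator(nn.Module):
    """Toy generator that reshapes a source-face embedding with a control vector."""
    def __init__(self, feat_dim=256):
        super().__init__()
        self.encode = nn.Sequential(nn.Linear(3 * 64 * 64, feat_dim), nn.ReLU())
        self.decode = nn.Sequential(
            nn.Linear(feat_dim + POSE_DIMS + NUM_AUS, 3 * 64 * 64), nn.Tanh()
        )

    def forward(self, src_img, pose, aus):
        feat = self.encode(src_img.flatten(1))
        control = torch.cat([pose, aus], dim=1)  # interpretable driving signal
        out = self.decode(torch.cat([feat, control], dim=1))
        return out.view(-1, 3, 64, 64)

# Mixing signals from two sources: pose from one driver, expression from another.
animator = ConditionalAnimator()
src = torch.rand(1, 3, 64, 64)
pose_from_a = torch.tensor([[0.1, -0.05, 0.0]])  # yaw, pitch, roll
aus_from_b = torch.rand(1, NUM_AUS)              # AU intensities in [0, 1]
reenacted = animator(src, pose_from_a, aus_from_b)
```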

    Neural Face Editing with Intrinsic Image Disentangling

    Traditional face editing methods often require a number of sophisticated, task-specific algorithms to be applied one after the other, a process that is tedious, fragile, and computationally intensive. In this paper, we propose an end-to-end generative adversarial network that infers a face-specific disentangled representation of intrinsic face properties, including shape (i.e. normals), albedo, and lighting, together with an alpha matte. We show that this network can be trained on "in-the-wild" images by incorporating an in-network physically-based image formation module and appropriate loss functions. Our disentangled latent representation allows for semantically relevant edits, where one aspect of facial appearance can be manipulated while keeping orthogonal properties fixed, and we demonstrate its use for a number of facial editing applications. Comment: CVPR 2017 oral
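
The image formation step can be illustrated with the standard Lambertian model under spherical-harmonics (SH) lighting: the image is the per-pixel product of albedo and a shading term computed from surface normals and nine SH lighting coefficients. The NumPy sketch below is illustrative (SH normalization constants and basis ordering are simplified), not the paper's implementation.

```python
# Sketch of physically-based image formation: a Lambertian face image as
# albedo times shading, with shading from 9 spherical-harmonics lighting
# coefficients evaluated at per-pixel unit normals.
import numpy as np

def sh_basis(normals):
    """First 9 SH basis functions (constants omitted), (..., 3) -> (..., 9)."""
    nx, ny, nz = normals[..., 0], normals[..., 1], normals[..., 2]
    one = np.ones_like(nx)
    return np.stack([
        one, nx, ny, nz,
        nx * ny, nx * nz, ny * nz,
        nx**2 - ny**2, 3.0 * nz**2 - 1.0,
    ], axis=-1)

def render(albedo, normals, lighting):
    """albedo: (H, W, 3), normals: (H, W, 3) unit vectors, lighting: (9,)."""
    shading = sh_basis(normals) @ lighting   # (H, W) scalar shading field
    return albedo * shading[..., None]       # per-pixel product

# A flat-lit gray patch: constant normals pointing at the camera.
H = W = 8
albedo = np.full((H, W, 3), 0.5)
normals = np.zeros((H, W, 3)); normals[..., 2] = 1.0
lighting = np.zeros(9); lighting[0] = 1.0    # ambient term only
img = render(albedo, normals, lighting)
```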

    Bidirectional Conditional Generative Adversarial Networks

    Conditional Generative Adversarial Networks (cGANs) are generative models that can produce data samples (x) conditioned on both latent variables (z) and known auxiliary information (c). We propose the Bidirectional cGAN (BiCoGAN), which effectively disentangles z and c in the generation process and provides an encoder that learns inverse mappings from x to both z and c, trained jointly with the generator and the discriminator. We present crucial techniques for training BiCoGANs, which involve an extrinsic factor loss along with an associated dynamically-tuned importance weight. Compared to other encoder-based cGANs, BiCoGANs encode c more accurately, and utilize z and c more effectively and in a more disentangled way to generate samples. Comment: To appear in Proceedings of ACCV 2018
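
A rough sketch of the extrinsic factor loss idea, assuming PyTorch: the encoder's estimate of the known condition c is penalized against the ground truth, with an importance weight adjusted over training. The linear ramp schedule and all names below are illustrative assumptions, not the paper's exact formulation.

```python
# Sketch of an extrinsic factor (EF) loss with a dynamically tuned weight.
# The linear ramp is an assumed schedule, not BiCoGAN's published one.
import torch
import torch.nn.functional as F

def extrinsic_factor_loss(c_pred, c_true, step, ramp_steps=10_000, max_weight=10.0):
    weight = max_weight * min(1.0, step / ramp_steps)  # weight grows over training
    return weight * F.mse_loss(c_pred, c_true)

# Example: encoder output vs. the true one-hot class condition.
c_true = F.one_hot(torch.tensor([3]), num_classes=10).float()
c_pred = torch.softmax(torch.randn(1, 10), dim=1)
loss = extrinsic_factor_loss(c_pred, c_true, step=2_000)
```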

    Geometry-Aware Face Completion and Editing

    Face completion is a challenging generation task because it requires generating visually pleasing new pixels that are semantically consistent with the unmasked face region. This paper proposes a geometry-aware Face Completion and Editing NETwork (FCENet) that systematically exploits facial geometry from the unmasked region. First, a facial geometry estimator is learned to estimate facial landmark heatmaps and parsing maps from the unmasked face image. Then, an encoder-decoder generator completes the face image and disentangles its mask areas, conditioned on both the masked face image and the estimated facial geometry images. In addition, since a low-rank property exists in manually labeled masks, a low-rank regularization term is imposed on the disentangled masks, encouraging the completion network to handle occlusion areas of various shapes and sizes. Furthermore, the network can generate diverse results from the same masked input by modifying the estimated facial geometry, which provides a flexible means to edit the completed face appearance. Extensive experimental results qualitatively and quantitatively demonstrate that our network generates visually pleasing face completion results and can edit face attributes as well.
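
The low-rank regularization can be sketched with the nuclear norm (sum of singular values), the usual convex surrogate for matrix rank; whether FCENet uses exactly this surrogate and weighting is an assumption here, shown in PyTorch below.

```python
# Sketch of a low-rank regularizer on a predicted occlusion mask via the
# nuclear norm; the weighting and exact form are assumptions, not the
# paper's published term.
import torch

def nuclear_norm(mask):
    """mask: (H, W) soft occlusion mask in [0, 1]."""
    return torch.linalg.svdvals(mask).sum()

mask = torch.rand(64, 64, requires_grad=True)
reg = 1e-3 * nuclear_norm(mask)  # added to the completion network's loss
reg.backward()                   # gradients flow back to the mask predictor
```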