    ToonTalker: Cross-Domain Face Reenactment

    Full text link
    We target cross-domain face reenactment in this paper, i.e., driving a cartoon image with the video of a real person and vice versa. Recently, many works have focused on one-shot talking face generation to drive a portrait with a real video, i.e., within-domain reenactment. Directly applying those methods to cross-domain animation causes inaccurate expression transfer, blurring, and even apparent artifacts due to the domain shift between cartoon and real faces. Only a few works attempt to address cross-domain face reenactment. The most related work, AnimeCeleb, requires constructing a dataset of pose-vector and cartoon-image pairs by animating 3D characters, which makes it inapplicable when no paired data is available. In this paper, we propose a novel method for cross-domain reenactment without paired data. Specifically, we propose a transformer-based framework to align the motions from different domains into a common latent space where motion transfer is conducted via latent code addition. Two domain-specific motion encoders and two learnable motion base memories are used to capture domain properties. A source query transformer and a driving one are used to project domain-specific motion to the canonical space. The edited motion is projected back to the domain of the source with a transformer. Moreover, since no paired data is provided, we propose a novel cross-domain training scheme using data from the two domains with the designed analogy constraint. In addition, we contribute a cartoon dataset in the Disney style. Extensive evaluations demonstrate the superiority of our method over competing methods.
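
    To make the transfer mechanism described above concrete, the following is a minimal, illustrative sketch (not the authors' code): domain-specific encoders project motion features into a shared canonical space via cross-attention over learnable motion bases, transfer is performed by latent code addition, and a decoder maps the edited motion back to the source domain. All module names, dimensions, and the use of torch.nn.MultiheadAttention are assumptions made for this example.

    import torch
    import torch.nn as nn

    class DomainMotionBranch(nn.Module):
        """One domain (e.g. cartoon or real): motion encoder, learnable motion base
        memory, and cross-attention blocks to and from the shared canonical space."""
        def __init__(self, feat_dim=256, n_bases=20):
            super().__init__()
            self.encoder = nn.Sequential(nn.Linear(feat_dim, feat_dim), nn.ReLU(),
                                         nn.Linear(feat_dim, feat_dim))
            # learnable motion base memory capturing this domain's properties
            self.bases = nn.Parameter(torch.randn(n_bases, feat_dim) * 0.02)
            # "query transformer": attend from the domain bases to the encoded motion
            self.to_canonical = nn.MultiheadAttention(feat_dim, num_heads=4, batch_first=True)
            # decoder: attend from domain features back to the edited canonical motion
            self.from_canonical = nn.MultiheadAttention(feat_dim, num_heads=4, batch_first=True)

        def encode(self, motion_feat):
            # motion_feat: (B, T, feat_dim) domain-specific motion features
            x = self.encoder(motion_feat)
            q = self.bases.unsqueeze(0).expand(x.size(0), -1, -1)
            canonical, _ = self.to_canonical(q, x, x)
            return canonical                        # (B, n_bases, feat_dim)

        def decode(self, canonical, motion_feat):
            x = self.encoder(motion_feat)
            out, _ = self.from_canonical(x, canonical, canonical)
            return out                              # back in this domain's feature space

    def cross_domain_transfer(src_branch, drv_branch, src_motion, drv_motion):
        """Transfer driving motion onto the source by adding canonical latents."""
        edited = src_branch.encode(src_motion) + drv_branch.encode(drv_motion)
        return src_branch.decode(edited, src_motion)

    if __name__ == "__main__":
        cartoon, real = DomainMotionBranch(), DomainMotionBranch()
        src = torch.randn(2, 8, 256)   # e.g. cartoon motion features over 8 frames
        drv = torch.randn(2, 8, 256)   # e.g. real-video motion features
        print(cross_domain_transfer(cartoon, real, src, drv).shape)  # torch.Size([2, 8, 256])

    In the full pipeline the motion features would come from image encoders and the decoded motion would drive an image generator; here random tensors stand in for both ends.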

    HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces

    Full text link
    In this paper, we present our method for neural face reenactment, called HyperReenact, that aims to generate realistic talking head images of a source identity, driven by a target facial pose. Existing state-of-the-art face reenactment methods train controllable generative models that learn to synthesize realistic facial images, yet they either produce reenacted faces that are prone to significant visual artifacts, especially under the challenging condition of extreme head pose changes, or require expensive few-shot fine-tuning to better preserve the source identity characteristics. We propose to address these limitations by leveraging the photorealistic generation ability and the disentangled properties of a pretrained StyleGAN2 generator: we first invert the real images into its latent space and then use a hypernetwork to perform (i) refinement of the source identity characteristics and (ii) facial pose re-targeting, thereby eliminating the dependence on external editing methods that typically produce artifacts. Our method operates under the one-shot setting (i.e., using a single source frame) and allows for cross-subject reenactment, without requiring any subject-specific fine-tuning. We compare our method both quantitatively and qualitatively against several state-of-the-art techniques on the standard benchmarks of VoxCeleb1 and VoxCeleb2, demonstrating the superiority of our approach in producing artifact-free images and exhibiting remarkable robustness even under extreme head pose changes. We make the code and the pretrained models publicly available at: https://github.com/StelaBou/HyperReenact. Comment: Accepted for publication in ICCV 2023. Project page: https://stelabou.github.io/hyperreenact.github.io/ Code: https://github.com/StelaBou/HyperReenact
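
    The two hypernetwork roles described above can be illustrated with a minimal sketch (not the authors' released code): an encoder inverts the source frame into a latent code, a hypernetwork consumes source-identity and target-pose features and predicts per-layer weight offsets for a frozen generator, and the offset generator decodes the reenacted face. The tiny MLP generator stands in for the pretrained StyleGAN2, and every name and dimension below is an assumption made for the example.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    LATENT, FEAT, IMG = 64, 64, 16 * 16 * 3   # toy sizes, far smaller than StyleGAN2

    class ToyGenerator(nn.Module):
        """Stand-in for the pretrained, frozen generator; its layers receive offsets."""
        def __init__(self):
            super().__init__()
            self.layers = nn.ModuleList([nn.Linear(LATENT, LATENT), nn.Linear(LATENT, IMG)])

        def forward(self, w, deltas):
            x = w
            for layer, dW in zip(self.layers, deltas):
                # apply the predicted weight offset without modifying the frozen weights
                x = torch.relu(F.linear(x, layer.weight + dW, layer.bias))
            return x

    class HyperNetwork(nn.Module):
        """Maps (source identity features, target pose features) to per-layer offsets."""
        def __init__(self, generator):
            super().__init__()
            self.shapes = [l.weight.shape for l in generator.layers]
            self.heads = nn.ModuleList(
                [nn.Linear(2 * FEAT, s[0] * s[1]) for s in self.shapes])

        def forward(self, id_feat, pose_feat):
            # a single sample is assumed here to keep the offset shapes simple
            h = torch.cat([id_feat, pose_feat], dim=-1)
            return [head(h).view(*s) for head, s in zip(self.heads, self.shapes)]

    # stand-ins for the inversion encoder and the identity / pose feature extractors
    invert = nn.Linear(IMG, LATENT)    # image -> latent code ("GAN inversion")
    id_net = nn.Linear(IMG, FEAT)      # source identity features
    pose_net = nn.Linear(IMG, FEAT)    # target head-pose / expression features

    G = ToyGenerator()
    H = HyperNetwork(G)

    source, target = torch.randn(1, IMG), torch.randn(1, IMG)
    w = invert(source)                              # (i) invert the source frame
    deltas = H(id_net(source), pose_net(target))    # predict per-layer weight offsets
    reenacted = G(w, deltas)                        # (ii) decode the retargeted face
    print(reenacted.shape)                          # torch.Size([1, 768])

    In the paper's setting the generator is a pretrained StyleGAN2; the sketch only illustrates the division of labour between inversion, hypernetwork, and generator.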

    EgoFace: Egocentric Face Performance Capture and Videorealistic Reenactment

    No full text
    Face performance capture and reenactment techniques use multiple cameras and sensors, positioned at a distance from the face or mounted on heavy wearable devices. This limits their applications in mobile and outdoor environments. We present EgoFace, a radically new lightweight setup for face performance capture and front-view videorealistic reenactment using a single egocentric RGB camera. Our lightweight setup allows operation in uncontrolled environments and lends itself to telepresence applications such as video-conferencing from dynamic environments. The input image is projected into a low-dimensional latent space of facial expression parameters. Through careful adversarial training of the parameter-space synthetic rendering, a videorealistic animation is produced. Our problem is challenging as the human visual system is sensitive to the smallest face irregularities that could occur in the final results; this sensitivity is even stronger for video results. Our solution is trained in a pre-processing stage, in a supervised manner and without manual annotations. EgoFace captures a wide variety of facial expressions, including mouth movements and asymmetrical expressions. It works under varying illumination, backgrounds, and movements, handles people of different ethnicities, and can operate in real time.
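
    As a rough illustration of the pipeline described above (a minimal sketch, not the EgoFace implementation): a small CNN regresses low-dimensional expression parameters from an egocentric RGB frame, and a parameter-conditioned renderer is paired with a discriminator so it can be trained adversarially to produce front-view frames. The 64x64 resolution, the 64-dimensional parameter space, and all layer sizes are assumptions made for the example.

    import torch
    import torch.nn as nn

    N_PARAMS = 64  # low-dimensional facial expression parameters

    class EgoEncoder(nn.Module):
        """Egocentric frame -> expression parameters."""
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.ReLU(),   # 64 -> 32
                nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),  # 32 -> 16
                nn.Flatten(), nn.Linear(64 * 16 * 16, N_PARAMS))

        def forward(self, x):
            return self.net(x)

    class Renderer(nn.Module):
        """Expression parameters -> front-view face frame (the 'generator')."""
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(N_PARAMS, 64 * 16 * 16), nn.ReLU(),
                nn.Unflatten(1, (64, 16, 16)),
                nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),  # 16 -> 32
                nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Tanh())   # 32 -> 64

        def forward(self, p):
            return self.net(p)

    class Discriminator(nn.Module):
        """Judges rendered vs. real front-view frames (adversarial training signal)."""
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
                nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
                nn.Flatten(), nn.Linear(64 * 16 * 16, 1))

        def forward(self, x):
            return self.net(x)

    if __name__ == "__main__":
        ego_frame = torch.randn(2, 3, 64, 64)          # batch of egocentric input frames
        params = EgoEncoder()(ego_frame)               # (2, 64) expression parameters
        frame = Renderer()(params)                     # (2, 3, 64, 64) rendered front view
        score = Discriminator()(frame)                 # (2, 1) real/fake logit
        print(params.shape, frame.shape, score.shape)

    A training loop and the associated losses are omitted; the snippet only checks that the three pieces connect shape-wise.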

    Stuff White People Like #1863

    Full text link
    There I sat: sun burning my neck, sweat pouring down my face, watching grown men play at death. I’d been meaning for years to get to Gettysburg to see the reenactment, and this past July, I was lucky enough to be there for the 150th anniversary of the battle. And so there I was, sitting in a grandstand in the middle of a farm in rural Pennsylvania, surrounded by fellow white people, watching a Confederate soldier get shot in the back for pretending to desert in the face of the Union cavalry. He flopped to the ground in front of the grandstand; the announcer gave us paying customers a resounding play-by-play. “They love doing that,” my wife said in my ear, “Very dramatic.” [excerpt]

    Re-enacting Early Video Art as a Research Tool for Media Art Histories

    Get PDF
    This paper will discuss re-enactment as a relevant tool for practice-based research to investigate pioneering video performances and video artworks from the 1970s and 1980s from a theoretical, art-historical and curatorial point of view. Since the early 2000s, the re-enactment of artists’ performance has been growing as an art practice internationally and has been investigated in several studies and exhibitions. In this paper, I will propose that the re-enactment of early video artworks can open up critical analysis on the original work—its nature, form and content—as well as on collective and personal memory and mediation. Re-enactment becomes a research tool that investigates the nature of video which was at the time a relatively new medium. Re-enactment informs the research into the original piece, its documentation, the relationships between the artist and the body, the work and the viewer. It investigates the effects of analogue video on the viewer and the artist in comparison with the digital video employed in the re-enactment and its documentation. The paper will analyse case studies from the research projects REWIND, REWINDItalia and EWVA (European Women’s Video Art in the 70s and 80s).