Search CORE

264 research outputs found

Automatic 3D facial modelling with deformable models.

Author: Xiang Guofu
Publication venue
Publication date: 31/12/2012
Field of study

Facial modelling and animation has been an active research subject in computer graphics since the 1970s. Due to extremely complex biomechanical structures of human faces and peoples visual familiarity with human faces, modelling and animating realistic human faces is still one of greatest challenges in computer graphics. Since we are so familiar with human faces and very sensitive to unnatural subtle changes in human faces, it usually requires a tremendous amount of artistry and manual work to create a convincing facial model and animation. There is a clear need of developing automatic techniques for facial modelling in order to reduce manual labouring. In order to obtain a realistic facial model of an individual, it is now common to make use of 3D scanners to capture range scans from the individual and then fit a template to the range scans. However, most existing template-fitting methods require manually selected landmarks to warp the template to the range scans. It would be tedious to select landmarks by hand over a large set of range scans. Another way to reduce repeated work is synthesis by reusing existing data. One example is expression cloning, which copies facial expression from one face to another instead of creating them from scratch. This aim of this study is to develop a fully automatic framework for template-based facial modelling, facial expression transferring and facial expression tracking from range scans. In this thesis, the author developed an extension of the iterative closest points (ICP) algorithm, which is able to match a template with range scans in different scales, and a deformable model, which can be used to recover the shapes of range scans and to establish correspondences between facial models. With the registration method and the deformable model, the author proposed a fully automatic approach to reconstructing facial models and textures from range scans without re-quiring any manual interventions. In order to reuse existing data for facial modelling, the author formulated and solved the problem of facial expression transferring in the framework of discrete differential geometry. The author also applied his methods to face tracking for 4D range scans. The results demonstrated the robustness of the registration method and the capabilities of the deformable model. A number of possible directions for future work were pointed out

Open Access Institutional Repository at Robert Gordon University

FDLS: A Deep Learning Approach to Production Quality, Controllable, and Retargetable Facial Performances

Author: Choi Byungkuk
Eom Haekwang
Ghifary Muhammad
Lewis J. P.
Ma Wan-Duo Kurt
Publication venue
Publication date: 26/09/2023
Field of study

Visual effects commonly requires both the creation of realistic synthetic humans as well as retargeting actors' performances to humanoid characters such as aliens and monsters. Achieving the expressive performances demanded in entertainment requires manipulating complex models with hundreds of parameters. Full creative control requires the freedom to make edits at any stage of the production, which prohibits the use of a fully automatic ``black box'' solution with uninterpretable parameters. On the other hand, producing realistic animation with these sophisticated models is difficult and laborious. This paper describes FDLS (Facial Deep Learning Solver), which is Weta Digital's solution to these challenges. FDLS adopts a coarse-to-fine and human-in-the-loop strategy, allowing a solved performance to be verified and edited at several stages in the solving process. To train FDLS, we first transform the raw motion-captured data into robust graph features. Secondly, based on the observation that the artists typically finalize the jaw pass animation before proceeding to finer detail, we solve for the jaw motion first and predict fine expressions with region-based networks conditioned on the jaw position. Finally, artists can optionally invoke a non-linear finetuning process on top of the FDLS solution to follow the motion-captured virtual markers as closely as possible. FDLS supports editing if needed to improve the results of the deep learning solution and it can handle small daily changes in the actor's face shape. FDLS permits reliable and production-quality performance solving with minimal training and little or no manual effort in many cases, while also allowing the solve to be guided and edited in unusual and difficult cases. The system has been under development for several years and has been used in major movies.Comment: DigiPro '22: The Digital Production Symposiu

arXiv.org e-Print Archive

Speech-driven Animation with Meaningful Behaviors

Author: Busso Carlos
Sadoughi Najmeh
Publication venue
Publication date: 04/08/2017
Field of study

Conversational agents (CAs) play an important role in human computer interaction. Creating believable movements for CAs is challenging, since the movements have to be meaningful and natural, reflecting the coupling between gestures and speech. Studies in the past have mainly relied on rule-based or data-driven approaches. Rule-based methods focus on creating meaningful behaviors conveying the underlying message, but the gestures cannot be easily synchronized with speech. Data-driven approaches, especially speech-driven models, can capture the relationship between speech and gestures. However, they create behaviors disregarding the meaning of the message. This study proposes to bridge the gap between these two approaches overcoming their limitations. The approach builds a dynamic Bayesian network (DBN), where a discrete variable is added to constrain the behaviors on the underlying constraint. The study implements and evaluates the approach with two constraints: discourse functions and prototypical behaviors. By constraining on the discourse functions (e.g., questions), the model learns the characteristic behaviors associated with a given discourse class learning the rules from the data. By constraining on prototypical behaviors (e.g., head nods), the approach can be embedded in a rule-based system as a behavior realizer creating trajectories that are timely synchronized with speech. The study proposes a DBN structure and a training approach that (1) models the cause-effect relationship between the constraint and the gestures, (2) initializes the state configuration models increasing the range of the generated behaviors, and (3) captures the differences in the behaviors across constraints by enforcing sparse transitions between shared and exclusive states per constraint. Objective and subjective evaluations demonstrate the benefits of the proposed approach over an unconstrained model.Comment: 13 pages, 12 figures, 5 table

arXiv.org e-Print Archive

From Capture to Display: A Survey on Volumetric Video

Author: Hu Kaiyuan
Jin Yili
Liu Junhua
Liu Xue
Wang Fangxin
Publication venue
Publication date: 11/09/2023
Field of study

Volumetric video, which offers immersive viewing experiences, is gaining increasing prominence. With its six degrees of freedom, it provides viewers with greater immersion and interactivity compared to traditional videos. Despite their potential, volumetric video services poses significant challenges. This survey conducts a comprehensive review of the existing literature on volumetric video. We firstly provide a general framework of volumetric video services, followed by a discussion on prerequisites for volumetric video, encompassing representations, open datasets, and quality assessment metrics. Then we delve into the current methodologies for each stage of the volumetric video service pipeline, detailing capturing, compression, transmission, rendering, and display techniques. Lastly, we explore various applications enabled by this pioneering technology and we present an array of research challenges and opportunities in the domain of volumetric video services. This survey aspires to provide a holistic understanding of this burgeoning field and shed light on potential future research trajectories, aiming to bring the vision of volumetric video to fruition.Comment: Submitte

arXiv.org e-Print Archive

Reconstruction of machine-made shapes from bitmap sketches

Author: Bessmeltsev Mikhail
Kry Paul G.
Martens Cedric
Puhachov Ivan
Publication venue: Association for computing machinery
Publication date: 01/12/2023
Field of study

We propose a method of reconstructing 3D machine-made shapes from bitmap sketches by separating an input image into individual patches and jointly optimizing their geometry. We rely on two main observations: (1) human observers interpret sketches of man-made shapes as a collection of simple geometric primitives, and (2) sketch strokes often indicate occlusion contours or sharp ridges between those primitives. Using these main observations we design a system that takes a single bitmap image of a shape, estimates image depth and segmentation into primitives with neural networks, then fits primitives to the predicted depth while determining occlusion contours and aligning intersections with the input drawing via optimization. Unlike previous work, our approach does not require additional input, annotation, or templates, and does not require retraining for a new category of man-made shapes. Our method produces triangular meshes that display sharp geometric features and are suitable for downstream applications, such as editing, rendering, and shading

Dépôt Institutionnel Numérique

Estimating Outdoor Illumination Conditions Based on Detection of Dynamic Shadows

Author: Lal Brajesh Behari
Madsen Claus B.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2013
Field of study

VBN

Seventh Biennial Report : June 2003 - March 2005

Author
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/2005
Field of study

MPG.PuRe

Fifth Biennial Report : June 1999 - August 2001

Author
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/2001
Field of study

MPG.PuRe

Morphing arquitectónico: transformaciones entre las casas usonianas de Frank Lloyd Wright

Author: Herrera Velazco Rodrigo
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/01/2012
Field of study

Esta tesis investiga sobre el proceso de transformación de la forma arquitectónica, analizando una técnica específica denominada morphing. La técnica del morphing se utiliza en los gráficos por ordenador para la transformación de la forma entre dos o más objetos dados. Desde un punto de vista técnico, se revisan y actualizan las metodologías y aplicaciones existentes, sus características específicas y sus incidencias sobre la arquitectura. Desde un punto de vista práctico, se utilizan una serie de modelos de las casas Usonianas de Frank Lloyd Wright, con el fin de experimentar la técnica y ver qué utilidades se pueden obtener a partir de su lógica de diseño. Como resultado de este análisis se obtiene una metodología genérica para el procedimiento de un morphing arquitectónico.This thesis investigates the transformation of architectural form, analyzing a specific technique called morphing. Morphing is a technique used in computer graphics to transform a form between two or more given objects. From a technical point of view, the existing techniques are reviewed and updated, as well as their specific characteristics and impact on architecture. From a practical point of view, some models of Usonian houses of Frank Lloyd Wright are used to experience the technique and see which utilities are available from his design logic. As a result of this analysis a generic methodology for the process of architectural morphing is obtained.Postprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Tesis Doctorals en Xarxa

Secretaría de Estado de Cultura