356 research outputs found
High-Accuracy Facial Depth Models derived from 3D Synthetic Data
In this paper, we explore how synthetically generated 3D face models can be
used to construct a high accuracy ground truth for depth. This allows us to
train the Convolutional Neural Networks (CNN) to solve facial depth estimation
problems. These models provide sophisticated controls over image variations
including pose, illumination, facial expressions and camera position. 2D
training samples can be rendered from these models, typically in RGB format,
together with depth information. Using synthetic facial animations, a dynamic
facial expression or facial action data can be rendered for a sequence of image
frames together with ground truth depth and additional metadata such as head
pose, light direction, etc. The synthetic data is used to train a CNN based
facial depth estimation system which is validated on both synthetic and real
images. Potential fields of application include 3D reconstruction, driver
monitoring systems, robotic vision systems, and advanced scene understanding
AI-Generated Images as Data Source: The Dawn of Synthetic Era
The advancement of visual intelligence is intrinsically tethered to the
availability of large-scale data. In parallel, generative Artificial
Intelligence (AI) has unlocked the potential to create synthetic images that
closely resemble real-world photographs. This prompts a compelling inquiry: how
much visual intelligence could benefit from the advance of generative AI? This
paper explores the innovative concept of harnessing these AI-generated images
as new data sources, reshaping traditional modeling paradigms in visual
intelligence. In contrast to real data, AI-generated data exhibit remarkable
advantages, including unmatched abundance and scalability, the rapid generation
of vast datasets, and the effortless simulation of edge cases. Built on the
success of generative AI models, we examine the potential of their generated
data in a range of applications, from training machine learning models to
simulating scenarios for computational modeling, testing, and validation. We
probe the technological foundations that support this groundbreaking use of
generative AI, engaging in an in-depth discussion on the ethical, legal, and
practical considerations that accompany this transformative paradigm shift.
Through an exhaustive survey of current technologies and applications, this
paper presents a comprehensive view of the synthetic era in visual
intelligence. A project associated with this paper can be found at
https://github.com/mwxely/AIGS .Comment: 20 pages, 11 figure
- …