4,455 research outputs found
Learning to Dress {3D} People in Generative Clothing
Three-dimensional human body models are widely used in the analysis of human
pose and motion. Existing models, however, are learned from minimally-clothed
3D scans and thus do not generalize to the complexity of dressed people in
common images and videos. Additionally, current models lack the expressive
power needed to represent the complex non-linear geometry of pose-dependent
clothing shapes. To address this, we learn a generative 3D mesh model of
clothed people from 3D scans with varying pose and clothing. Specifically, we
train a conditional Mesh-VAE-GAN to learn the clothing deformation from the
SMPL body model, making clothing an additional term in SMPL. Our model is
conditioned on both pose and clothing type, giving the ability to draw samples
of clothing to dress different body shapes in a variety of styles and poses. To
preserve wrinkle detail, our Mesh-VAE-GAN extends patchwise discriminators to
3D meshes. Our model, named CAPE, represents global shape and fine local
structure, effectively extending the SMPL body model to clothing. To our
knowledge, this is the first generative model that directly dresses 3D human
body meshes and generalizes to different poses. The model, code and data are
available for research purposes at https://cape.is.tue.mpg.de.Comment: CVPR-2020 camera ready. Code and data are available at
https://cape.is.tue.mpg.d
Multiresolution Approaches for Edge Detection and Classification Based on Discrete Wavelet Transform
Fast Landmark Localization with 3D Component Reconstruction and CNN for Cross-Pose Recognition
Two approaches are proposed for cross-pose face recognition, one is based on
the 3D reconstruction of facial components and the other is based on the deep
Convolutional Neural Network (CNN). Unlike most 3D approaches that consider
holistic faces, the proposed approach considers 3D facial components. It
segments a 2D gallery face into components, reconstructs the 3D surface for
each component, and recognizes a probe face by component features. The
segmentation is based on the landmarks located by a hierarchical algorithm that
combines the Faster R-CNN for face detection and the Reduced Tree Structured
Model for landmark localization. The core part of the CNN-based approach is a
revised VGG network. We study the performances with different settings on the
training set, including the synthesized data from 3D reconstruction, the
real-life data from an in-the-wild database, and both types of data combined.
We investigate the performances of the network when it is employed as a
classifier or designed as a feature extractor. The two recognition approaches
and the fast landmark localization are evaluated in extensive experiments, and
compared to stateof-the-art methods to demonstrate their efficacy.Comment: 14 pages, 12 figures, 4 table
Segmentation of Photovoltaic Module Cells in Electroluminescence Images
High resolution electroluminescence (EL) images captured in the infrared
spectrum allow to visually and non-destructively inspect the quality of
photovoltaic (PV) modules. Currently, however, such a visual inspection
requires trained experts to discern different kinds of defects, which is
time-consuming and expensive. Automated segmentation of cells is therefore a
key step in automating the visual inspection workflow. In this work, we propose
a robust automated segmentation method for extraction of individual solar cells
from EL images of PV modules. This enables controlled studies on large amounts
of data to understanding the effects of module degradation over time-a process
not yet fully understood. The proposed method infers in several steps a
high-level solar module representation from low-level edge features. An
important step in the algorithm is to formulate the segmentation problem in
terms of lens calibration by exploiting the plumbline constraint. We evaluate
our method on a dataset of various solar modules types containing a total of
408 solar cells with various defects. Our method robustly solves this task with
a median weighted Jaccard index of 94.47% and an score of 97.54%, both
indicating a very high similarity between automatically segmented and ground
truth solar cell masks
- …