91,096 research outputs found
Object recognition using shape-from-shading
This paper investigates whether surface topography information extracted from intensity images using a recently reported shape-from-shading (SFS) algorithm can be used for the purposes of 3D object recognition. We consider how curvature and shape-index information delivered by this algorithm can be used to recognize objects based on their surface topography. We explore two contrasting object recognition strategies. The first of these is based on a low-level attribute summary and uses histograms of curvature and orientation measurements. The second approach is based on the structural arrangement of constant shape-index maximal patches and their associated region attributes. We show that region curvedness and a string ordering of the regions according to size provides recognition accuracy of about 96 percent. By polling various recognition schemes. including a graph matching method. we show that a recognition rate of 98-99 percent is achievable
CNN-based Real-time Dense Face Reconstruction with Inverse-rendered Photo-realistic Face Images
With the powerfulness of convolution neural networks (CNN), CNN based face
reconstruction has recently shown promising performance in reconstructing
detailed face shape from 2D face images. The success of CNN-based methods
relies on a large number of labeled data. The state-of-the-art synthesizes such
data using a coarse morphable face model, which however has difficulty to
generate detailed photo-realistic images of faces (with wrinkles). This paper
presents a novel face data generation method. Specifically, we render a large
number of photo-realistic face images with different attributes based on
inverse rendering. Furthermore, we construct a fine-detailed face image dataset
by transferring different scales of details from one image to another. We also
construct a large number of video-type adjacent frame pairs by simulating the
distribution of real video data. With these nicely constructed datasets, we
propose a coarse-to-fine learning framework consisting of three convolutional
networks. The networks are trained for real-time detailed 3D face
reconstruction from monocular video as well as from a single image. Extensive
experimental results demonstrate that our framework can produce high-quality
reconstruction but with much less computation time compared to the
state-of-the-art. Moreover, our method is robust to pose, expression and
lighting due to the diversity of data.Comment: Accepted by IEEE Transactions on Pattern Analysis and Machine
Intelligence, 201
Redefining A in RGBA: Towards a Standard for Graphical 3D Printing
Advances in multimaterial 3D printing have the potential to reproduce various
visual appearance attributes of an object in addition to its shape. Since many
existing 3D file formats encode color and translucency by RGBA textures mapped
to 3D shapes, RGBA information is particularly important for practical
applications. In contrast to color (encoded by RGB), which is specified by the
object's reflectance, selected viewing conditions and a standard observer,
translucency (encoded by A) is neither linked to any measurable physical nor
perceptual quantity. Thus, reproducing translucency encoded by A is open for
interpretation.
In this paper, we propose a rigorous definition for A suitable for use in
graphical 3D printing, which is independent of the 3D printing hardware and
software, and which links both optical material properties and perceptual
uniformity for human observers. By deriving our definition from the absorption
and scattering coefficients of virtual homogeneous reference materials with an
isotropic phase function, we achieve two important properties. First, a simple
adjustment of A is possible, which preserves the translucency appearance if an
object is re-scaled for printing. Second, determining the value of A for a real
(potentially non-homogeneous) material, can be achieved by minimizing a
distance function between light transport measurements of this material and
simulated measurements of the reference materials. Such measurements can be
conducted by commercial spectrophotometers used in graphic arts.
Finally, we conduct visual experiments employing the method of constant
stimuli, and derive from them an embedding of A into a nearly perceptually
uniform scale of translucency for the reference materials.Comment: 20 pages (incl. appendices), 20 figures. Version with higher quality
images: https://cloud-ext.igd.fraunhofer.de/s/pAMH67XjstaNcrF (main article)
and https://cloud-ext.igd.fraunhofer.de/s/4rR5bH3FMfNsS5q (appendix).
Supplemental material including code:
https://cloud-ext.igd.fraunhofer.de/s/9BrZaj5Uh5d0cOU/downloa
A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction
Single-image 3D reconstruction is a research challenge focused on predicting
3D object shapes from single-view images. This task requires significant data
acquisition to predict both visible and occluded portions of the shape.
Furthermore, learning-based methods face the difficulty of creating a
comprehensive training dataset for all possible classes. To this end, we
propose a continual learning-based 3D reconstruction method where our goal is
to design a model using Variational Priors that can still reconstruct the
previously seen classes reasonably even after training on new classes.
Variational Priors represent abstract shapes and combat forgetting, whereas
saliency maps preserve object attributes with less memory usage. This is vital
due to resource constraints in storing extensive training data. Additionally,
we introduce saliency map-based experience replay to capture global and
distinct object features. Thorough experiments show competitive results
compared to established methods, both quantitatively and qualitatively.Comment: 15 page
Texture analysis using volume-radius fractal dimension
Texture plays an important role in computer vision. It is one of the most
important visual attributes used in image analysis, once it provides
information about pixel organization at different regions of the image. This
paper presents a novel approach for texture characterization, based on
complexity analysis. The proposed approach expands the idea of the Mass-radius
fractal dimension, a method originally developed for shape analysis, to a set
of coordinates in 3D-space that represents the texture under analysis in a
signature able to characterize efficiently different texture classes in terms
of complexity. An experiment using images from the Brodatz album illustrates
the method performance.Comment: 4 pages, 4 figure
- …