Search CORE

2,395 research outputs found

Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on n-Spheres

Author: Gavves Efstratios
Liao Shuai
Snoek Cees G. M.
Publication venue
Publication date: 01/01/2019
Field of study

Many computer vision challenges require continuous outputs, but tend to be solved by discrete classification. The reason is classification's natural containment within a probability

n

-simplex, as defined by the popular softmax activation function. Regular regression lacks such a closed geometry, leading to unstable training and convergence to suboptimal local minima. Starting from this insight we revisit regression in convolutional neural networks. We observe many continuous output problems in computer vision are naturally contained in closed geometrical manifolds, like the Euler angles in viewpoint estimation or the normals in surface normal estimation. A natural framework for posing such continuous output problems are

n

-spheres, which are naturally closed geometric manifolds defined in the

\mathbb{R}^{(n+1)}

space. By introducing a spherical exponential mapping on

n

-spheres at the regression output, we obtain well-behaved gradients, leading to stable training. We show how our spherical regression can be utilized for several computer vision challenges, specifically viewpoint estimation, surface normal estimation and 3D rotation estimation. For all these problems our experiments demonstrate the benefit of spherical regression. All paper resources are available at https://github.com/leoshine/Spherical_Regression.Comment: CVPR 2019 camera read

arXiv.org e-Print Archive

UvA-DARE

Convolutional Color Constancy

Author: Barron Jonathan T.
Publication venue
Publication date: 18/09/2015
Field of study

Color constancy is the problem of inferring the color of the light that illuminated a scene, usually so that the illumination color can be removed. Because this problem is underconstrained, it is often solved by modeling the statistical regularities of the colors of natural objects and illumination. In contrast, in this paper we reformulate the problem of color constancy as a 2D spatial localization task in a log-chrominance space, thereby allowing us to apply techniques from object detection and structured prediction to the color constancy problem. By directly learning how to discriminate between correctly white-balanced images and poorly white-balanced images, our model is able to improve performance on standard benchmarks by nearly 40%

arXiv.org e-Print Archive

CiteSeerX