Semantic Visual Localization
Robust visual localization under a wide range of viewing conditions is a
fundamental problem in computer vision. Handling the difficult cases of this
problem is not only very challenging but also of high practical relevance,
e.g., in the context of life-long localization for augmented reality or
autonomous robots. In this paper, we propose a novel approach based on a joint
3D geometric and semantic understanding of the world, enabling it to succeed
under conditions where previous approaches failed. Our method leverages a novel
generative model for descriptor learning, trained on semantic scene completion
as an auxiliary task. The resulting 3D descriptors are robust to missing
observations by encoding high-level 3D geometric and semantic information.
Experiments on several challenging large-scale localization datasets
demonstrate reliable localization under extreme viewpoint, illumination, and
geometry changes.
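As a rough, non-authoritative illustration of the idea (not the authors' code), the following Python sketch, assuming PyTorch, shows descriptor learning with semantic scene completion as an auxiliary task. The plain encoder-decoder standing in for the paper's generative model, and all layer sizes, class counts, and names, are illustrative assumptions:

    # Minimal sketch: a 3D encoder-decoder whose latent code serves as the
    # localization descriptor, supervised by semantic scene completion.
    import torch
    import torch.nn as nn

    class CompletionDescriptorNet(nn.Module):
        def __init__(self, num_classes=12, desc_dim=128):
            super().__init__()
            # Encoder: partial semantic volume (one-hot channels) -> latent code.
            self.enc = nn.Sequential(
                nn.Conv3d(num_classes, 32, 4, stride=2, padding=1), nn.ReLU(),
                nn.Conv3d(32, 64, 4, stride=2, padding=1), nn.ReLU(),
                nn.Conv3d(64, desc_dim, 4, stride=2, padding=1),
            )
            # Decoder: latent code -> per-voxel semantic class scores.
            self.dec = nn.Sequential(
                nn.ConvTranspose3d(desc_dim, 64, 4, stride=2, padding=1), nn.ReLU(),
                nn.ConvTranspose3d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
                nn.ConvTranspose3d(32, num_classes, 4, stride=2, padding=1),
            )

        def forward(self, partial_volume):
            latent = self.enc(partial_volume)     # 3D latent code
            completed = self.dec(latent)          # semantic completion logits
            # Pool the latent volume into a fixed-size descriptor for matching.
            descriptor = latent.mean(dim=(2, 3, 4))
            return descriptor, completed

    # Training step with dummy data: the completion loss is the auxiliary
    # supervision that forces the descriptor to encode geometry and semantics.
    net = CompletionDescriptorNet()
    partial = torch.randn(2, 12, 32, 32, 32)          # partial observation
    target = torch.randint(0, 12, (2, 32, 32, 32))    # full semantic grid
    desc, logits = net(partial)
    loss = nn.functional.cross_entropy(logits, target)
    loss.backward()

Because the descriptor must suffice to reconstruct a complete semantic volume from a partial observation, it is pushed toward high-level 3D geometric and semantic structure rather than raw appearance, which is what makes it robust to missing observations.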
Learning to Find Good Correspondences
We develop a deep architecture to learn to find good correspondences for
wide-baseline stereo. Given a set of putative sparse matches and the camera
intrinsics, we train our network in an end-to-end fashion to label the
correspondences as inliers or outliers, while simultaneously using them to
recover the relative pose, as encoded by the essential matrix. Our architecture
is based on a multi-layer perceptron operating on pixel coordinates rather than
directly on the image, and is thus simple and small. We introduce a novel
normalization technique, called Context Normalization, which allows us to
process each data point separately while imbuing it with global information,
and also makes the network invariant to the order of the correspondences. Our
experiments on multiple challenging datasets demonstrate that our method is
able to drastically improve the state of the art with little training data.
Comment: CVPR 2018 (Oral)
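Context Normalization itself can be stated in a few lines: within a single image pair, each feature channel is normalized by its mean and standard deviation across the N putative correspondences, so every point is processed independently while absorbing global context, and the set of outputs does not depend on the ordering of the input matches. A minimal numpy sketch, in which all shapes, weights, and names are illustrative assumptions rather than the authors' code:

    import numpy as np

    def context_norm(x, eps=1e-3):
        """x: (N, C) features for the N putative correspondences of one pair."""
        mu = x.mean(axis=0, keepdims=True)      # per-channel mean over points
        sigma = x.std(axis=0, keepdims=True)    # per-channel std over points
        return (x - mu) / (sigma + eps)

    def mlp_block(x, W, b):
        """One perceptron layer shared across points, then CN and ReLU."""
        return np.maximum(context_norm(x @ W + b), 0.0)

    # Toy usage: 512 putative matches, each a 4-vector (x1, y1, x2, y2) of
    # normalized pixel coordinates; weights are random placeholders.
    rng = np.random.default_rng(0)
    matches = rng.standard_normal((512, 4))
    W, b = rng.standard_normal((4, 128)) * 0.1, np.zeros(128)
    h = mlp_block(matches, W, b)    # (512, 128); permuting inputs permutes
                                    # outputs identically (order-equivariant)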
Learning and Matching Multi-View Descriptors for Registration of Point Clouds
Critical to the registration of point clouds is the establishment of a set of
accurate correspondences between points in 3D space. The correspondence problem
is generally addressed by the design of discriminative 3D local descriptors on
the one hand, and the development of robust matching strategies on the other
hand. In this work, we first propose a multi-view local descriptor, which is
learned from the images of multiple views, for the description of 3D keypoints.
Then, we develop a robust matching approach that rejects outlier matches
through efficient belief-propagation inference on a graphical model defined
over the putative matches. Experiments on public scanning and multi-view
stereo datasets demonstrate that our approaches boost registration
performance, as verified by extensive comparisons against a wide variety of
descriptors and matching methods.
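To make the matching step concrete, here is a hedged numpy sketch (not the authors' implementation) of outlier rejection by loopy belief propagation: each node is a putative match with a binary outlier/inlier label, and pairwise potentials reward pairs of matches that preserve point-to-point distances, since a rigid alignment cannot change them. The fully connected graph, flat unary potentials, and specific potential values are assumptions:

    import numpy as np

    def bp_inliers(P, Q, sigma=0.05, iters=10):
        """P, Q: (N, 3) coordinates of the N putative matches in each cloud."""
        N = P.shape[0]
        idx = np.arange(N)
        # Pairwise consistency: a rigid motion preserves distances, so for two
        # true inliers i, j the intra-cloud distances should agree.
        dP = np.linalg.norm(P[:, None] - P[None], axis=-1)
        dQ = np.linalg.norm(Q[:, None] - Q[None], axis=-1)
        consist = np.exp(-(dP - dQ) ** 2 / (2 * sigma ** 2))  # (N, N) in (0, 1]
        phi = np.full((N, 2), 0.5)     # flat unary potentials over {out, in}
        msg = np.ones((N, N, 2))       # msg[i, j]: message from node i to j
        for _ in range(iters):
            # Product of messages arriving at i, excluding the one from j
            # (for very large graphs this should be done in the log domain).
            prod_in = phi[:, None, :] * np.prod(msg, axis=0)[:, None, :] \
                      / msg.transpose(1, 0, 2)
            # Pairwise potential: psi(in, in) = consist, every other pair 0.5.
            m0 = 0.5 * (prod_in[..., 0] + prod_in[..., 1])          # x_j = out
            m1 = 0.5 * prod_in[..., 0] + consist * prod_in[..., 1]  # x_j = in
            msg = np.stack([m0, m1], axis=-1)
            msg = np.clip(msg / msg.sum(axis=-1, keepdims=True), 1e-12, 1.0)
            msg[idx, idx] = 1.0        # no self-messages
        belief = phi * np.prod(msg, axis=0)
        return belief[:, 1] > belief[:, 0]     # inlier mask

    # Toy usage: 30 matches related by a random rotation plus 10 outliers.
    rng = np.random.default_rng(1)
    P = rng.standard_normal((40, 3))
    Rm, _ = np.linalg.qr(rng.standard_normal((3, 3)))
    Q = P @ Rm.T
    Q[30:] = rng.standard_normal((10, 3))     # corrupt the last 10 matches
    print(bp_inliers(P, Q).astype(int))       # expect mostly 1s, then 0s

In this toy formulation, mutually consistent matches reinforce one another's inlier beliefs at every message-passing round, while matches that violate the distance-preservation constraint receive weak support and are labeled outliers.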