Search CORE

7,919 research outputs found

Learning to Find Good Correspondences

Author: Fua Pascal
Lepetit Vincent
Ono Yuki
Salzmann Mathieu
Trulls Eduard
Yi Kwang Moo
Publication venue
Publication date: 01/01/2018
Field of study

We develop a deep architecture to learn to find good correspondences for wide-baseline stereo. Given a set of putative sparse matches and the camera intrinsics, we train our network in an end-to-end fashion to label the correspondences as inliers or outliers, while simultaneously using them to recover the relative pose, as encoded by the essential matrix. Our architecture is based on a multi-layer perceptron operating on pixel coordinates rather than directly on the image, and is thus simple and small. We introduce a novel normalization technique, called Context Normalization, which allows us to process each data point separately while imbuing it with global information, and also makes the network invariant to the order of the correspondences. Our experiments on multiple challenging datasets demonstrate that our method is able to drastically improve the state of the art with little training data.Comment: CVPR 2018 (Oral

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

HAL Descartes

Hal-Diderot

Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes

Author: Dai Yuchao
Ji Pan
Li Hongdong
Wang Jianyuan
Zhong Yiran
Publication venue
Publication date: 14/07/2022
Field of study

Unsupervised deep learning for optical flow computation has achieved promising results. Most existing deep-net based methods rely on image brightness consistency and local smoothness constraint to train the networks. Their performance degrades at regions where repetitive textures or occlusions occur. In this paper, we propose Deep Epipolar Flow, an unsupervised optical flow method which incorporates global geometric constraints into network learning. In particular, we investigate multiple ways of enforcing the epipolar constraint in flow estimation. To alleviate a "chicken-and-egg" type of problem encountered in dynamic scenes where multiple motions may be present, we propose a low-rank constraint as well as a union-of-subspaces constraint for training. Experimental results on various benchmarking datasets show that our method achieves competitive performance compared with supervised methods and outperforms state-of-the-art unsupervised deep-learning methods.Comment: CVPR 201

arXiv.org e-Print Archive

Compact Model Representation for 3D Reconstruction

Author: Eriksson Anders
Fookes Clinton
Kong Chen
Lucey Simon
Pontes Jhony K.
Sridharan Sridha
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

3D reconstruction from 2D images is a central problem in computer vision. Recent works have been focusing on reconstruction directly from a single image. It is well known however that only one image cannot provide enough information for such a reconstruction. A prior knowledge that has been entertained are 3D CAD models due to its online ubiquity. A fundamental question is how to compactly represent millions of CAD models while allowing generalization to new unseen objects with fine-scaled geometry. We introduce an approach to compactly represent a 3D mesh. Our method first selects a 3D model from a graph structure by using a novel free-form deformation FFD 3D-2D registration, and then the selected 3D model is refined to best fit the image silhouette. We perform a comprehensive quantitative and qualitative analysis that demonstrates impressive dense and realistic 3D reconstruction from single images.Comment: 9 pages, 6 figure

arXiv.org e-Print Archive

Crossref

Queensland University of Technology ePrints Archive

University of Queensland eSpace

Multi-Image Semantic Matching by Mining Consistent Features

Author: Daniilidis Kostas
Wang Qianqian
Zhou Xiaowei
Publication venue
Publication date: 01/05/2018
Field of study

This work proposes a multi-image matching method to estimate semantic correspondences across multiple images. In contrast to the previous methods that optimize all pairwise correspondences, the proposed method identifies and matches only a sparse set of reliable features in the image collection. In this way, the proposed method is able to prune nonrepeatable features and also highly scalable to handle thousands of images. We additionally propose a low-rank constraint to ensure the geometric consistency of feature correspondences over the whole image collection. Besides the competitive performance on multi-graph matching and semantic flow benchmarks, we also demonstrate the applicability of the proposed method for reconstructing object-class models and discovering object-class landmarks from images without using any annotation.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref