Deep Shape Matching
We cast shape matching as metric learning with convolutional networks. We
break the end-to-end process of image representation into two parts. Firstly,
well established efficient methods are chosen to turn the images into edge
maps. Secondly, the network is trained with edge maps of landmark images, which
are automatically obtained by a structure-from-motion pipeline. The learned
representation is evaluated on a range of different tasks, providing
improvements on challenging cases of domain generalization, generic
sketch-based image retrieval or its fine-grained counterpart. In contrast to
other methods that learn a different model per task, object category, or
domain, we use the same network throughout all our experiments, achieving
state-of-the-art results in multiple benchmarks.Comment: ECCV 201
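The two-part pipeline above can be illustrated with a minimal sketch: an off-the-shelf edge detector (here a Sobel gradient magnitude, standing in for the "well established efficient methods" the abstract mentions) followed by a contrastive metric-learning loss on embedding distances. This is not the authors' implementation; the functions and the margin value are illustrative.

```python
import numpy as np

def sobel_edge_map(img):
    # Toy edge extractor: Sobel gradient magnitude of a 2-D grayscale
    # array. Stands in for the paper's pre-trained edge detector.
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], float)
    ky = kx.T
    pad = np.pad(img.astype(float), 1, mode="edge")
    h, w = img.shape
    gx = np.zeros((h, w))
    gy = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            patch = pad[i:i + 3, j:j + 3]
            gx[i, j] = (patch * kx).sum()
            gy[i, j] = (patch * ky).sum()
    return np.hypot(gx, gy)

def contrastive_loss(d, same, margin=0.7):
    # d: Euclidean distance between two edge-map embeddings.
    # same: 1 if the pair shows the same landmark (SfM-verified), else 0.
    # Matching pairs are pulled together, non-matching pairs pushed
    # beyond the margin. Margin value is a hypothetical choice.
    return same * d ** 2 + (1 - same) * max(margin - d, 0.0) ** 2
```

In training, `same` labels come for free from the structure-from-motion pipeline: images reconstructed into the same 3-D model are positives, images from different models are negatives.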
Enhanced spatial pyramid matching using log-polar-based image subdivision and representation
This paper presents a new model for capturing spatial information for object categorization with bag-of-words (BOW). BOW models have recently become popular for the task of object recognition, owing to their good performance and simplicity. Many improvements to the BOW model have been proposed over the years, of which the Spatial Pyramid Matching (SPM) technique is the most notable. We propose a new method to exploit spatial relationships between image features, based on binned log-polar grids. Our model works by partitioning the image into grids of different scales and orientations and computing a histogram of local features within each grid cell. Experimental results show that our approach improves on the SPM technique across three diverse datasets.
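The log-polar binning idea can be sketched as follows: quantized features are histogrammed per cell of a grid whose rings are log-spaced in radius and whose wedges partition the angle, then the per-cell histograms are concatenated. This is a minimal sketch under assumed parameters (ring/wedge counts, grid layout); the paper's exact subdivision scheme may differ.

```python
import numpy as np

def log_polar_bow(points, words, center, n_words, n_rings=3, n_wedges=4):
    # points: (N, 2) array of (x, y) feature locations
    # words:  (N,) visual-word index per feature
    # Returns concatenated per-cell BoW histograms over a log-polar grid
    # centered at `center` (hypothetical layout, for illustration).
    pts = np.asarray(points, float) - np.asarray(center, float)
    r = np.hypot(pts[:, 0], pts[:, 1])
    theta = np.mod(np.arctan2(pts[:, 1], pts[:, 0]), 2 * np.pi)
    r_max = r.max() + 1e-9
    # Log-spaced ring edges give finer resolution near the center,
    # coarser resolution toward the periphery.
    edges = np.logspace(0, np.log10(r_max + 1), n_rings + 1) - 1
    ring = np.clip(np.searchsorted(edges, r, side="right") - 1, 0, n_rings - 1)
    wedge = np.minimum((theta / (2 * np.pi) * n_wedges).astype(int),
                       n_wedges - 1)
    hist = np.zeros((n_rings, n_wedges, n_words))
    for c, w, v in zip(ring, wedge, words):
        hist[c, w, v] += 1
    return hist.ravel()
```

As in SPM, the concatenated vector can be compared with histogram-intersection or chi-squared kernels; only the spatial partition changes.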
On the Design and Analysis of Multiple View Descriptors
We propose an extension of popular descriptors based on gradient orientation
histograms (HOG, computed in a single image) to multiple views. It hinges on
interpreting HOG as a conditional density in the space of sampled images, where
the effects of nuisance factors such as viewpoint and illumination are
marginalized. However, such marginalization is performed with respect to a very
coarse approximation of the underlying distribution. Our extension leverages
the fact that multiple views of the same scene allow separating intrinsic from
nuisance variability, and thus afford better marginalization of the latter. The
result is a descriptor with the same complexity as single-view HOG that can
be compared in the same manner, but exploits multiple views to better trade off
insensitivity to nuisance variability with specificity to intrinsic
variability. We also introduce a novel multi-view wide-baseline matching
dataset, consisting of a mixture of real and synthetic objects with
ground-truth camera motion and dense three-dimensional geometry.
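The core intuition can be sketched with a toy example: compute a gradient-orientation histogram per view (one "cell" of HOG), then average the histograms across registered views. The sample average over views approximates marginalizing viewpoint and illumination better than any single view does, while keeping the single-view descriptor size. This is a minimal sketch, not the paper's construction; `orientation_hist` collapses HOG to one global cell for brevity.

```python
import numpy as np

def orientation_hist(img, n_bins=9):
    # Magnitude-weighted histogram of unsigned gradient orientations
    # over the whole patch: a toy stand-in for one HOG cell.
    gy, gx = np.gradient(img.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)  # unsigned orientations
    bins = np.minimum((ang / np.pi * n_bins).astype(int), n_bins - 1)
    h = np.zeros(n_bins)
    for b, m in zip(bins.ravel(), mag.ravel()):
        h[b] += m
    return h / (h.sum() + 1e-9)

def multiview_hog(views, n_bins=9):
    # Pool per-view histograms: averaging over multiple registered
    # views of the same scene crudely marginalizes nuisance factors
    # (viewpoint, illumination) while the descriptor keeps the same
    # dimensionality as single-view HOG and is compared the same way.
    return np.mean([orientation_hist(v, n_bins) for v in views], axis=0)
```

Because the pooled descriptor has the same shape as a single-view histogram, existing distance functions (e.g. L2 or chi-squared) apply unchanged.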