
    Aggregated Deep Local Features for Remote Sensing Image Retrieval

    Remote sensing image retrieval remains a challenging topic due to the special nature of remote sensing imagery. Such images contain many different semantic objects, which clearly complicates the retrieval task. In this paper, we present an image retrieval pipeline that uses attentive, local convolutional features and aggregates them using the Vector of Locally Aggregated Descriptors (VLAD) to produce a global descriptor. We study various system parameters, such as the multiplicative and additive attention mechanisms and descriptor dimensionality. We propose a query expansion method that requires no external inputs. Experiments demonstrate that, even without training, the local convolutional features and global representation outperform other systems. After system tuning, we achieve state-of-the-art or competitive results. Furthermore, we observe that our query expansion method increases overall system performance by about 3%, using only the top three retrieved images. Finally, we show how dimensionality reduction produces compact descriptors with increased retrieval performance and fast retrieval computation times, e.g. 50% faster than current systems.

    Comment: Published in Remote Sensing. The first two authors have equal contributions.
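    As an illustration of the aggregation step, here is a minimal sketch of VLAD pooling over a set of local descriptors, assuming a codebook of cluster centres has already been learned (e.g. with k-means); the function name and normalisation choices are illustrative, not taken from the paper's code.

```python
import numpy as np

def vlad_aggregate(local_descs, codebook):
    """Aggregate local descriptors into a single VLAD vector.

    local_descs: (N, D) array of local convolutional features.
    codebook:    (K, D) array of cluster centres (e.g. from k-means).
    Returns a flattened, L2-normalised (K*D,) global descriptor.
    """
    # Assign each local descriptor to its nearest cluster centre.
    dists = np.linalg.norm(local_descs[:, None, :] - codebook[None, :, :], axis=2)
    assignments = np.argmin(dists, axis=1)

    K, D = codebook.shape
    vlad = np.zeros((K, D))
    for k in range(K):
        members = local_descs[assignments == k]
        if len(members):
            # Accumulate residuals between descriptors and their centre.
            vlad[k] = (members - codebook[k]).sum(axis=0)

    vlad = vlad.flatten()
    # Signed square-root (power) normalisation, then global L2 normalisation.
    vlad = np.sign(vlad) * np.sqrt(np.abs(vlad))
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad
```

    Concatenating per-cluster residuals in this way yields the K*D-dimensional global descriptor, which can then be compressed, e.g. by dimensionality reduction, for compact storage and fast retrieval.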

    ProSLAM: Graph SLAM from a Programmer's Perspective

    In this paper we present ProSLAM, a lightweight stereo visual SLAM system designed with simplicity in mind. Our work stems from the experience gathered by the authors while teaching SLAM to students and aims at providing a highly modular system that can be easily implemented and understood. Rather than focusing on the well-known mathematical aspects of stereo visual SLAM, in this work we highlight the data structures and the algorithmic aspects that one needs to tackle during the design of such a system. We implemented ProSLAM in C++ using a minimal set of well-known external libraries. In addition to an open-source implementation, we provide several code snippets that address the core aspects of our approach directly in this paper. The results of a thorough validation on standard benchmark datasets show that our approach achieves accuracy comparable to state-of-the-art methods while requiring substantially fewer computational resources.

    Comment: 8 pages, 8 figures.
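    ProSLAM itself is written in C++, but the data-structure view the paper emphasises can be sketched language-agnostically. The following Python sketch shows plausible pose-graph primitives (keyframe nodes and relative-pose constraints); all names are hypothetical and not taken from the ProSLAM sources.

```python
from dataclasses import dataclass, field
import numpy as np

@dataclass
class Keyframe:
    """A pose-graph node: a camera pose plus the landmarks observed from it."""
    id: int
    pose: np.ndarray                  # 4x4 homogeneous world-from-camera transform
    landmark_ids: list = field(default_factory=list)

@dataclass
class PoseConstraint:
    """A pose-graph edge: a measured relative transform between two keyframes."""
    from_id: int
    to_id: int
    relative_pose: np.ndarray         # 4x4 transform from tracking or loop closure
    information: np.ndarray           # 6x6 information (inverse covariance) matrix

class PoseGraph:
    def __init__(self):
        self.keyframes = {}           # keyframe id -> Keyframe
        self.constraints = []         # list of PoseConstraint

    def add_keyframe(self, kf):
        self.keyframes[kf.id] = kf

    def add_constraint(self, c):
        # Odometry edges chain consecutive keyframes; loop-closure edges
        # connect a new keyframe to a much older one, closing the graph.
        self.constraints.append(c)
```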

    Automated annotation of landmark images using community contributed datasets and web resources

    A novel solution to the challenge of automatic image annotation is described. Given an image with GPS data of its location of capture, our system returns a semantically rich annotation comprising tags which both identify the landmark in the image and provide an interesting fact about it, e.g. "A view of the Eiffel Tower, which was built in 1889 for an international exhibition in Paris". This exploits visual and textual web mining in combination with content-based image analysis and natural language processing. In the first stage, an input image is matched to a set of community-contributed images (with keyword tags) on the basis of its GPS information and image classification techniques. The depicted landmark is inferred from the keyword tags of the matched set. The system then takes advantage of the information written about landmarks available on the web at large to extract a fact about the landmark in the image. We report component evaluation results from an implementation of our solution on a mobile device. Image localisation and matching offers 93.6% classification accuracy; the selection of appropriate tags for use in annotation performs well (F1M of 0.59), and the system subsequently identifies a correct toponym for use in captioning and fact extraction in 69.0% of the tested cases; finally, fact extraction returns an interesting caption in 78% of cases.
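    The first stage hinges on restricting candidate community images by GPS proximity before visual matching. A minimal sketch of such a filter, using the haversine great-circle distance, is shown below; the search radius and dictionary keys are assumptions for illustration, not details from the paper.

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two GPS fixes."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def candidate_images(query_gps, tagged_images, radius_km=1.0):
    """Keep only community images captured within radius_km of the query fix."""
    lat, lon = query_gps
    return [img for img in tagged_images
            if haversine_km(lat, lon, img["lat"], img["lon"]) <= radius_km]
```

    Only the surviving candidates would then be passed to the (much more expensive) content-based matching and tag-inference stages.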

    A comparative evaluation of interest point detectors and local descriptors for visual SLAM

    In this paper we compare the behavior of different interest point detectors and descriptors under the conditions needed to use them as landmarks in vision-based simultaneous localization and mapping (SLAM). We evaluate the repeatability of the detectors, as well as the invariance and distinctiveness of the descriptors, under different perceptual conditions, using sequences of images representing planar objects as well as 3D scenes. We believe that this information will be useful when selecting an appropriate detector and descriptor for visual SLAM.
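    Detector repeatability is commonly measured as the fraction of keypoints from one image that reappear, within a pixel tolerance, at the geometrically corresponding location in a second image. A sketch under the planar-scene assumption (so a single homography relates the two views) follows; the tolerance value and function name are illustrative.

```python
import numpy as np

def repeatability(kps_a, kps_b, H, eps=2.5):
    """Fraction of keypoints from image A re-detected in image B.

    kps_a, kps_b: (N, 2) and (M, 2) arrays of (x, y) keypoint locations.
    H:            3x3 homography mapping image-A coordinates into image B.
    eps:          pixel tolerance for declaring a correspondence.
    """
    # Project A's keypoints into image B via the homography.
    pts = np.hstack([kps_a, np.ones((len(kps_a), 1))]) @ H.T
    pts = pts[:, :2] / pts[:, 2:3]

    # A projected keypoint is "repeated" if some detection in B lies within eps.
    dists = np.linalg.norm(pts[:, None, :] - kps_b[None, :, :], axis=2)
    repeated = (dists.min(axis=1) <= eps).sum()
    return repeated / max(len(kps_a), 1)
```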