Search CORE

811 research outputs found

Image retrieval by information fusion based on scalable vocabulary tree and robust Hausdorff distance

Author: A Bosch
A Krizhevsky
A Pronobis
A Qamra
B Girod
D Nister
DG Lowe
DG Lowe
DM Chen
DP Huttenlocher
DW Scot
E Tola
E Tola
G Baatz
G Baatz
J Zhang
JC van Gemert
K Mikolajczyk
K Sentz
K-H Yap
L Bo
L Greengard
L Van Gool
MJ Swain
N Khan
O Jesorsky
P Dreuw
R Ji
S Haykin
S Lazebnik
TJ Chin
V-A Nguyen
W Rucklidge
Y LeCun
Y Li
Y Liu
Y Rubner
Z Li
Z Li
Z Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Visual Place Recognition under Severe Viewpoint and Appearance Changes

Author: Khaliq Ahmad
Publication venue
Publication date: 01/01/2020
Field of study

Over the last decade, the eagerness of the robotic and computer vision research communities unfolded extensive advancements in long-term robotic vision. Visual localization is the constituent of this active research domain; an ability of an object to correctly localize itself while mapping the environment simultaneously, technically termed as Simultaneous Localization and Mapping (SLAM). Visual Place Recognition (VPR), a core component of SLAM is a well-known paradigm. In layman terms, at a certain place/location within an environment, a robot needs to decide whether it’s the same place experienced before? Visual Place Recognition utilizing Convolutional Neural Networks (CNNs) has made a major contribution in the last few years. However, the image retrieval-based VPR becomes more challenging when the same places experience strong viewpoint and seasonal transitions. This thesis concentrates on improving the retrieval performance of VPR system, generally targeting the place correspondence. Despite the remarkable performances of state-of-the-art deep CNNs for VPR, the significant computation- and memory-overhead limit their practical deployment for resource constrained mobile robots. This thesis investigates the utility of shallow CNNs for power-efficient VPR applications. The proposed VPR frameworks focus on novel image regions that can contribute in recognizing places under dubious environment and viewpoint variations. Employing challenging place recognition benchmark datasets, this thesis further illustrates and evaluates the robustness of shallow CNN-based regional features against viewpoint and appearance changes coupled with dynamic instances, such as pedestrians, vehicles etc. Finally, the presented computation-efficient and light-weight VPR methodologies have shown boostup in matching performance in terms of Area under Precision-Recall curves (AUC-PR curves) over state-of-the-art deep neural network based place recognition and SLAM algorithms

University of Essex Research Repository

GEO-REFERENCED VIDEO RETRIEVAL: TEXT ANNOTATION AND SIMILARITY SEARCH

Author: YIN YIFANG
Publication venue
Publication date: 20/01/2016
Field of study

Ph.DDOCTOR OF PHILOSOPH

ScholarBank@NUS

Recommended from our members

Deep Structured Multi-Task Learning for Computer Vision in Autonomous Driving

Author: Teichmann Marvin
Publication venue: University of Cambridge
Publication date: 28/07/2020
Field of study

The field of computer vision is currently dominated by deep learning advances. Convolutional Neural Networks (CNNs) have become the predominant tool for solving almost any computer vision task, so state-of-the-art systems have been built by using the predictive capabilities of Convolutional Neural Networks (CNNs). Many of those systems use simple encoder–decoder based design, where an off-the-shelf CNN architecture is combined with a task-specific decoder and loss function in order to create an end-to-end trainable model. This ultimately raises the question of whether these kinds of models are the future of computer vision. In this thesis we argue that this is not the case. We start off by discussing three limitations of simple end-to-end training. We proceed by showing how it is possible to overcome those limitations by using an approach that we call structured modelling. The idea is to use CNNs to compute a rich semantic intermediate representation which is then used to solve the actual problem by applying a geometric and task-related structure. In this work we solve the localization, segmentation and landmark recognition task using structured modelling, and we show that this approach can improve generalization, interpretability and robustness. We also discuss how this approach is particularly useful for real-time applications such as autonomous driving. Visual perception is a multi-module problem that requires several different computer vision tasks to be solved. We discuss how, by sharing computations, we can improve not only the inference speed but also the prediction performance by using the structural relationship between the tasks. Lastly, we demonstrate that structured modelling is able to achieve state-of-the-art performance, making it a very relevant approach for solving current and future computer vision problems.Trinity College, ESPCR, Qualcom

Apollo (Cambridge)