856 research outputs found
Localization from semantic observations via the matrix permanent
Most approaches to robot localization rely on low-level geometric features such as points, lines, and planes. In this paper, we use object recognition to obtain semantic information from the robot’s sensors and consider the task of localizing the robot within a prior map of landmarks, which are annotated with semantic labels. As object recognition algorithms miss detections and produce false alarms, correct data association between the detections and the landmarks on the map is central to the semantic localization problem. Instead of the traditional vector-based representation, we propose a sensor model, which encodes the semantic observations via random finite sets and enables a unified treatment of missed detections, false alarms, and data association. Our second contribution is to reduce the problem of computing the likelihood of a set-valued observation to the problem of computing a matrix permanent. It is this crucial transformation that allows us to solve the semantic localization problem with a polynomial-time approximation to the set-based Bayes filter. Finally, we address the active semantic localization problem, in which the observer’s trajectory is planned in order to improve the accuracy and efficiency of the localization process. The performance of our approach is demonstrated in simulation and in real environments using deformable-part-model-based object detectors. Robust global localization from semantic observations is demonstrated for a mobile robot, for the Project Tango phone, and on the KITTI visual odometry dataset. Comparisons are made with the traditional lidar-based geometric Monte Carlo localization
Keyframe-based monocular SLAM: design, survey, and future directions
Extensive research in the field of monocular SLAM for the past fifteen years
has yielded workable systems that found their way into various applications in
robotics and augmented reality. Although filter-based monocular SLAM systems
were common at some time, the more efficient keyframe-based solutions are
becoming the de facto methodology for building a monocular SLAM system. The
objective of this paper is threefold: first, the paper serves as a guideline
for people seeking to design their own monocular SLAM according to specific
environmental constraints. Second, it presents a survey that covers the various
keyframe-based monocular SLAM systems in the literature, detailing the
components of their implementation, and critically assessing the specific
strategies made in each proposed solution. Third, the paper provides insight
into the direction of future research in this field, to address the major
limitations still facing monocular SLAM; namely, in the issues of illumination
changes, initialization, highly dynamic motion, poorly textured scenes,
repetitive textures, map maintenance, and failure recovery
Modeling Perceptual Aliasing in SLAM via Discrete-Continuous Graphical Models
Perceptual aliasing is one of the main causes of failure for Simultaneous
Localization and Mapping (SLAM) systems operating in the wild. Perceptual
aliasing is the phenomenon where different places generate a similar visual
(or, in general, perceptual) footprint. This causes spurious measurements to be
fed to the SLAM estimator, which typically results in incorrect localization
and mapping results. The problem is exacerbated by the fact that those outliers
are highly correlated, in the sense that perceptual aliasing creates a large
number of mutually-consistent outliers. Another issue stems from the fact that
most state-of-the-art techniques rely on a given trajectory guess (e.g., from
odometry) to discern between inliers and outliers and this makes the resulting
pipeline brittle, since the accumulation of error may result in incorrect
choices and recovery from failures is far from trivial. This work provides a
unified framework to model perceptual aliasing in SLAM and provides practical
algorithms that can cope with outliers without relying on any initial guess. We
present two main contributions. The first is a Discrete-Continuous Graphical
Model (DC-GM) for SLAM: the continuous portion of the DC-GM captures the
standard SLAM problem, while the discrete portion describes the selection of
the outliers and models their correlation. The second contribution is a
semidefinite relaxation to perform inference in the DC-GM that returns
estimates with provable sub-optimality guarantees. Experimental results on
standard benchmarking datasets show that the proposed technique compares
favorably with state-of-the-art methods while not relying on an initial guess
for optimization.Comment: 13 pages, 14 figures, 1 tabl
POCD: Probabilistic Object-Level Change Detection and Volumetric Mapping in Semi-Static Scenes
Maintaining an up-to-date map to reflect recent changes in the scene is very
important, particularly in situations involving repeated traversals by a robot
operating in an environment over an extended period. Undetected changes may
cause a deterioration in map quality, leading to poor localization, inefficient
operations, and lost robots. Volumetric methods, such as truncated signed
distance functions (TSDFs), have quickly gained traction due to their real-time
production of a dense and detailed map, though map updating in scenes that
change over time remains a challenge. We propose a framework that introduces a
novel probabilistic object state representation to track object pose changes in
semi-static scenes. The representation jointly models a stationarity score and
a TSDF change measure for each object. A Bayesian update rule that incorporates
both geometric and semantic information is derived to achieve consistent online
map maintenance. To extensively evaluate our approach alongside the
state-of-the-art, we release a novel real-world dataset in a warehouse
environment. We also evaluate on the public ToyCar dataset. Our method
outperforms state-of-the-art methods on the reconstruction quality of
semi-static environments.Comment: Published in Robotics: Science and Systems (RSS) 202
- …