Machine Learning for Multi-Robot Semantic Simultaneous Localization and Mapping
ABSTRACT
Automation and robotics are becoming more and more common in our daily lives, with many possible applications. Deploying robots in the world can extend what humans are capable of doing, and can save us from dangerous and strenuous tasks. For robots to be safely sent out into the real world, and into new unknown environments, one key capability they need is to perceive their environment, and in particular to localize themselves with respect to their surroundings. To be deployable anywhere, robots should be able to do so relying only on their own sensors, the most commonly used being cameras. One way to generate such an estimate is to use a simultaneous localization and mapping (SLAM) algorithm, in which the robot concurrently builds a map of its environment and estimates its state within it. Single-robot SLAM has been extensively researched and is now considered a mature field. However, using a team of robots can provide several benefits in terms of robustness, efficiency, and performance for many tasks. In this case, multi-robot SLAM algorithms are required to allow each robot to benefit from the whole team's experience. Multi-robot SLAM can build on top of single-robot SLAM solutions, but requires adaptations and faces additional computation and communication constraints. One particular challenge that arises in multi-robot SLAM is the need for robots to find inter-robot loop closures: relationships between the trajectories of different robots that can be found when they visit the same place. Two categories of approaches exist to detect inter-robot loop closures. In indirect methods, robots communicate to find out whether they have mapped the same area, and then attempt to find loop closures using the data gathered by each robot in the jointly visited place. In direct methods, robots rely directly on the data they gather from their sensors to estimate the loop closures. Each approach has its own benefits and challenges, with indirect methods being more popular in recent works. This thesis builds on recent computer vision advancements to present contributions to each category of approaches for inter-robot loop closure detection. A first approach is presented for indirect loop closure detection in a team of fully connected robots. It relies on constellations, a compact semantic representation of the environment based on the objects it contains. Descriptors and comparison methods for constellations are designed to robustly recognize places based on their constellations with minimal data exchange. These are used in a decentralized place recognition mechanism that scales as the team grows. The proposed method performs comparably to state-of-the-art solutions in terms of performance and data exchange required, while being more meaningful and interpretable.
CAPRICORN: Communication Aware Place Recognition using Interpretable Constellations of Objects in Robot Networks
Using multiple robots for exploring and mapping environments can provide improved robustness and performance, but it can be difficult to implement. In particular, limited communication bandwidth is a considerable constraint when a robot needs to determine whether it has visited a location previously explored by another robot, as this requires robots to share descriptions of the places they have visited. One way to compress these descriptions is to use constellations: groups of 3D points corresponding to the estimated relative positions of a set of objects. Constellations maintain the same pattern from different viewpoints and can be robust to illumination changes and dynamic elements. We present a method to extract compact spatial and semantic descriptors of the objects in a scene from these constellations. We use this representation in a two-step decentralized loop closure verification: first, we distribute the compact semantic descriptors to determine which other robots might have seen scenes with similar objects; then we query the matching robots with the full constellation to validate the match using geometric information. The proposed method requires less memory and is more interpretable than global image descriptors, and could be useful for other tasks and interactions with the environment. We validate our system's performance on a TUM RGB-D SLAM sequence and show its benefits in terms of bandwidth requirements.
Comment: 8 pages, 6 figures, 1 table. 2020 IEEE International Conference on Robotics and Automation (ICRA).
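To make the two-step exchange concrete, here is a minimal sketch of how such a scheme could work, assuming a histogram of object classes as the compact semantic descriptor and viewpoint-invariant pairwise distances for the geometric check; the helper names and thresholds are illustrative, not the paper's actual implementation.

```python
# Hypothetical sketch of a 2-step decentralized loop closure check.
# Step 1 exchanges only a tiny semantic descriptor; step 2 sends the
# full constellation only to robots whose descriptors matched.
import numpy as np

def semantic_descriptor(class_ids, num_classes=80):
    """Compact per-scene descriptor: a normalized histogram of object classes."""
    hist = np.bincount(class_ids, minlength=num_classes).astype(float)
    return hist / max(hist.sum(), 1.0)

def semantic_match(desc_a, desc_b, threshold=0.5):
    """Cheap first-pass filter: cosine similarity between class histograms."""
    denom = np.linalg.norm(desc_a) * np.linalg.norm(desc_b)
    return denom > 0 and float(desc_a @ desc_b) / denom > threshold

def geometric_match(points_a, points_b, tol=0.2):
    """Second pass on full constellations (Nx3 object positions): compare
    sorted pairwise inter-object distances, which are invariant to viewpoint
    (rigid transforms)."""
    def pairwise(p):
        d = np.linalg.norm(p[:, None, :] - p[None, :, :], axis=-1)
        return np.sort(d[np.triu_indices(len(p), k=1)])
    da, db = pairwise(points_a), pairwise(points_b)
    return len(da) == len(db) and bool(np.all(np.abs(da - db) < tol))

# Robot A only broadcasts the histogram; the constellation itself is
# requested only from robots that pass the semantic filter.
scene_a = np.array([[0.0, 0, 0], [1.0, 0, 0], [0, 2.0, 0]])  # three objects
scene_b = scene_a + np.array([5.0, 1.0, 0.0])  # same place, different frame
desc_a = semantic_descriptor(np.array([56, 60, 62]))  # e.g., COCO class ids
desc_b = semantic_descriptor(np.array([56, 60, 62]))
if semantic_match(desc_a, desc_b):
    print("geometric match:", geometric_match(scene_a, scene_b))
```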
Towards Collaborative Simultaneous Localization and Mapping: a Survey of the Current Research Landscape
Motivated by the tremendous progress witnessed in recent years, this paper presents a survey of the scientific literature on the topic of Collaborative Simultaneous Localization and Mapping (C-SLAM), also known as multi-robot SLAM. With fleets of self-driving cars on the horizon and the rise of multi-robot systems in industrial applications, we believe that Collaborative SLAM will soon become a cornerstone of future robotic applications. In this survey, we introduce the basic concepts of C-SLAM and present a thorough literature review. We also outline the major challenges and limitations of C-SLAM in terms of robustness, communication, and resource management. We conclude by exploring the area's current trends and promising research avenues.
Comment: 44 pages, 3 figures.
MSL-RAPTOR: A 6DoF Relative Pose Tracker for Onboard Robotic Perception
Determining the relative position and orientation of objects in an environment is a fundamental building block for a wide range of robotics applications. To accomplish this task efficiently in practical settings, a method must be fast, use common sensors, and generalize easily to new objects and environments. We present MSL-RAPTOR, a two-stage algorithm for tracking a rigid body with a monocular camera. The image is first processed by an efficient neural-network-based front-end that detects new objects and tracks 2D bounding boxes between frames. The class label and bounding box are passed to the back-end, which updates the object's pose using an unscented Kalman filter (UKF). The measurement posterior is fed back to the 2D tracker to improve robustness. Because the object's class is identified, a class-specific UKF can be used when custom dynamics and constraints are known. Adapting the tracker to new classes only requires providing a trained 2D object detector or labeled 2D bounding box data, as well as the approximate size of the objects. The performance of MSL-RAPTOR is first verified on the NOCS-REAL275 dataset, where it achieves results comparable to RGB-D approaches despite not using depth measurements. When tracking a flying drone from onboard another drone, it outperforms the fastest comparable method in speed by a factor of 3, while giving median translation and rotation errors that are 66% and 23% lower, respectively.
Comment: 12 pages, 6 figures, to be published in 2020 International Symposium on Experimental Robotics (ISER).
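As a rough illustration of the back-end, here is a minimal UKF sketch built on the third-party filterpy library; the state is simplified to 3D position and velocity (the actual system tracks a full 6DoF pose), and the focal length, object width, and noise values are assumed placeholders rather than values from the paper.

```python
# Minimal sketch of a bounding-box-driven UKF back-end (not the authors' code).
import numpy as np
from filterpy.kalman import UnscentedKalmanFilter, MerweScaledSigmaPoints

F_PX = 600.0      # assumed focal length in pixels
OBJ_WIDTH = 0.5   # assumed known object width in meters

def fx(x, dt):
    """Constant-velocity motion model: state = [px, py, pz, vx, vy, vz]."""
    out = x.copy()
    out[:3] += dt * x[3:]
    return out

def hx(x):
    """Pinhole measurement model: bbox center (u, v) and bbox width in pixels."""
    px, py, pz = x[:3]
    return np.array([F_PX * px / pz, F_PX * py / pz, F_PX * OBJ_WIDTH / pz])

points = MerweScaledSigmaPoints(n=6, alpha=0.1, beta=2.0, kappa=0.0)
ukf = UnscentedKalmanFilter(dim_x=6, dim_z=3, dt=0.1, hx=hx, fx=fx, points=points)
ukf.x = np.array([0.0, 0.0, 4.0, 0.0, 0.0, 0.0])  # initial guess: 4 m ahead
ukf.P *= 0.5                                      # moderate initial uncertainty
ukf.R = np.diag([5.0, 5.0, 10.0]) ** 2            # pixel noise on detections
ukf.Q = np.eye(6) * 0.01                          # process noise

# Each frame, the 2D detector front-end supplies (u, v, width_px).
for z in [np.array([10.0, -5.0, 74.0]), np.array([12.0, -4.0, 76.0])]:
    ukf.predict()
    ukf.update(z)
    print("estimated position:", ukf.x[:3])
```

A class-specific variant would simply swap in a different `fx` (e.g., drone dynamics) once the detector has identified the object's class.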
VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
This paper adapts a general dataset representation technique to produce robust Visual Place Recognition (VPR) descriptors, crucial for enabling real-world mobile robot localisation. Two parallel lines of work on VPR have shown, on one side, that general-purpose off-the-shelf feature representations can provide robustness to domain shifts, and, on the other, that fusing information from sequences of images improves performance. In our recent work on measuring domain gaps between image datasets, we proposed the Visual Distribution of Neuron Activations (VDNA) representation for datasets of images. This representation naturally handles image sequences and provides a general and granular feature representation derived from a general-purpose model. Moreover, because it is based on tracking neuron activation values over the images being represented, it is not limited to a particular neural network layer and therefore has access to both high- and low-level concepts. This work shows how VDNAs can be used for VPR by learning a very lightweight and simple encoder that generates task-specific descriptors. Our experiments show that our representation allows for better robustness than current solutions to severe domain shifts away from the training data distribution, such as to indoor environments and aerial imagery.
Comment: Published at ICRA 2024.
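As an interpretation of the idea in code, here is a rough sketch: per-neuron activation histograms accumulated over an image sequence, then passed through a small learned encoder. The neuron count, bin count, histogram range, and encoder sizes are assumed for illustration and are not the authors' implementation.

```python
# Rough sketch of the VDNA-to-descriptor pipeline (assumed details).
import torch
import torch.nn as nn

N_NEURONS = 256   # assumed: channels tracked from a general-purpose backbone
N_BINS = 32       # assumed histogram granularity

def vdna(activations, n_bins=N_BINS, lo=-3.0, hi=3.0):
    """activations: (n_images, n_neurons, h*w) backbone feature values.
    Builds one normalized histogram per neuron over ALL images, so an
    image sequence is summarized the same way as a single frame."""
    hists = []
    for n in range(activations.shape[1]):
        h = torch.histc(activations[:, n, :].flatten(),
                        bins=n_bins, min=lo, max=hi)
        hists.append(h / h.sum().clamp(min=1.0))
    return torch.stack(hists)  # (n_neurons, n_bins)

# Lightweight task-specific encoder mapping the VDNA to a VPR descriptor.
encoder = nn.Sequential(
    nn.Flatten(),
    nn.Linear(N_NEURONS * N_BINS, 512),
    nn.ReLU(),
    nn.Linear(512, 128),
)

seq_feats = torch.randn(5, N_NEURONS, 14 * 14)  # stand-in backbone features
descriptor = encoder(vdna(seq_feats).unsqueeze(0))
print(descriptor.shape)  # torch.Size([1, 128])
```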
DOOR-SLAM: Distributed, Online, and Outlier Resilient SLAM for Robotic Teams
To achieve collaborative tasks, robots in a team need a shared understanding of the environment and of their location within it. Distributed Simultaneous Localization and Mapping (SLAM) offers a practical solution to localize the robots without relying on an external positioning system (e.g., GPS) and with minimal information exchange. Unfortunately, current distributed SLAM systems are vulnerable to perception outliers and therefore tend to use very conservative parameters for inter-robot place recognition. However, being too conservative comes at the cost of rejecting many valid loop closure candidates, which results in less accurate trajectory estimates. This paper introduces DOOR-SLAM, a fully distributed SLAM system with an outlier rejection mechanism that can work with less conservative parameters. DOOR-SLAM is based on peer-to-peer communication and does not require full connectivity among the robots. It includes two key modules: a pose graph optimizer combined with a distributed pairwise consistent measurement set maximization algorithm to reject spurious inter-robot loop closures; and a distributed SLAM front-end that detects inter-robot loop closures without exchanging raw sensor data. The system has been evaluated in simulations, on benchmarking datasets, and in field experiments, including tests in GPS-denied subterranean environments. DOOR-SLAM produces more inter-robot loop closures, successfully rejects outliers, and yields accurate trajectory estimates, all while requiring low communication bandwidth. Full source code is available at https://github.com/MISTLab/DOOR-SLAM.git.
Comment: 8 pages, 11 figures, 2 tables.
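For intuition, here is a heavily simplified sketch of the pairwise consistency idea behind the outlier rejection module: two inter-robot loop closures are mutually consistent if composing them with the odometry between their endpoints roughly closes the loop, and the algorithm keeps a largest pairwise-consistent subset. This is an assumed 2D formulation with a brute-force search; the actual system solves a maximum clique problem and is not limited to a translation residual.

```python
# Simplified pairwise-consistent measurement set selection (illustrative only).
import itertools
import numpy as np

def consistent(z_ik, z_jl, odom_ij, odom_kl, tol=0.3):
    """Poses as 3x3 homogeneous 2D transforms. Checks that
    z_ik . odom_kl . inv(z_jl) . inv(odom_ij) is close to identity."""
    err = z_ik @ odom_kl @ np.linalg.inv(z_jl) @ np.linalg.inv(odom_ij)
    return np.linalg.norm(err[:2, 2]) < tol  # translation residual only

def max_consistent_set(loops, odoms):
    """Brute-force largest pairwise-consistent subset. odoms maps an index
    pair (i, j) to the two odometry segments linking the loops' endpoints."""
    for r in range(len(loops), 0, -1):
        for subset in itertools.combinations(range(len(loops)), r):
            ok = all(consistent(loops[i], loops[j], *odoms[(i, j)])
                     for i, j in itertools.combinations(subset, 2))
            if ok:
                return list(subset)
    return []

# Toy demo: three identity loop closures with identity odometry all agree.
I = np.eye(3)
loops = [I, I, I]
odoms = {(i, j): (I, I) for i, j in itertools.combinations(range(3), 2)}
print(max_consistent_set(loops, odoms))  # [0, 1, 2]
```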
NeBula: Team CoSTAR's robotic autonomy solution that won phase II of DARPA Subterranean Challenge
This paper presents and discusses the algorithms, hardware, and software architecture developed by Team CoSTAR (Collaborative SubTerranean Autonomous Robots) for the DARPA Subterranean Challenge. Specifically, it presents the techniques utilized within the Tunnel (2019) and Urban (2020) competitions, where CoSTAR achieved second and first place, respectively. We also discuss CoSTAR's demonstrations in Martian-analog surface and subsurface (lava tube) exploration. The paper introduces our autonomy solution, referred to as NeBula (Networked Belief-aware Perceptual Autonomy). NeBula is an uncertainty-aware framework that aims at enabling resilient and modular autonomy solutions by performing reasoning and decision making in the belief space (the space of probability distributions over the robot and world states). We discuss various components of the NeBula framework, including (i) geometric and semantic environment mapping, (ii) a multi-modal positioning system, (iii) traversability analysis and local planning, (iv) global motion planning and exploration behavior, (v) risk-aware mission planning, (vi) networking and decentralized reasoning, and (vii) learning-enabled adaptation. We discuss the performance of NeBula on several robot types (e.g., wheeled, legged, flying) in various environments, and present the specific results and lessons learned from fielding this solution in the challenging courses of the DARPA Subterranean Challenge competition.
The work is partially supported by the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004), and the Defense Advanced Research Projects Agency (DARPA).
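To give a concrete flavor of belief-space decision making, here is an illustrative toy example (not taken from the paper): the planner scores actions against a probability distribution over world states rather than a single point estimate, and the belief is refined by Bayesian updates as sensor readings arrive. All numbers are invented for illustration.

```python
# Toy belief-space decision making: expected cost under a state distribution.
import numpy as np

# Belief over whether a corridor ahead is traversable: [P(free), P(blocked)].
belief = np.array([0.6, 0.4])

# Assumed outcome costs: rows = action, columns = world state.
#                 free   blocked
costs = np.array([[1.0,  10.0],   # action 0: drive the corridor
                  [3.0,   3.0]])  # action 1: take a safe detour

# Risk-aware choice: minimize expected cost under the current belief,
# rather than planning against the single most likely state.
expected = costs @ belief
print("expected costs:", expected, "-> choose action", int(np.argmin(expected)))

# Bayesian belief update after a range reading suggesting "free";
# the likelihoods P(reading | state) are assumed sensor characteristics.
likelihood = np.array([0.9, 0.2])
belief = likelihood * belief
belief /= belief.sum()
print("updated belief:", belief)
```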