4,696 research outputs found

    Neural Network Fusion of Color, Depth and Location for Object Instance Recognition on a Mobile Robot

    The development of mobile robots for domestic assistance requires solving problems that integrate ideas from different fields of research, such as computer vision, robotic manipulation, localization and mapping. Semantic mapping, that is, the enrichment of a map with high-level information like room and object identities, is an example of such a complex robotic task. Solving this task requires taking into account hard software and hardware constraints brought by the context of autonomous mobile robots, where short processing times and low energy consumption are mandatory. We present a lightweight scene segmentation and object instance recognition algorithm using an RGB-D camera and demonstrate it in a semantic mapping experiment. Our method uses a feed-forward neural network to fuse texture, color and depth information. Running at 3 Hz on a single laptop computer, our algorithm achieves a recognition rate of 97% in a controlled environment and 87% in the adversarial conditions of a real robotic task. Our results demonstrate that state-of-the-art recognition rates on a database do not guarantee performance in a real-world experiment. We also show the benefit, in these conditions, of fusing several recognition decisions and data from different sources. The database we compiled for the purpose of this study is publicly available.
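    The abstract describes the fusion step only at a high level. The sketch below is purely illustrative (not the authors' code): it shows one way a feed-forward network could fuse pre-computed texture, color and depth descriptors by simple concatenation. All feature dimensions, layer sizes and names are assumptions.

```python
# Hypothetical sketch: concatenation-based fusion of per-segment descriptors,
# followed by a small feed-forward classifier over object instances.
import torch
import torch.nn as nn

class FusionNet(nn.Module):
    def __init__(self, texture_dim=128, color_dim=64, depth_dim=64, num_instances=50):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(texture_dim + color_dim + depth_dim, 256),
            nn.ReLU(),
            nn.Linear(256, num_instances),
        )

    def forward(self, texture, color, depth):
        # Concatenate the descriptors of one segmented region and classify it.
        fused = torch.cat([texture, color, depth], dim=-1)
        return self.net(fused)

# Example: one segmented region described by three feature vectors.
logits = FusionNet()(torch.randn(1, 128), torch.randn(1, 64), torch.randn(1, 64))
predicted_instance = logits.argmax(dim=-1)
```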

    Co-Fusion: Real-time Segmentation, Tracking and Fusion of Multiple Objects

    In this paper we introduce Co-Fusion, a dense SLAM system that takes a live stream of RGB-D images as input and segments the scene into different objects (using either motion or semantic cues) while simultaneously tracking and reconstructing their 3D shape in real time. We use a multiple-model fitting approach where each object can move independently from the background and still be effectively tracked, its shape fused over time using only the information from pixels associated with that object label. Previous attempts to deal with dynamic scenes have typically treated moving regions as outliers, and consequently did not model their shape or track their motion over time. In contrast, we enable the robot to maintain 3D models for each of the segmented objects and to improve them over time through fusion. As a result, our system can enable a robot to maintain a scene description at the object level, which has the potential to allow interactions with its working environment, even in the case of dynamic scenes.
    Comment: International Conference on Robotics and Automation (ICRA) 2017, http://visual.cs.ucl.ac.uk/pubs/cofusion, https://github.com/martinruenz/co-fusion
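    To make the per-object fusion idea concrete, here is a hypothetical, heavily simplified sketch (not the Co-Fusion implementation): each segmented label owns its own model and is updated only from points carrying that label. The real system additionally tracks a 6-DoF pose per object and fuses surfels rather than raw points; the names and data layout below are assumptions.

```python
# Hypothetical sketch: route each labelled 3D point to the model of its object.
import numpy as np

class ObjectModel:
    def __init__(self):
        self.points = []          # accumulated 3D points for this object
        self.pose = np.eye(4)     # object-to-world pose, tracked independently

    def fuse(self, points_world):
        self.points.append(points_world)

def integrate_frame(models, points_world, labels):
    """Fuse one frame: each object label updates only its own model."""
    for label in np.unique(labels):
        models.setdefault(label, ObjectModel()).fuse(points_world[labels == label])
    return models

# Example: 1000 back-projected points, segmented into background (0) and two objects.
pts = np.random.rand(1000, 3)
lbl = np.random.randint(0, 3, size=1000)
models = integrate_frame({}, pts, lbl)
```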