35 research outputs found

    Multi-level mapping: Real-time dense monocular SLAM

    We present a method for Simultaneous Localization and Mapping (SLAM) using a monocular camera that is capable of reconstructing dense 3D geometry online without the aid of a graphics processing unit (GPU). Our key contribution is a multi-resolution depth estimation and spatial smoothing process that exploits the correlation between low-texture image regions and simple planar structure to adaptively scale the complexity of the generated keyframe depthmaps to the texture of the input imagery. High-texture image regions are represented at higher resolutions to capture fine detail, while low-texture regions are represented at coarser resolutions for smooth surfaces. The computational savings enabled by this approach allow for significantly increased reconstruction density and quality compared to the state of the art. The increased depthmap density also improves tracking performance, as more constraints can contribute to the pose estimation. A video of experimental results is available at http://groups.csail.mit.edu/rrg/multi_level_mapping. This work was supported by a Charles Stark Draper Laboratory Research Fellowship.
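
    The adaptive-resolution idea above can be illustrated with a minimal Python/NumPy sketch (not the authors' implementation): divide a keyframe into blocks, measure local gradient energy as a texture proxy, and assign each block a coarser or finer depth-estimation level. The block size and thresholds below are illustrative assumptions.

        import numpy as np

        def texture_levels(gray, block=32, thresholds=(4.0, 12.0)):
            """Assign a depth-estimation level to each block of a grayscale image.

            Level 0 = coarsest (low texture, assumed smooth/planar),
            level 2 = finest (high texture, fine geometric detail).
            The thresholds on mean gradient magnitude are illustrative values.
            """
            img = gray.astype(np.float32)
            # Simple finite-difference gradients as a texture measure.
            gx = np.zeros_like(img)
            gy = np.zeros_like(img)
            gx[:, 1:] = np.diff(img, axis=1)
            gy[1:, :] = np.diff(img, axis=0)
            mag = np.hypot(gx, gy)

            h, w = gray.shape
            levels = np.zeros((h // block, w // block), dtype=np.int32)
            for by in range(h // block):
                for bx in range(w // block):
                    patch = mag[by*block:(by+1)*block, bx*block:(bx+1)*block]
                    levels[by, bx] = int(np.searchsorted(thresholds, patch.mean()))
            return levels  # sample depth more coarsely where levels == 0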

    Visual Simultaneous Localization and Mapping for a tree climbing robot

    This work addresses the problem of generating a 3D mesh grid model of a tree by a climbing robot for tree inspection. In order to generate a consistent model of the tree while climbing, the robot needs to track its own location while building the model, so we treat this as an instance of the Simultaneous Localization and Mapping problem. A monocular-camera-based Visual Simultaneous Localization and Mapping (VSLAM) algorithm was adopted to map the features on the tree. A multi-scale, grid-based FAST feature detector combined with Lucas-Kanade optical flow was used to extract features from the tree. An inverse depth representation of features was selected to seamlessly handle newly initialized features. The camera and feature states, along with their covariances, are managed in an Extended Kalman Filter. In our VSLAM implementation we attempt to track a large number of features, and from the sparse spatial distribution of features obtained from the Extended Kalman Filter we generate a 3D mesh grid model with the help of an unordered triangle fitting algorithm. The implementation was written in C++ using Eigen, OpenCV and the Point Cloud Library, with a multi-threaded software design. The algorithm was evaluated with image sets from trees susceptible to the Asian Longhorned Beetle.
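
    The front end described above, a grid-based FAST detector combined with Lucas-Kanade optical flow, can be sketched with OpenCV's Python bindings (the thesis implementation is multi-threaded C++ with Eigen, OpenCV and PCL); the grid layout and detector parameters here are illustrative assumptions, not those of the thesis.

        import cv2
        import numpy as np

        def detect_grid_fast(gray, grid=(4, 4), per_cell=20, threshold=20):
            """Detect FAST corners per grid cell so features cover the whole image."""
            fast = cv2.FastFeatureDetector_create(threshold=threshold)
            h, w = gray.shape
            ch, cw = h // grid[0], w // grid[1]
            pts = []
            for r in range(grid[0]):
                for c in range(grid[1]):
                    cell = np.ascontiguousarray(gray[r*ch:(r+1)*ch, c*cw:(c+1)*cw])
                    kps = sorted(fast.detect(cell, None), key=lambda k: -k.response)[:per_cell]
                    pts += [(k.pt[0] + c*cw, k.pt[1] + r*ch) for k in kps]
            return np.array(pts, dtype=np.float32).reshape(-1, 1, 2)

        def track_lk(prev_gray, gray, prev_pts):
            """Track features between frames with pyramidal Lucas-Kanade optical flow."""
            nxt, status, _ = cv2.calcOpticalFlowPyrLK(prev_gray, gray, prev_pts, None,
                                                      winSize=(21, 21), maxLevel=3)
            good = status.ravel() == 1
            return prev_pts[good], nxt[good]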

    Local Accuracy and Global Consistency for Efficient SLAM

    This thesis is concerned with the problem of Simultaneous Localisation and Mapping (SLAM) using visual data only. Given the video stream of a moving camera, we wish to estimate the structure of the environment and the motion of the device as accurately as possible and in real time. Two effective approaches have been presented in the past. Filtering methods marginalise out past poses and summarise the information gained over time with a probability distribution. Keyframe methods rely on the optimisation approach of bundle adjustment, but computationally must select only a small number of past frames to process. We perform a rigorous comparison between the two approaches for visual SLAM. In particular, we show that accuracy comes from a large number of points, while the number of intermediate frames has only a minor impact. We conclude that keyframe bundle adjustment is superior to filtering due to its smaller computational cost. Based on these experimental results, we develop an efficient framework for large-scale visual SLAM using the keyframe strategy. We demonstrate that SLAM using a single camera drifts not only in rotation and translation, but also in scale. In particular, we perform large-scale loop closure correction using a novel variant of pose-graph optimisation which also takes scale drift into account. Starting from this two-stage approach, which tackles local motion estimation and loop closures separately, we develop a unified framework for real-time visual SLAM. By employing a novel double window scheme, we present a constant-time approach which enables the local accuracy of bundle adjustment while ensuring global consistency. Furthermore, we suggest a new scheme for local registration using metric loop closures and present several improvements for the visual front-end of SLAM. Our contributions are evaluated exhaustively on a number of synthetic experiments and real-image datasets from single cameras and range imaging devices.
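
    Scale-drift-aware pose-graph optimisation works over 7-DoF similarity transforms Sim(3) rather than rigid-body poses, so each keyframe carries a scale in addition to rotation and translation. The Python sketch below shows what a single Sim(3) pose-graph edge residual looks like; it is an illustration of the idea only, not the thesis code, and the optimiser that minimises these residuals over the whole graph is omitted.

        import numpy as np

        def sim3(R, t, s):
            """A similarity transform acting as x -> s * R @ x + t."""
            return (np.asarray(R, dtype=float), np.asarray(t, dtype=float), float(s))

        def sim3_inv(T):
            R, t, s = T
            return (R.T, -(R.T @ t) / s, 1.0 / s)

        def sim3_mul(A, B):
            Ra, ta, sa = A
            Rb, tb, sb = B
            return (Ra @ Rb, sa * (Ra @ tb) + ta, sa * sb)

        def rotation_angle(R):
            """Rotation angle of a rotation matrix, in radians."""
            c = np.clip((np.trace(R) - 1.0) / 2.0, -1.0, 1.0)
            return np.arccos(c)

        def edge_residual(T_i, T_j, T_ij_measured):
            """Residual of one pose-graph edge: how far the estimated relative
            similarity inv(T_i) * T_j is from the measured constraint.
            Returns (rotation error [rad], translation error, log-scale error);
            note the scale term, which a rigid-body SE(3) edge does not have."""
            T_est = sim3_mul(sim3_inv(T_i), T_j)
            E = sim3_mul(sim3_inv(T_ij_measured), T_est)
            R_e, t_e, s_e = E
            return rotation_angle(R_e), np.linalg.norm(t_e), abs(np.log(s_e))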

    Change Detection using Models derived from Point Clouds

    This thesis examines the detection of geometric changes in 3D data based on models derived from point clouds. The process chain consists of the registration of point clouds, the derivation of a model and the detection of changes in these models. For registration, AprilTags are used, which, in combination with the trajectories of the respective measuring runs, allow the estimation of a transformation between two local coordinate systems without a preceding assessment of the tag position. The point cloud is downsampled to a uniform density for faster processing, and normals and curvatures are calculated for the remaining points. A region growing process uses this additional information to divide the point clouds into planar and non-planar areas. The former are modeled as planes through simplification and meshing using a modified quadtree structure. The non-planar points are clustered by supervoxels and the extracted clusters are approximated by axis-aligned bounding boxes. Both planar and non-planar modeling is performed for all datasets that are to be compared. The bounding boxes are used for the detection of changes: due to the axis alignment, the box model of a dataset can easily be examined for intersections with a second dataset, and the intersections allow the detection of changed areas in the derived models. The methods proposed are suitable for both indoor and outdoor applications, provided that the changed objects are well separated and the compared datasets cover overlapping areas. The accuracy depends on the chosen size of the bounding boxes as well as the size of the changed objects. The evaluation has also shown that the proposed method can be integrated into a real-time-capable system.
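
    The axis alignment of the bounding boxes is what makes the change query cheap: two AABBs overlap exactly when their intervals overlap on every axis. A minimal Python sketch of this test follows (illustrative only, not the thesis implementation; reporting new boxes that overlap no old box as candidate changes is one simple interpretation of the comparison).

        import numpy as np

        def aabbs_intersect(min_a, max_a, min_b, max_b):
            """True if two axis-aligned bounding boxes overlap.

            Each box is given by its minimum and maximum corner (length-3 arrays).
            Because the boxes are axis-aligned, overlap reduces to a per-axis
            interval test, which keeps comparisons over many boxes cheap.
            """
            min_a, max_a = np.asarray(min_a), np.asarray(max_a)
            min_b, max_b = np.asarray(min_b), np.asarray(max_b)
            return bool(np.all(min_a <= max_b) and np.all(min_b <= max_a))

        def changed_boxes(boxes_old, boxes_new):
            """Boxes of the new dataset intersecting no box of the old dataset
            are reported as candidate changes (brute force, for clarity)."""
            changes = []
            for mn_n, mx_n in boxes_new:
                if not any(aabbs_intersect(mn_o, mx_o, mn_n, mx_n)
                           for mn_o, mx_o in boxes_old):
                    changes.append((mn_n, mx_n))
            return changes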

    Multi-resolution mapping and planning for UAV navigation in attitude constrained environments

    In this thesis we aim to bridge the gap between high-quality map reconstruction and SE(3) motion planning for Unmanned Aerial Vehicles (UAVs) in challenging environments with narrow openings, such as disaster areas, where attitude must be considered. We propose an efficient system that leverages the concept of adaptive-resolution volumetric mapping, which naturally integrates with the hierarchical decomposition of space in an octree data structure. Instead of a Truncated Signed Distance Function (TSDF), we adopt mapping of occupancy probabilities in log-odds representation, which allows representation of both surfaces and the entire observed free space, as opposed to unobserved space. We introduce a method for choosing resolution on the fly, in real time, by means of a multi-scale max-min pooling of the input depth image. The notion of explicit free-space mapping, paired with the spatial hierarchy in the data structure and the adaptive map resolution, allows for collision queries, as needed for robot motion planning, at unprecedented speed. Our mapping strategy supports pinhole cameras as well as spherical sensor models. Additionally, we introduce a first-of-its-kind global minimum-cost path search method based on A* that considers attitude along the path; state-of-the-art methods incorporate attitude only in the refinement stage. To make the problem tractable, our method exploits an adaptive, coarse-to-fine approach using global and local A* runs, plus an efficient method to introduce the UAV attitude into the process. We integrate our method with an SE(3) trajectory optimisation method based on a safe flight corridor, yielding a complete path planning pipeline. We quantitatively evaluate our mapping strategy in terms of mapping accuracy, memory, runtime performance, and planning performance, showing improvements over the state of the art, particularly in cases requiring high-resolution maps. Furthermore, extensive evaluation is undertaken using the AirSim flight simulator under closed-loop control in a set of randomised maps, allowing us to quantitatively assess our path initialisation method. We show that it achieves significantly higher success rates than the baselines, at a reduced computational burden.
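
    Storing occupancy in log-odds form means each voxel keeps the log of an odds ratio, so fusing an observation is a clamped addition and the probability is recovered with a sigmoid. A minimal per-voxel Python sketch of this standard bookkeeping follows (the increments and clamping bounds are illustrative assumptions; the octree, adaptive resolution and sensor models of the thesis are omitted).

        import math

        # Illustrative log-odds increments for "hit" and "miss" observations,
        # and clamping bounds that keep the map responsive to later changes.
        L_HIT, L_MISS = 0.85, -0.4
        L_MIN, L_MAX = -2.0, 3.5

        def update_logodds(l, hit):
            """Fuse one observation into a voxel's log-odds occupancy value."""
            l += L_HIT if hit else L_MISS
            return min(max(l, L_MIN), L_MAX)

        def occupancy_probability(l):
            """Convert log-odds back to an occupancy probability in [0, 1]."""
            return 1.0 / (1.0 + math.exp(-l))

        # A voxel observed as occupied three times, then once as free:
        l = 0.0                              # log-odds 0 == probability 0.5 (unknown)
        for hit in (True, True, True, False):
            l = update_logodds(l, hit)
        print(occupancy_probability(l))      # > 0.5: still believed occupied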