13 research outputs found

    UcoSLAM: Simultaneous Localization and Mapping by Fusion of KeyPoints and Squared Planar Markers

    Full text link
    This paper proposes a novel approach for Simultaneous Localization and Mapping by fusing natural and artificial landmarks. Most of the SLAM approaches use natural landmarks (such as keypoints). However, they are unstable over time, repetitive in many cases or insufficient for a robust tracking (e.g. in indoor buildings). On the other hand, other approaches have employed artificial landmarks (such as squared fiducial markers) placed in the environment to help tracking and relocalization. We propose a method that integrates both approaches in order to achieve long-term robust tracking in many scenarios. Our method has been compared to the start-of-the-art methods ORB-SLAM2 and LDSO in the public dataset Kitti, Euroc-MAV, TUM and SPM, obtaining better precision, robustness and speed. Our tests also show that the combination of markers and keypoints achieves better accuracy than each one of them independently.Comment: Paper submitted to Pattern Recognitio

    Vision-based Situational Graphs Generating Optimizable 3D Scene Representations

    Full text link
    3D scene graphs offer a more efficient representation of the environment by hierarchically organizing diverse semantic entities and the topological relationships among them. Fiducial markers, on the other hand, offer a valuable mechanism for encoding comprehensive information pertaining to environments and the objects within them. In the context of Visual SLAM (VSLAM), especially when the reconstructed maps are enriched with practical semantic information, these markers have the potential to enhance the map by augmenting valuable semantic information and fostering meaningful connections among the semantic objects. In this regard, this paper exploits the potential of fiducial markers to incorporate a VSLAM framework with hierarchical representations that generates optimizable multi-layered vision-based situational graphs. The framework comprises a conventional VSLAM system with low-level feature tracking and mapping capabilities bolstered by the incorporation of a fiducial marker map. The fiducial markers aid in identifying walls and doors in the environment, subsequently establishing meaningful associations with high-level entities, including corridors and rooms. Experimental results are conducted on a real-world dataset collected using various legged robots and benchmarked against a Light Detection And Ranging (LiDAR)-based framework (S-Graphs) as the ground truth. Consequently, our framework not only excels in crafting a richer, multi-layered hierarchical map of the environment but also shows enhancement in robot pose accuracy when contrasted with state-of-the-art methodologies.Comment: 7 pages, 6 figures, 2 table

    Hierarchical Visual SLAM based on Fiducial Markers

    Get PDF
    Fiducial markers can encode rich information about the environment and aid Visual SLAM (VSLAM) approaches in reconstructing maps with practical semantic information. Current marker-based VSLAM approaches mainly utilize markers for improving feature detection in low-feature environments and/or incorporating loop closure constraints, generating only low-level geometric maps of the environment prone to inaccuracies in complex environments. To bridge this gap, this paper presents a VSLAM approach utilizing a monocular camera and fiducial markers to generate hierarchical representations of the environment while improving the camera pose estimate. The proposed approach detects semantic entities from the surroundings, including walls, corridors, and rooms encoded within markers, and appropriately adds topological constraints among them. Experimental results on a real-world dataset demonstrate that the proposed approach outperforms a traditional marker-based VSLAM baseline in terms of accuracy, despite adding new constraints while creating enhanced map representations. Furthermore, it shows satisfactory results when comparing the reconstructed map quality to the one reconstructed using a LiDAR SLAM approach

    Flexible body scanning without template models

    No full text
    The apparition of low-cost depth cameras has lead to the development of several reconstruction methods that work well with rigid objects, but tend to fail when used to manually scan a standing person. Specific methods for body scanning have been proposed, but they have some ad-hoc requirements that make them unsuitable in a wide range of applications: they either require rotation platforms, multiple sensors and a priori template model. Scanning a person with a hand-held low-cost depth camera is still a challenging unsolved problem. This work proposes a novel solution to easily scan standing persons by combining depth information with fiducial markers without using a template model. In our approach, a set of markers placed in the ground are used to improve camera tracking by a novel algorithm that fuses depth information with the known location of the markers. The proposed method analyzes the video sequence and automatically divides it into fragments that are employed to build partial overlapping scans of the subject. Then, a registration step (both rigid and non-rigid) is applied to create a final mesh of the scanned subject. The proposed method has been compared with the state-of-the-art KinectFusion [1], ElasticFusion [2], ORB-SLAM [3, 4], and BundleFusion [5] methods, exhibiting superior performance

    Camera Localization in Outdoor Garden Environments Using Artificial Landmarks

    Get PDF
    In this paper, we present an outdoor monocular camera localization system based on artificial markers and test its performance in one of the test gardens of the TrimBot2020 project, in Wageningen. We use ArUco markers to construct a map of the environment and to subsequently localize the camera position within it. We combine the localization algorithm based on ArUco with a Kalman filter to smooth the trajectory and improve the localization stability with respect to fast movements of the camera, and blurred or noisy images. We recorded two sequences, with resolution 480p and l080p respectively, in the TrimBot2020 garden. We compare the localization performance of ArUco with a keypoint-based approach, namely ORB-SLAM2. We analyze and discuss the strengths and problems of both marker- and keypoint-based approaches on the considered sequences. The performed comparison suggests that the two approaches might be fused to jointly improve re-localization and reduce the drift in pose estimation

    Mixing body-parts model for 2D human pose estimation in stereo videos

    No full text
    This study targets 2D articulated human pose estimation (i.e. localisation of body limbs) in stereo videos. Although in recent years depth-based devices (e.g. Microsoft Kinect) have gained popularity, as they perform very well in controlled indoor environments (e.g. living rooms, operating theatres or gyms), they suffer clear problems in outdoor scenarios and, therefore, human pose estimation is still an interesting unsolved problem. The authors propose here a novel approach that is able to localise upper-body keypoints (i.e. shoulders, elbows, and wrists) in temporal sequences of stereo image pairs. The authors' method starts by locating and segmenting people in the image pairs by using disparity and appearance information. Then, a set of candidate body poses is computed for each view independently. Finally, temporal and stereo consistency is applied to estimate a final 2D pose. The authors' validate their model on three challenging datasets: stereo human pose estimation dataset', poses in the wild' and INRIA 3DMovie'. The experimental results show that the authors' model not only establishes new state-of-the-art results on stereo sequences, but also brings improvements in monocular sequences

    Simultaneous Multi-View Camera Pose Estimation and Object Tracking With Squared Planar Markers

    No full text
    Object tracking is a key aspect in many applications, such as augmented reality in medicine (e.g., tracking a surgical instrument) or robotics. Squared planar markers have become popular tools for tracking since their pose can be estimated from their four corners. While using a single marker and a single camera limits the working area considerably, using multiple markers attached to an object requires estimating their relative position, which is not trivial, for high-accuracy tracking. Likewise, using multiple cameras requires estimating their extrinsic parameters, also a tedious process that must be repeated whenever a camera is moved. This paper proposes a novel method to simultaneously solve the above-mentioned problems. From a video sequence showing a rigid set of planar markers recorded from multiple cameras, the proposed method is able to automatically obtain the three-dimensional configuration of the markers, the extrinsic parameters of the cameras, and the relative pose between the markers and the cameras at each frame. Our experiments show that our approach can obtain highly accurate results for estimating these parameters using the low-resolution cameras. Once the parameters are obtained, tracking of the object can be done in real time with a low computational cost. The proposed method is a step forward in the development of cost-effective solutions for object tracking

    Stereo Pictorial Structure for 2D articulated human pose estimation

    No full text
    In this paper, we consider the problem of 2D human pose estimation on stereo image pairs. In particular, we aim at estimating the location, orientation and scale of upper-body parts of people detected in stereo image pairs from realistic stereo videos that can be found in the Internet. To address this task, we propose a novel pictorial structure model to exploit the stereo information included in such stereo image pairs: the Stereo Pictorial Structure (SPS). To validate our proposed model, we contribute a new annotated dataset of stereo image pairs, the Stereo Human Pose Estimation Dataset (SHPED), obtained from YouTube stereoscopic video sequences, depicting people in challenging poses and diverse indoor and outdoor scenarios. The experimental results on SHPED indicates that SPS improves on state-of-the-art monocular models thanks to the appropriate use of the stereo information

    Global impact of the first coronavirus disease 2019 (COVID-19) pandemic wave on vascular services

    No full text
    This online structured survey has demonstrated the global impact of the COVID-19 pandemic on vascular services. The majority of centres have documented marked reductions in operating and services provided to vascular patients. In the months during recovery from the resource restrictions imposed during the pandemic peaks, there will be a significant vascular disease burden awaiting surgeons. One of the most affected specialtie
    corecore