
    Semantic Grid Estimation with Occupancy Grids and Semantic Segmentation Networks

    We propose a method to estimate the semantic grid for an autonomous vehicle. The semantic grid is a 2D bird's-eye-view map whose cells carry semantic labels such as road, car, pedestrian, and signage. We obtain the semantic grid by fusing semantic segmentation information with an occupancy grid computed using a Bayesian filter technique. To compute the semantic information from a monocular RGB image, we integrate deep segmentation networks into our model. We use a deep neural network to learn the relation between the semantic information and the occupancy grid, which can be trained end-to-end, extending our previous work on semantic grids. Furthermore, we investigate the effect of using a conditional random field to refine the results. Finally, we test our method on two datasets and compare different architecture types for semantic segmentation. We perform the experiments on the KITTI and Inria-Chroma datasets.
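    The core of this pipeline is a standard Bayesian occupancy filter combined with per-cell semantic scores. The sketch below illustrates that combination in its simplest log-odds form; the grid size, the inverse sensor model values, and the naive product fusion are placeholder assumptions, not the paper's learned fusion network.

```python
import numpy as np

# Minimal log-odds occupancy grid with a Bayesian update, plus a naive
# product fusion with per-cell semantic scores (illustrative only).

def logodds(p):
    return np.log(p / (1.0 - p))

class OccupancyGrid:
    def __init__(self, shape, p_prior=0.5):
        self.L0 = logodds(p_prior)
        self.L = np.full(shape, self.L0)   # log-odds of occupancy per cell

    def update(self, p_meas):
        # p_meas: inverse sensor model P(occupied | measurement) per cell
        self.L += logodds(p_meas) - self.L0

    def probability(self):
        return 1.0 / (1.0 + np.exp(-self.L))   # sigmoid of log-odds

grid = OccupancyGrid((100, 100))
grid.update(np.full((100, 100), 0.7))          # e.g. one range-sensor sweep
sem = np.random.rand(100, 100, 5)              # per-cell class scores
sem /= sem.sum(axis=-1, keepdims=True)
semantic_grid = sem * grid.probability()[..., None]  # naive product fusion
```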

    Semantic 3D Occupancy Mapping through Efficient High Order CRFs

    Semantic 3D mapping can be used for many applications, such as robot navigation and virtual interaction. In recent years there has been great progress in semantic segmentation and geometric 3D mapping, but combining the two tasks for accurate, large-scale semantic mapping from images remains challenging. In this paper, we propose an incremental and (near) real-time semantic mapping system. A 3D scrolling occupancy grid map is built to represent the world; it is memory- and computationally efficient and stays bounded in large-scale environments. We utilize the CNN segmentation as a prior prediction and further optimize the 3D grid labels through a novel CRF model. Superpixels are utilized to enforce smoothness and form robust P^N high-order potentials. An efficient mean-field inference is developed for the graph optimization. We evaluate our system on the KITTI dataset and improve the segmentation accuracy by 10% over existing systems. Comment: IROS 2017.
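    For intuition, a mean-field update for a grid-structured CRF with a plain Potts smoothness term fits in a few lines. This toy version only couples 4-connected grid cells; the superpixel-based robust P^N potentials from the paper are not implemented here, and the weight `w` and iteration count are arbitrary assumptions.

```python
import numpy as np

# Toy mean-field inference for a grid CRF with a Potts smoothness term
# between 4-connected cells. Illustrates the inference style only.

def mean_field(unary, w=1.0, iters=5):
    """unary: (H, W, K) negative log class probabilities from a CNN."""
    Q = np.exp(-unary)
    Q /= Q.sum(-1, keepdims=True)
    for _ in range(iters):
        msg = np.zeros_like(Q)
        msg[1:] += Q[:-1]; msg[:-1] += Q[1:]              # vertical pairs
        msg[:, 1:] += Q[:, :-1]; msg[:, :-1] += Q[:, 1:]  # horizontal pairs
        # Potts: pay w for each unit of neighbor mass on a different label
        energy = unary + w * (msg.sum(-1, keepdims=True) - msg)
        Q = np.exp(-energy)
        Q /= Q.sum(-1, keepdims=True)
    return Q.argmax(-1)

labels = mean_field(np.random.rand(64, 64, 5))   # (64, 64) label map
```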

    End-to-End Learning of Semantic Grid Estimation Deep Neural Network with Occupancy Grids

    We propose the semantic grid, a 2D spatial map of the environment around an autonomous vehicle whose cells represent the semantic information of the corresponding region, such as car, road, vegetation, and bikes. It integrates an occupancy grid, which computes the grid states with a Bayesian filter approach, with semantic segmentation information obtained from monocular RGB images with a deep neural network. The network fuses the two sources of information and can be trained in an end-to-end manner. The output of the neural network is refined with a conditional random field. The proposed method is tested on several datasets (KITTI, Inria-Chroma and SYNTHIA), and different deep neural network architectures are compared.
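    A minimal PyTorch sketch of such a fusion network is given below. The layer sizes, channel counts, and the assumption that the segmentation scores have already been projected into the bird's-eye-view grid are all illustrative; the paper's actual architecture is not reproduced.

```python
import torch
import torch.nn as nn

# Illustrative end-to-end fusion network: an occupancy grid (1 channel)
# stacked with segmentation scores projected into the bird's eye view
# (K channels). Layer sizes are placeholders, not the paper's design.

class SemanticGridNet(nn.Module):
    def __init__(self, num_classes=5, seg_channels=5):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1 + seg_channels, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, num_classes, 1),
        )

    def forward(self, occupancy, seg_bev):
        # occupancy: (B, 1, H, W); seg_bev: (B, K, H, W)
        return self.net(torch.cat([occupancy, seg_bev], dim=1))

model = SemanticGridNet()
logits = model(torch.rand(2, 1, 64, 64), torch.rand(2, 5, 64, 64))
# logits: (2, num_classes, 64, 64); trainable with per-cell cross-entropy
```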

    Object Detection and Classification in Occupancy Grid Maps using Deep Convolutional Networks

    Detailed environment perception is a crucial component of automated vehicles. However, to deal with the amount of perceived information, we also require segmentation strategies. Based on a grid-map environment representation, well suited for sensor fusion, free-space estimation and machine learning, we detect and classify objects using deep convolutional neural networks. As input to our networks we use a multi-layer grid map that efficiently encodes 3D range sensor information. The inference output is a list of rotated bounding boxes with associated semantic classes. We conduct extensive ablation studies, highlight important design considerations when using grid maps, and evaluate our models on the KITTI Bird's Eye View benchmark. Qualitative and quantitative benchmark results show that we achieve robust detection and state-of-the-art accuracy using only top-view grid maps from range sensor data. Comment: 6 pages, 4 tables, 4 figures.
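    The multi-layer grid-map encoding can be pictured as binning range-sensor points into a few top-view channels. The sketch below uses three hypothetical layers (point density, maximum height, mean intensity) with an arbitrary 40 m extent at 0.5 m resolution; the paper's exact layer set is not reproduced.

```python
import numpy as np

# Sketch of encoding a lidar point cloud into a multi-layer top-view grid.
# The layer choices, extent and resolution are assumptions for illustration.

def pointcloud_to_grid(points, intensity, extent=40.0, res=0.5):
    """points: (N, 3) x/y/z in meters; intensity: (N,) reflectance."""
    n = int(2 * extent / res)
    grid = np.zeros((3, n, n))                 # density, max z, intensity
    ix = ((points[:, 0] + extent) / res).astype(int)
    iy = ((points[:, 1] + extent) / res).astype(int)
    keep = (ix >= 0) & (ix < n) & (iy >= 0) & (iy < n)
    for x, y, z, i in zip(ix[keep], iy[keep], points[keep, 2], intensity[keep]):
        grid[0, y, x] += 1.0                   # hit count per cell
        grid[1, y, x] = max(grid[1, y, x], z)  # empty cells stay at 0
        grid[2, y, x] += i
    hit = grid[0] > 0
    grid[2][hit] /= grid[0][hit]               # mean intensity where hit
    return grid

bev = pointcloud_to_grid(np.random.randn(1000, 3) * 10, np.random.rand(1000))
```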