Search CORE

20,766 research outputs found

The Cityscapes Dataset for Semantic Urban Scene Understanding

Author: Benenson Rodrigo
Cordts Marius
Enzweiler Markus
Franke Uwe
Omran Mohamed
Ramos Sebastian
Rehfeld Timo
Roth Stefan
Schiele Bernt
Publication venue
Publication date: 01/01/2016
Field of study

Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling. Cityscapes is comprised of a large, diverse set of stereo video sequences recorded in streets from 50 different cities. 5000 of these images have high quality pixel-level annotations; 20000 additional images have coarse annotations to enable methods that leverage large volumes of weakly-labeled data. Crucially, our effort exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity. Our accompanying empirical study provides an in-depth analysis of the dataset characteristics, as well as a performance evaluation of several state-of-the-art approaches based on our benchmark.Comment: Includes supplemental materia

arXiv.org e-Print Archive

Object segmentation in depth maps with one user click and a synthetically trained fully convolutional network

Author: A Rozantsev
Bernardino Romera-Paredes
C. Lawrence Zitnick
Nathan Silberman
P Arbeláez
Pedro O. Pinheiro
Tsung-Yi Lin
Publication venue
Publication date: 06/11/2017
Field of study

With more and more household objects built on planned obsolescence and consumed by a fast-growing population, hazardous waste recycling has become a critical challenge. Given the large variability of household waste, current recycling platforms mostly rely on human operators to analyze the scene, typically composed of many object instances piled up in bulk. Helping them by robotizing the unitary extraction is a key challenge to speed up this tedious process. Whereas supervised deep learning has proven very efficient for such object-level scene understanding, e.g., generic object detection and segmentation in everyday scenes, it however requires large sets of per-pixel labeled images, that are hardly available for numerous application contexts, including industrial robotics. We thus propose a step towards a practical interactive application for generating an object-oriented robotic grasp, requiring as inputs only one depth map of the scene and one user click on the next object to extract. More precisely, we address in this paper the middle issue of object seg-mentation in top views of piles of bulk objects given a pixel location, namely seed, provided interactively by a human operator. We propose a twofold framework for generating edge-driven instance segments. First, we repurpose a state-of-the-art fully convolutional object contour detector for seed-based instance segmentation by introducing the notion of edge-mask duality with a novel patch-free and contour-oriented loss function. Second, we train one model using only synthetic scenes, instead of manually labeled training data. Our experimental results show that considering edge-mask duality for training an encoder-decoder network, as we suggest, outperforms a state-of-the-art patch-based network in the present application context.Comment: This is a pre-print of an article published in Human Friendly Robotics, 10th International Workshop, Springer Proceedings in Advanced Robotics, vol 7. The final authenticated version is available online at: https://doi.org/10.1007/978-3-319-89327-3\_16, Springer Proceedings in Advanced Robotics, Siciliano Bruno, Khatib Oussama, In press, Human Friendly Robotics, 10th International Workshop,

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

FieldSAFE: Dataset for Obstacle Detection in Agriculture

Author: Christiansen Peter
Green Ole
Jørgensen Rasmus Nyholm
Karstoft Henrik
Kragh Mikkel Fly
Larsen Morten
Laursen Morten Stigaard
Steen Kim Arild
Publication venue: 'MDPI AG'
Publication date: 11/09/2017
Field of study

In this paper, we present a novel multi-modal dataset for obstacle detection in agriculture. The dataset comprises approximately 2 hours of raw sensor data from a tractor-mounted sensor system in a grass mowing scenario in Denmark, October 2016. Sensing modalities include stereo camera, thermal camera, web camera, 360-degree camera, lidar, and radar, while precise localization is available from fused IMU and GNSS. Both static and moving obstacles are present including humans, mannequin dolls, rocks, barrels, buildings, vehicles, and vegetation. All obstacles have ground truth object labels and geographic coordinates.Comment: Submitted to special issue of MDPI Sensors: Sensors in Agricultur

arXiv.org e-Print Archive

Directory of Open Access Journals

Recommended from our members

A database and challenge for acoustic scene classification and event detection

Author: Benetos E.
Giannoulis D.
Lagrange M.
Plumbley M. D.
Rossignol M.
Stowell D.
Publication venue
Publication date: 01/01/2013
Field of study

City Research Online

Agent Behaviour Simulator (ABS):a platform for urban behaviour development

Author: Chrysanthou Yiorgos
Dalton Ruth
Loscos Céline
Tecchia Franco
Publication venue
Publication date: 01/01/2001
Field of study

Computer Graphics have become important for many applicationsand the quality of the produced images have greatly improved. Oneof the interesting remaining problems is the representation of densedynamic environments such as populated cities. Although recentlywe saw some successfulwork on the rendering such environments,the real?time simulation of virtual cities populated by thousands ofintelligent animated agents is still very challenging.In this paperwe describe a platformthat aims to accelerate the developmentof agent behaviours. The platform makes it easy to enterlocal rules and callbacks which govern the individual behaviours.It automatically performs the routine tasks such as collision detectionallowing the user to concentrate on defining the more involvedtasks. The platform is based on a 2D-grid with a four-layered structure.The two first layers are used to compute the collision detectionagainst the environment and other agents and the last two are usedfor more complex behaviours.A set of visualisation tools is incorporated that allows the testingof the real?time simulation. The choices made for the visualisationallow the user to better understand the way agents move inside theworld and how they take decisions, so that the user can evaluate ifit simulates the expected behaviour.Experimentation with the system has shown that behaviours inenvironments with thousands of agents can be developed and visualisedin effortlessly

CiteSeerX

Lancaster E-Prints