Search CORE

39,177 research outputs found

Extended Pixel Representation for Image Segmentation

Author: Girish Deeptha
Ralescu Anca
Singh Vineeta
Publication venue: eCommons
Publication date: 23/04/2016
Field of study

We explore the use of extended pixel representation for color based image segmentation using the K-means clustering algorithm. Various extended pixel representations have been implemented in this paper and their results have been compared. By extending the representation of pixels an image is mapped to a higher dimensional space. Unlike other approaches, where data is mapped into an implicit features space of higher dimension (kernel methods), in the approach considered here, the higher dimensions are defined explicitly. Preliminary experimental results which illustrate the proposed approach are promising

University of Dayton

Training of Convolutional Networks on Multiple Heterogeneous Datasets for Street Scene Semantic Segmentation

Author: Dubbelman Gijs
Meletis Panagiotis
Publication venue
Publication date: 01/01/2018
Field of study

We propose a convolutional network with hierarchical classifiers for per-pixel semantic segmentation, which is able to be trained on multiple, heterogeneous datasets and exploit their semantic hierarchy. Our network is the first to be simultaneously trained on three different datasets from the intelligent vehicles domain, i.e. Cityscapes, GTSDB and Mapillary Vistas, and is able to handle different semantic level-of-detail, class imbalances, and different annotation types, i.e. dense per-pixel and sparse bounding-box labels. We assess our hierarchical approach, by comparing against flat, non-hierarchical classifiers and we show improvements in mean pixel accuracy of 13.0% for Cityscapes classes and 2.4% for Vistas classes and 32.3% for GTSDB classes. Our implementation achieves inference rates of 17 fps at a resolution of 520x706 for 108 classes running on a GPU.Comment: IEEE Intelligent Vehicles 201

arXiv.org e-Print Archive

Repository TU/e

Crossref

Pure OAI Repository

Superpixel Convolutional Networks using Bilateral Inceptions

Author: A Adams
ES Gastal
J Domke
JB Tenenbaum
K He
M Kiefel
R Achanta
S Gould
S He
S Nowozin
S Paris
T-Y Lin
Publication venue
Publication date: 08/08/2016
Field of study

In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new 'bilateral inception' module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagation techniques. The bilateral inception module addresses two issues that arise with general CNN segmentation architectures. First, this module propagates information between (super) pixels while respecting image edges, thus using the structured information of the problem for improved results. Second, the layer recovers a full resolution segmentation result from the lower resolution solution of a CNN. In the experiments, we modify several existing CNN architectures by inserting our inception module between the last CNN (1x1 convolution) layers. Empirical results on three different datasets show reliable improvements not only in comparison to the baseline networks, but also in comparison to several dense-pixel prediction techniques such as CRFs, while being competitive in time.Comment: European Conference on Computer Vision (ECCV), 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM