Search CORE

2,567 research outputs found

Exploring Context with Deep Structured models for Semantic Segmentation

Author: Hengel Anton van den
Lin Guosheng
Reid Ian
Shen Chunhua
Publication venue
Publication date: 01/01/2017
Field of study

State-of-the-art semantic image segmentation methods are mostly based on training deep convolutional neural networks (CNNs). In this work, we proffer to improve semantic segmentation with the use of contextual information. In particular, we explore `patch-patch' context and `patch-background' context in deep CNNs. We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions. Specifically, we formulate CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied in order to avoid repeated expensive CRF inference during the course of back propagation. For capturing the patch-background context, we show that a network design with traditional multi-scale image inputs and sliding pyramid pooling is very effective for improving performance. We perform comprehensive evaluation of the proposed method. We achieve new state-of-the-art performance on a number of challenging semantic segmentation datasets including

NYUDv2

PASCAL

VOC2012

Cityscapes

PASCAL

Context

SUN

RGBD

SIFT

flow

, and

KITTI

datasets. Particularly, we report an intersection-over-union score of

77.8

on the

PASCAL

VOC2012

dataset.Comment: 16 pages. Accepted to IEEE T. Pattern Analysis & Machine Intelligence, 2017. Extended version of arXiv:1504.0101

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding

Author: Brust Clemens-Alexander
Denzler Joachim
Rodner Erik
Sickert Sven
Simon Marcel
Publication venue
Publication date: 01/01/2015
Field of study

Classifying single image patches is important in many different applications, such as road detection or scene understanding. In this paper, we present convolutional patch networks, which are convolutional networks learned to distinguish different image patches and which can be used for pixel-wise labeling. We also show how to incorporate spatial information of the patch as an input to the network, which allows for learning spatial priors for certain categories jointly with an appearance model. In particular, we focus on road detection and urban scene understanding, two application areas where we are able to achieve state-of-the-art results on the KITTI as well as on the LabelMeFacade dataset. Furthermore, our paper offers a guideline for people working in the area and desperately wandering through all the painstaking details that render training CNs on image patches extremely difficult.Comment: VISAPP 2015 pape

arXiv.org e-Print Archive

CiteSeerX

Crossref