
arXiv.org e-Print Archive

Semantic Video CNNs through Representation Warping

Author: Gadde Raghudeep
Gehler Peter V.
Jampani Varun
Publication date: 01/01/2017

In this work, we propose a technique to convert CNN models for semantic segmentation of static images into CNNs for video data. We describe a warping method that can be used to augment existing architectures with very little extra computational cost. This module is called NetWarp, and we demonstrate its use for a range of network architectures. The main design principle is to use the optical flow of adjacent frames to warp internal network representations across time. A key insight of this work is that fast optical flow methods can be combined with many different CNN architectures for improved performance and end-to-end training. Experiments validate that the proposed approach incurs only a small extra computational cost while improving performance when video streams are available. We achieve new state-of-the-art results on the CamVid and Cityscapes benchmark datasets and show consistent improvements over different baseline networks. Our code and models will be available at http://segmentation.is.tue.mpg.de
Comment: ICCV 201
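The warping idea in the abstract above can be illustrated with a minimal sketch. It is not the NetWarp module itself: it assumes a single-channel feature map stored as nested lists, a flow field that gives, for every current-frame pixel, the offset to its location in the previous frame, and a fixed weighted sum for combining the two frames. The function names and the combination weights are hypothetical.

```python
import math

def warp_features(prev_feat, flow):
    """Warp one channel of a previous-frame feature map into the current frame.

    prev_feat: H x W nested list of floats (hypothetical feature channel).
    flow: H x W nested list of (dx, dy) offsets pointing from each current-frame
          pixel to its source location in the previous frame (assumed convention).
    """
    h, w = len(prev_feat), len(prev_feat[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = flow[y][x]
            # Clamp the source location so sampling stays inside the map.
            sx = min(max(x + dx, 0.0), w - 1.0)
            sy = min(max(y + dy, 0.0), h - 1.0)
            x0, y0 = int(math.floor(sx)), int(math.floor(sy))
            x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
            ax, ay = sx - x0, sy - y0
            # Bilinear interpolation between the four surrounding values.
            top = (1 - ax) * prev_feat[y0][x0] + ax * prev_feat[y0][x1]
            bottom = (1 - ax) * prev_feat[y1][x0] + ax * prev_feat[y1][x1]
            out[y][x] = (1 - ay) * top + ay * bottom
    return out

def combine(curr_feat, warped_prev, w_curr=0.5, w_prev=0.5):
    """Blend current and warped previous features (weights are an assumption)."""
    return [[w_curr * c + w_prev * p for c, p in zip(row_c, row_p)]
            for row_c, row_p in zip(curr_feat, warped_prev)]

prev = [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
shift_right = [[(1.0, 0.0)] * 3 for _ in range(2)]
warped = warp_features(prev, shift_right)
```

Because bilinear sampling is differentiable with respect to the sampled values, a layer of this kind can in principle sit inside a network that is trained end to end, which is the property the abstract emphasizes.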

Learning Grammars for Architecture-Specific Facade Parsing

Author: Gadde Raghudeep
Marlet Renaud
Paragios Nikos
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/10/2015

Parsing facade images requires an optimal handcrafted grammar for a given class of buildings. Such a grammar is often designed manually by experts. In this paper, we present a novel framework to learn a compact grammar from a set of ground-truth images. To this end, parse trees of ground-truth annotated images are obtained by running existing inference algorithms with a simple, very general grammar. From these parse trees, repeated subtrees are sought and merged together to share derivations and produce a grammar with fewer rules. Furthermore, unsupervised clustering is performed on these rules, so that rules corresponding to the same complex pattern are grouped together, leading to a rich, compact grammar. Experimental validation and comparison with state-of-the-art grammar-based methods on four different datasets show that the learned grammar helps in much faster convergence while producing equally or more accurate parsing results compared to handcrafted grammars as well as grammars learned by other methods. In addition, we release a new dataset of facade images from Paris following the Art-deco style and demonstrate the general applicability and potential of the proposed framework.
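The subtree-sharing step described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the paper's algorithm: parse trees are represented as nested tuples of the form (label, child, child, ...), repeated subtrees are found by exact matching, and the clustering stage is omitted. The function names and the toy facade trees are hypothetical.

```python
from collections import Counter

def subtrees(tree):
    """Yield every subtree of a tree given as (label, child, child, ...)."""
    yield tree
    for child in tree[1:]:
        yield from subtrees(child)

def repeated_subtrees(trees, min_count=2):
    """Count internal subtrees that occur at least min_count times across all trees."""
    counts = Counter(t for tree in trees for t in subtrees(tree) if len(t) > 1)
    return {t: n for t, n in counts.items() if n >= min_count}

def extract_rules(trees):
    """Collect distinct production rules as (parent label, tuple of child labels)."""
    rules = set()
    for tree in trees:
        for t in subtrees(tree):
            if len(t) > 1:
                rules.add((t[0], tuple(child[0] for child in t[1:])))
    return rules

# Toy parse trees: the same floor layout appears three times.
floor = ("Floor", ("Wall",), ("Window",), ("Wall",))
facade_a = ("Facade", floor, floor, ("Roof",))
facade_b = ("Facade", floor, ("Roof",))

shared = repeated_subtrees([facade_a, facade_b])
rules = extract_rules([facade_a, facade_b])
```

In this toy case the floor layout is derived three times but contributes a single rule, which is the sense in which merging repeated subtrees yields a grammar with fewer rules.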

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

Efficient Facade Segmentation Using Auto-context

Author: Gadde Raghudeep
Gehler Peter
Jampani Varun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015

In this paper we propose a system for the problem of facade segmentation. Building facades are highly structured images, and consequently most methods that have been proposed for this problem aim to make use of this strong prior information. We describe a system that is almost domain independent and consists of standard segmentation methods. A sequence of boosted decision trees is stacked using auto-context features and learned using the stacked generalization technique. We find that this technique, albeit standard, performs better than, or equal to, all previously published empirical results on all available facade benchmark datasets. The proposed method is simple to implement, easy to extend, and very efficient at test-time inference.
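One way to picture the auto-context features mentioned in the abstract above: each stage of the stack sees the original per-pixel features together with the class probabilities that the previous stage predicted at nearby pixels. The sketch below only builds such a feature vector. The neighbourhood offsets, the single-class probability map, and the function name are assumptions made for illustration, and the boosted decision trees themselves are not implemented.

```python
def autocontext_features(features, prev_probs,
                         offsets=((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1))):
    """Append previous-stage probabilities at neighbouring pixels to each feature vector.

    features: H x W nested list of per-pixel feature lists.
    prev_probs: H x W nested list of probabilities from the previous stage
                (a single class is assumed to keep the sketch short).
    offsets: (dy, dx) neighbourhood pattern, a hypothetical choice.
    """
    h, w = len(prev_probs), len(prev_probs[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            context = []
            for dy, dx in offsets:
                # Clamp neighbours at the border to the nearest valid pixel.
                ny = min(max(y + dy, 0), h - 1)
                nx = min(max(x + dx, 0), w - 1)
                context.append(prev_probs[ny][nx])
            row.append(list(features[y][x]) + context)
        out.append(row)
    return out

image_feats = [[[1.0], [2.0]], [[3.0], [4.0]]]
stage1_probs = [[0.1, 0.2], [0.3, 0.4]]
stage2_inputs = autocontext_features(image_feats, stage1_probs)
```

A second-stage classifier would then be trained on `stage2_inputs`, and under stacked generalization the first-stage probabilities used for training would come from held-out predictions so that later stages do not see overconfident inputs.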

Efficient 2D and 3D Facade Segmentation using Auto-Context

Author: Gadde Raghudeep
Gehler Peter
Jampani Varun
Marlet Renaud
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2018

This paper introduces a fast and efficient segmentation technique for 2D images and 3D point clouds of building facades. Facades of buildings are highly structured, and consequently most methods that have been proposed for this problem aim to make use of this strong prior information. Contrary to most prior work, we describe a system that is almost domain independent and consists of standard segmentation methods. We train a sequence of boosted decision trees using auto-context features. This is learned using stacked generalization. We find that this technique performs better than, or comparably with, all previously published methods, and we present empirical results on all available 2D and 3D facade benchmark datasets. The proposed method is simple to implement, easy to extend, and very efficient at test-time inference.

Efficient 2D and 3D Facade Segmentation Using Auto-Context

Author: Peter V. Gehler
Raghudeep Gadde
Renaud Marlet
Varun Jampani
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

A MRF Shape Prior for Facade Parsing with Occlusions

Author: Gadde Raghudeep
Koziński Mateusz
Marlet Renaud
Obozinski Guillaume
Zagoruyko Sergey
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2015

We present a new shape prior formalism for the segmentation of rectified facade images. It combines the simplicity of split grammars with unprecedented expressive power: the capability of encoding simultaneous alignment in two dimensions, facade occlusions, and irregular boundaries between facade elements. We formulate the task of finding the most likely image segmentation conforming to a prior of the proposed form as a MAP-MRF problem over a 4-connected pixel grid, and propose an efficient optimization algorithm for solving it. Our method simultaneously segments the visible and occluding objects, and recovers the structure of the occluded facade. We demonstrate state-of-the-art results on a number of facade segmentation datasets.

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

Learning Grammars for Architecture-Specific Facade Parsing

Author: A D’Ulizia
BJ Frey
C Higuera De La
C Wang
D Comaniciu
DL Davies
E Mäkinen
J Nivre
JC Dunn
L Simon
M Charikar
M Kass
M Tomita
Nikos Paragios
O Teboul
P Miller
P Wonka
PJ Rousseeuw
R Achanta
Raghudeep Gadde
RC Carrasco
Renaud Marlet
RS Sutton
S Gould
S Osher
SB Cohen
T Cohn
V Kolmogorov
Y Chi
Z Si
Publication venue: 'Springer Science and Business Media LLC'