525 research outputs found
Segmentation-Aware Convolutional Networks Using Local Attention Masks
We introduce an approach to integrate segmentation information within a
convolutional neural network (CNN). This counter-acts the tendency of CNNs to
smooth information across regions and increases their spatial precision. To
obtain segmentation information, we set up a CNN to provide an embedding space
where region co-membership can be estimated based on Euclidean distance. We use
these embeddings to compute a local attention mask relative to every neuron
position. We incorporate such masks in CNNs and replace the convolution
operation with a "segmentation-aware" variant that allows a neuron to
selectively attend to inputs coming from its own region. We call the resulting
network a segmentation-aware CNN because it adapts its filters at each image
point according to local segmentation cues. We demonstrate the merit of our
method on two widely different dense prediction tasks, that involve
classification (semantic segmentation) and regression (optical flow). Our
results show that in semantic segmentation we can match the performance of
DenseCRFs while being faster and simpler, and in optical flow we obtain clearly
sharper responses than networks that do not use local attention masks. In both
cases, segmentation-aware convolution yields systematic improvements over
strong baselines. Source code for this work is available online at
http://cs.cmu.edu/~aharley/segaware
Superpixel Convolutional Networks using Bilateral Inceptions
In this paper we propose a CNN architecture for semantic image segmentation.
We introduce a new 'bilateral inception' module that can be inserted in
existing CNN architectures and performs bilateral filtering, at multiple
feature-scales, between superpixels in an image. The feature spaces for
bilateral filtering and other parameters of the module are learned end-to-end
using standard backpropagation techniques. The bilateral inception module
addresses two issues that arise with general CNN segmentation architectures.
First, this module propagates information between (super) pixels while
respecting image edges, thus using the structured information of the problem
for improved results. Second, the layer recovers a full resolution segmentation
result from the lower resolution solution of a CNN. In the experiments, we
modify several existing CNN architectures by inserting our inception module
between the last CNN (1x1 convolution) layers. Empirical results on three
different datasets show reliable improvements not only in comparison to the
baseline networks, but also in comparison to several dense-pixel prediction
techniques such as CRFs, while being competitive in time.Comment: European Conference on Computer Vision (ECCV), 201
Object Contour and Edge Detection with RefineContourNet
A ResNet-based multi-path refinement CNN is used for object contour
detection. For this task, we prioritise the effective utilization of the
high-level abstraction capability of a ResNet, which leads to state-of-the-art
results for edge detection. Keeping our focus in mind, we fuse the high, mid
and low-level features in that specific order, which differs from many other
approaches. It uses the tensor with the highest-levelled features as the
starting point to combine it layer-by-layer with features of a lower
abstraction level until it reaches the lowest level. We train this network on a
modified PASCAL VOC 2012 dataset for object contour detection and evaluate on a
refined PASCAL-val dataset reaching an excellent performance and an Optimal
Dataset Scale (ODS) of 0.752. Furthermore, by fine-training on the BSDS500
dataset we reach state-of-the-art results for edge-detection with an ODS of
0.824.Comment: Keywords: Object Contour Detection, Edge Detection, Multi-Path
Refinement CN
- …