Search CORE

6,095 research outputs found

Reversible Recursive Instance-level Object Segmentation

Author: Feng Jiashi
Jie Zequn
Liang Xiaodan
Lin Liang
Shen Xiaohui
Wei Yunchao
Yan Shuicheng
Publication venue
Publication date: 18/11/2015
Field of study

In this work, we propose a novel Reversible Recursive Instance-level Object Segmentation (R2-IOS) framework to address the challenging instance-level object segmentation task. R2-IOS consists of a reversible proposal refinement sub-network that predicts bounding box offsets for refining the object proposal locations, and an instance-level segmentation sub-network that generates the foreground mask of the dominant object instance in each proposal. By being recursive, R2-IOS iteratively optimizes the two sub-networks during joint training, in which the refined object proposals and improved segmentation predictions are alternately fed into each other to progressively increase the network capabilities. By being reversible, the proposal refinement sub-network adaptively determines an optimal number of refinement iterations required for each proposal during both training and testing. Furthermore, to handle multiple overlapped instances within a proposal, an instance-aware denoising autoencoder is introduced into the segmentation sub-network to distinguish the dominant object from other distracting instances. Extensive experiments on the challenging PASCAL VOC 2012 benchmark well demonstrate the superiority of R2-IOS over other state-of-the-art methods. In particular, the

\text{AP}^r

over

20

classes at

0.5

IoU achieves

66.7\%

, which significantly outperforms the results of

58.7\%

by PFN~\cite{PFN} and

46.3\%

by~\cite{liu2015multi}.Comment: 9 page

arXiv.org e-Print Archive

Crossref

Burst Denoising with Kernel Prediction Networks

Author: Barron Jonathan T.
Carroll Robert
Chen Jiawen
Mildenhall Ben
Ng Ren
Sharlet Dillon
Publication venue
Publication date: 29/03/2018
Field of study

We present a technique for jointly denoising bursts of images taken from a handheld camera. In particular, we propose a convolutional neural network architecture for predicting spatially varying kernels that can both align and denoise frames, a synthetic data generation approach based on a realistic noise formation model, and an optimization guided by an annealed loss function to avoid undesirable local minima. Our model matches or outperforms the state-of-the-art across a wide range of noise levels on both real and synthetic data.Comment: To appear in CVPR 2018 (spotlight). Project page: http://people.eecs.berkeley.edu/~bmild/kpn

arXiv.org e-Print Archive

Crossref

xUnit: Learning a Spatial Activation Function for Efficient Image Restoration

Author: Kligvasser Idan
Michaeli Tomer
Shaham Tamar Rott
Publication venue
Publication date: 25/03/2018
Field of study

In recent years, deep neural networks (DNNs) achieved unprecedented performance in many low-level vision tasks. However, state-of-the-art results are typically achieved by very deep networks, which can reach tens of layers with tens of millions of parameters. To make DNNs implementable on platforms with limited resources, it is necessary to weaken the tradeoff between performance and efficiency. In this paper, we propose a new activation unit, which is particularly suitable for image restoration problems. In contrast to the widespread per-pixel activation units, like ReLUs and sigmoids, our unit implements a learnable nonlinear function with spatial connections. This enables the net to capture much more complex features, thus requiring a significantly smaller number of layers in order to reach the same performance. We illustrate the effectiveness of our units through experiments with state-of-the-art nets for denoising, de-raining, and super resolution, which are already considered to be very small. With our approach, we are able to further reduce these models by nearly 50% without incurring any degradation in performance.Comment: Conference on Computer Vision and Pattern Recognition (CVPR), 201

arXiv.org e-Print Archive

Crossref