1,206 research outputs found
Effective Use of Dilated Convolutions for Segmenting Small Object Instances in Remote Sensing Imagery
Thanks to recent advances in CNNs, solid improvements have been made in
semantic segmentation of high resolution remote sensing imagery. However, most
of the previous works have not fully taken into account the specific
difficulties that exist in remote sensing tasks. One of such difficulties is
that objects are small and crowded in remote sensing imagery. To tackle with
this challenging task we have proposed a novel architecture called local
feature extraction (LFE) module attached on top of dilated front-end module.
The LFE module is based on our findings that aggressively increasing dilation
factors fails to aggregate local features due to sparsity of the kernel, and
detrimental to small objects. The proposed LFE module solves this problem by
aggregating local features with decreasing dilation factor. We tested our
network on three remote sensing datasets and acquired remarkably good results
for all datasets especially for small objects
DefectNET: multi-class fault detection on highly-imbalanced datasets
As a data-driven method, the performance of deep convolutional neural
networks (CNN) relies heavily on training data. The prediction results of
traditional networks give a bias toward larger classes, which tend to be the
background in the semantic segmentation task. This becomes a major problem for
fault detection, where the targets appear very small on the images and vary in
both types and sizes. In this paper we propose a new network architecture,
DefectNet, that offers multi-class (including but not limited to) defect
detection on highly-imbalanced datasets. DefectNet consists of two parallel
paths, which are a fully convolutional network and a dilated convolutional
network to detect large and small objects respectively. We propose a hybrid
loss maximising the usefulness of a dice loss and a cross entropy loss, and we
also employ the leaky rectified linear unit (ReLU) to deal with rare occurrence
of some targets in training batches. The prediction results show that our
DefectNet outperforms state-of-the-art networks for detecting multi-class
defects with the average accuracy improvement of approximately 10% on a wind
turbine
Fast Deep Matting for Portrait Animation on Mobile Phone
Image matting plays an important role in image and video editing. However,
the formulation of image matting is inherently ill-posed. Traditional methods
usually employ interaction to deal with the image matting problem with trimaps
and strokes, and cannot run on the mobile phone in real-time. In this paper, we
propose a real-time automatic deep matting approach for mobile devices. By
leveraging the densely connected blocks and the dilated convolution, a light
full convolutional network is designed to predict a coarse binary mask for
portrait images. And a feathering block, which is edge-preserving and matting
adaptive, is further developed to learn the guided filter and transform the
binary mask into alpha matte. Finally, an automatic portrait animation system
based on fast deep matting is built on mobile devices, which does not need any
interaction and can realize real-time matting with 15 fps. The experiments show
that the proposed approach achieves comparable results with the
state-of-the-art matting solvers.Comment: ACM Multimedia Conference (MM) 2017 camera-read
- …