33 research outputs found
Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs
In this work we propose a structured prediction technique that combines the
virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a)
our structured prediction task has a unique global optimum that is obtained
exactly from the solution of a linear system (b) the gradients of our model
parameters are analytically computed using closed form expressions, in contrast
to the memory-demanding contemporary deep structured prediction approaches that
rely on back-propagation-through-time, (c) our pairwise terms do not have to be
simple hand-crafted expressions, as in the line of works building on the
DenseCRF, but can rather be `discovered' from data through deep architectures,
and (d) out system can trained in an end-to-end manner. Building on standard
tools from numerical analysis we develop very efficient algorithms for
inference and learning, as well as a customized technique adapted to the
semantic segmentation task. This efficiency allows us to explore more
sophisticated architectures for structured prediction in deep learning: we
introduce multi-resolution architectures to couple information across scales in
a joint optimization framework, yielding systematic improvements. We
demonstrate the utility of our approach on the challenging VOC PASCAL 2012
image segmentation benchmark, showing substantial improvements over strong
baselines. We make all of our code and experiments available at
{https://github.com/siddharthachandra/gcrf}Comment: Our code is available at https://github.com/siddharthachandra/gcr
Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning
Rich high-quality annotated data is critical for semantic segmentation
learning, yet acquiring dense and pixel-wise ground-truth is both labor- and
time-consuming. Coarse annotations (e.g., scribbles, coarse polygons) offer an
economical alternative, with which training phase could hardly generate
satisfactory performance unfortunately. In order to generate high-quality
annotated data with a low time cost for accurate segmentation, in this paper,
we propose a novel annotation enrichment strategy, which expands existing
coarse annotations of training data to a finer scale. Extensive experiments on
the Cityscapes and PASCAL VOC 2012 benchmarks have shown that the neural
networks trained with the enriched annotations from our framework yield a
significant improvement over that trained with the original coarse labels. It
is highly competitive to the performance obtained by using human annotated
dense annotations. The proposed method also outperforms among other
state-of-the-art weakly-supervised segmentation methods.Comment: CIKM 2018 International Conference on Information and Knowledge
Managemen
Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications
This work addresses the problem of training a convolutional neural network for segmenting satellite images in emergency situations, where images to be segmented are potentially very different from training images. Such case is particularly challenging due to the large intra-class variations in image statistics between images captured at different locations by different sensors. We propose a convolutional encoder-decoder network architecture where the encoder is built around residual networks. We show that the proposed architecture enable learning features able to generalize the learning process across images with largely different statistics. Our architecture can accurately segment images that have no reference in the training set, whereas a minimal refinement of the trained network significantly boosts the segmentation accuracy
Design Assessment and Simulation of PCA Based Image Difference Detection and Segmentation for Satellite Images Using Machine Learning
It is possible to define the quantity of temporal effects by employing multitemporal data sets to discover changes in nature or in the status of any object based on observations taken at various points in time. It's not uncommon to come across a variety of different methods for spotting changes in data. These methods can be categorized under a single umbrella term. There are two primary areas of study: supervised and unsupervised change detection. In this study, the goal is to identify the changes in land cover. Covers a specific area in Kayseri using unsupervised change detection algorithms and Landsat satellite pictures from various years have been gleaned through the use of remote sensing. In the meantime, image differencing is taking place. The method will be applied to the photographs using the image-enhancing process. In the next step, Principal Component Analysis (PCA) is employed. The difference image will be analyzed using Component Analysis. To find out which locations have and which do not. As a first step, a procedure must be in place. We've finished registering images one after the other. Consequently, the photos are being linked together. After then, it's back to black and white. Three non-overlapping portions of the difference image have been created. This can be done using the principal component analysis method. From the eigenvector space, we may get to the fundamental components. As a last point, the major feature vector space fuzzy C-Means Clustering is used to divide the component into two clusters, and then a change detection technique is carried out. As the world's population grew, farmland expansion and unplanned land encroachment intensified, resulting in uncontrolled deforestation around the globe. This project uses unsupervised learning algorithm K-means clustering. In a cost-effective manner that can be employed by officials, companies as well as private groups, to assist in fighting illicit deforestation and analysis of satellite database
Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation
We propose an approach to discover class-specific pixels for the
weakly-supervised semantic segmentation task. We show that properly combining
saliency and attention maps allows us to obtain reliable cues capable of
significantly boosting the performance. First, we propose a simple yet powerful
hierarchical approach to discover the class-agnostic salient regions, obtained
using a salient object detector, which otherwise would be ignored. Second, we
use fully convolutional attention maps to reliably localize the class-specific
regions in a given image. We combine these two cues to discover class-specific
pixels which are then used as an approximate ground truth for training a CNN.
While solving the weakly supervised semantic segmentation task, we ensure that
the image-level classification task is also solved in order to enforce the CNN
to assign at least one pixel to each object present in the image.
Experimentally, on the PASCAL VOC12 val and test sets, we obtain the mIoU of
60.8% and 61.9%, achieving the performance gains of 5.1% and 5.2% compared to
the published state-of-the-art results. The code is made publicly available