33 research outputs found

    Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

    Get PDF
    In this work we propose a structured prediction technique that combines the virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a) our structured prediction task has a unique global optimum that is obtained exactly from the solution of a linear system (b) the gradients of our model parameters are analytically computed using closed form expressions, in contrast to the memory-demanding contemporary deep structured prediction approaches that rely on back-propagation-through-time, (c) our pairwise terms do not have to be simple hand-crafted expressions, as in the line of works building on the DenseCRF, but can rather be `discovered' from data through deep architectures, and (d) out system can trained in an end-to-end manner. Building on standard tools from numerical analysis we develop very efficient algorithms for inference and learning, as well as a customized technique adapted to the semantic segmentation task. This efficiency allows us to explore more sophisticated architectures for structured prediction in deep learning: we introduce multi-resolution architectures to couple information across scales in a joint optimization framework, yielding systematic improvements. We demonstrate the utility of our approach on the challenging VOC PASCAL 2012 image segmentation benchmark, showing substantial improvements over strong baselines. We make all of our code and experiments available at {https://github.com/siddharthachandra/gcrf}Comment: Our code is available at https://github.com/siddharthachandra/gcr

    Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

    Full text link
    Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming. Coarse annotations (e.g., scribbles, coarse polygons) offer an economical alternative, with which training phase could hardly generate satisfactory performance unfortunately. In order to generate high-quality annotated data with a low time cost for accurate segmentation, in this paper, we propose a novel annotation enrichment strategy, which expands existing coarse annotations of training data to a finer scale. Extensive experiments on the Cityscapes and PASCAL VOC 2012 benchmarks have shown that the neural networks trained with the enriched annotations from our framework yield a significant improvement over that trained with the original coarse labels. It is highly competitive to the performance obtained by using human annotated dense annotations. The proposed method also outperforms among other state-of-the-art weakly-supervised segmentation methods.Comment: CIKM 2018 International Conference on Information and Knowledge Managemen

    Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications

    Get PDF
    This work addresses the problem of training a convolutional neural network for segmenting satellite images in emergency situations, where images to be segmented are potentially very different from training images. Such case is particularly challenging due to the large intra-class variations in image statistics between images captured at different locations by different sensors. We propose a convolutional encoder-decoder network architecture where the encoder is built around residual networks. We show that the proposed architecture enable learning features able to generalize the learning process across images with largely different statistics. Our architecture can accurately segment images that have no reference in the training set, whereas a minimal refinement of the trained network significantly boosts the segmentation accuracy

    Design Assessment and Simulation of PCA Based Image Difference Detection and Segmentation for Satellite Images Using Machine Learning

    Get PDF
    It is possible to define the quantity of temporal effects by employing multitemporal data sets to discover changes in nature or in the status of any object based on observations taken at various points in time. It's not uncommon to come across a variety of different methods for spotting changes in data. These methods can be categorized under a single umbrella term.  There are two primary areas of study: supervised and unsupervised change detection. In this study, the goal is to identify the changes in land cover.  Covers a specific area in Kayseri using unsupervised change detection algorithms and Landsat satellite pictures from various years have been gleaned through the use of remote sensing. In the meantime, image differencing is taking place.  The method will be applied to the photographs using the image-enhancing process. In the next step, Principal Component Analysis (PCA) is employed.  The difference image will be analyzed using Component Analysis. To find out which locations have and which do not. As a first step, a procedure must be in place.  We've finished registering images one after the other. Consequently, the photos are being linked together. After then, it's back to black and white.  Three non-overlapping portions of the difference image have been created. This can be done using the principal component analysis method.  From the eigenvector space, we may get to the fundamental components. As a last point, the major feature vector space fuzzy C-Means Clustering is used to divide the component into two clusters, and then a change detection technique is carried out. As the world's population grew, farmland expansion and unplanned land encroachment intensified, resulting in uncontrolled deforestation around the globe. This project uses unsupervised learning algorithm K-means clustering. In a cost-effective manner that can be employed by officials, companies as well as private groups, to assist in fighting illicit deforestation and analysis of satellite database

    Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation

    Get PDF
    We propose an approach to discover class-specific pixels for the weakly-supervised semantic segmentation task. We show that properly combining saliency and attention maps allows us to obtain reliable cues capable of significantly boosting the performance. First, we propose a simple yet powerful hierarchical approach to discover the class-agnostic salient regions, obtained using a salient object detector, which otherwise would be ignored. Second, we use fully convolutional attention maps to reliably localize the class-specific regions in a given image. We combine these two cues to discover class-specific pixels which are then used as an approximate ground truth for training a CNN. While solving the weakly supervised semantic segmentation task, we ensure that the image-level classification task is also solved in order to enforce the CNN to assign at least one pixel to each object present in the image. Experimentally, on the PASCAL VOC12 val and test sets, we obtain the mIoU of 60.8% and 61.9%, achieving the performance gains of 5.1% and 5.2% compared to the published state-of-the-art results. The code is made publicly available
    corecore