Search CORE

33 research outputs found

Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

Author: Chandra Siddhartha
Kokkinos Iasonas
Publication venue
Publication date: 01/01/2016
Field of study

In this work we propose a structured prediction technique that combines the virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a) our structured prediction task has a unique global optimum that is obtained exactly from the solution of a linear system (b) the gradients of our model parameters are analytically computed using closed form expressions, in contrast to the memory-demanding contemporary deep structured prediction approaches that rely on back-propagation-through-time, (c) our pairwise terms do not have to be simple hand-crafted expressions, as in the line of works building on the DenseCRF, but can rather be `discovered' from data through deep architectures, and (d) out system can trained in an end-to-end manner. Building on standard tools from numerical analysis we develop very efficient algorithms for inference and learning, as well as a customized technique adapted to the semantic segmentation task. This efficiency allows us to explore more sophisticated architectures for structured prediction in deep learning: we introduce multi-resolution architectures to couple information across scales in a joint optimization framework, yielding systematic improvements. We demonstrate the utility of our approach on the challenging VOC PASCAL 2012 image segmentation benchmark, showing substantial improvements over strong baselines. We make all of our code and experiments available at {https://github.com/siddharthachandra/gcrf}Comment: Our code is available at https://github.com/siddharthachandra/gcr

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

Author: Bearman Amy L.
Bertasius Gedas
Chandra Siddhartha
Cordts Marius
Lin Di
Muja Marius
Papadopoulos Dim P.
Qi Xiaojuan
Shen Falong
Shen Fumin
Zhang Ke
Zhao Hengshuang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming. Coarse annotations (e.g., scribbles, coarse polygons) offer an economical alternative, with which training phase could hardly generate satisfactory performance unfortunately. In order to generate high-quality annotated data with a low time cost for accurate segmentation, in this paper, we propose a novel annotation enrichment strategy, which expands existing coarse annotations of training data to a finer scale. Extensive experiments on the Cityscapes and PASCAL VOC 2012 benchmarks have shown that the neural networks trained with the enriched annotations from our framework yield a significant improvement over that trained with the original coarse labels. It is highly competitive to the performance obtained by using human annotated dense annotations. The proposed method also outperforms among other state-of-the-art weakly-supervised segmentation methods.Comment: CIKM 2018 International Conference on Information and Knowledge Managemen

arXiv.org e-Print Archive

Crossref

University of Queensland eSpace

Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications

Author: Boccardo Piero
Fiandrotti Attilio
Francini Gianluca
Ghassemi Sina
Magli Enrico
Sandu Constantin
Tonolo Fabio Giulio
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

This work addresses the problem of training a convolutional neural network for segmenting satellite images in emergency situations, where images to be segmented are potentially very different from training images. Such case is particularly challenging due to the large intra-class variations in image statistics between images captured at different locations by different sensors. We propose a convolutional encoder-decoder network architecture where the encoder is built around residual networks. We show that the proposed architecture enable learning features able to generalize the learning process across images with largely different statistics. Our architecture can accurately segment images that have no reference in the training set, whereas a minimal refinement of the trained network significantly boosts the segmentation accuracy

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Design Assessment and Simulation of PCA Based Image Difference Detection and Segmentation for Satellite Images Using Machine Learning

Author: Jat Nemi Chand
Kumar Chetan
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 11/04/2022
Field of study

It is possible to define the quantity of temporal effects by employing multitemporal data sets to discover changes in nature or in the status of any object based on observations taken at various points in time. It's not uncommon to come across a variety of different methods for spotting changes in data. These methods can be categorized under a single umbrella term.  There are two primary areas of study: supervised and unsupervised change detection. In this study, the goal is to identify the changes in land cover.  Covers a specific area in Kayseri using unsupervised change detection algorithms and Landsat satellite pictures from various years have been gleaned through the use of remote sensing. In the meantime, image differencing is taking place.  The method will be applied to the photographs using the image-enhancing process. In the next step, Principal Component Analysis (PCA) is employed.  The difference image will be analyzed using Component Analysis. To find out which locations have and which do not. As a first step, a procedure must be in place.  We've finished registering images one after the other. Consequently, the photos are being linked together. After then, it's back to black and white.  Three non-overlapping portions of the difference image have been created. This can be done using the principal component analysis method.  From the eigenvector space, we may get to the fundamental components. As a last point, the major feature vector space fuzzy C-Means Clustering is used to divide the component into two clusters, and then a change detection technique is carried out. As the world's population grew, farmland expansion and unplanned land encroachment intensified, resulting in uncontrolled deforestation around the globe. This project uses unsupervised learning algorithm K-means clustering. In a cost-effective manner that can be employed by officials, companies as well as private groups, to assist in fighting illicit deforestation and analysis of satellite database

International Journal on Recent and Innovation Trends in Computing and Communication

Discovering Class-Specific Pixels for Weakly-Supervised Semantic Segmentation

Author: Chaudhry Arslan
Dokania Puneet K.
Torr Philip H. S.
Publication venue
Publication date: 01/01/2017
Field of study

We propose an approach to discover class-specific pixels for the weakly-supervised semantic segmentation task. We show that properly combining saliency and attention maps allows us to obtain reliable cues capable of significantly boosting the performance. First, we propose a simple yet powerful hierarchical approach to discover the class-agnostic salient regions, obtained using a salient object detector, which otherwise would be ignored. Second, we use fully convolutional attention maps to reliably localize the class-specific regions in a given image. We combine these two cues to discover class-specific pixels which are then used as an approximate ground truth for training a CNN. While solving the weakly supervised semantic segmentation task, we ensure that the image-level classification task is also solved in order to enforce the CNN to assign at least one pixel to each object present in the image. Experimentally, on the PASCAL VOC12 val and test sets, we obtain the mIoU of 60.8% and 61.9%, achieving the performance gains of 5.1% and 5.2% compared to the published state-of-the-art results. The code is made publicly available

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive