295 research outputs found
Deep learning in remote sensing: a review
Standing at the paradigm shift towards data-intensive science, machine
learning techniques are becoming increasingly important. In particular, as a
major breakthrough in the field, deep learning has proven as an extremely
powerful tool in many fields. Shall we embrace deep learning as the key to all?
Or, should we resist a 'black-box' solution? There are controversial opinions
in the remote sensing community. In this article, we analyze the challenges of
using deep learning for remote sensing data analysis, review the recent
advances, and provide resources to make deep learning in remote sensing
ridiculously simple to start with. More importantly, we advocate remote sensing
scientists to bring their expertise into deep learning, and use it as an
implicit general model to tackle unprecedented large-scale influential
challenges, such as climate change and urbanization.Comment: Accepted for publication IEEE Geoscience and Remote Sensing Magazin
Learning deep structured active contours end-to-end
The world is covered with millions of buildings, and precisely knowing each
instance's position and extents is vital to a multitude of applications.
Recently, automated building footprint segmentation models have shown superior
detection accuracy thanks to the usage of Convolutional Neural Networks (CNN).
However, even the latest evolutions struggle to precisely delineating borders,
which often leads to geometric distortions and inadvertent fusion of adjacent
building instances. We propose to overcome this issue by exploiting the
distinct geometric properties of buildings. To this end, we present Deep
Structured Active Contours (DSAC), a novel framework that integrates priors and
constraints into the segmentation process, such as continuous boundaries,
smooth edges, and sharp corners. To do so, DSAC employs Active Contour Models
(ACM), a family of constraint- and prior-based polygonal models. We learn ACM
parameterizations per instance using a CNN, and show how to incorporate all
components in a structured output model, making DSAC trainable end-to-end. We
evaluate DSAC on three challenging building instance segmentation datasets,
where it compares favorably against state-of-the-art. Code will be made
available.Comment: To appear, CVPR 201
Find your Way by Observing the Sun and Other Semantic Cues
In this paper we present a robust, efficient and affordable approach to
self-localization which does not require neither GPS nor knowledge about the
appearance of the world. Towards this goal, we utilize freely available
cartographic maps and derive a probabilistic model that exploits semantic cues
in the form of sun direction, presence of an intersection, road type, speed
limit as well as the ego-car trajectory in order to produce very reliable
localization results. Our experimental evaluation shows that our approach can
localize much faster (in terms of driving time) with less computation and more
robustly than competing approaches, which ignore semantic information
SRDA-Net: Super-Resolution Domain Adaptation Networks for Semantic Segmentation
Recently, Unsupervised Domain Adaptation was proposed to address the domain
shift problem in semantic segmentation task, but it may perform poor when
source and target domains belong to different resolutions. In this work, we
design a novel end-to-end semantic segmentation network, Super-Resolution
Domain Adaptation Network (SRDA-Net), which could simultaneously complete
super-resolution and domain adaptation. Such characteristics exactly meet the
requirement of semantic segmentation for remote sensing images which usually
involve various resolutions. Generally, SRDA-Net includes three deep neural
networks: a Super-Resolution and Segmentation (SRS) model focuses on recovering
high-resolution image and predicting segmentation map; a pixel-level domain
classifier (PDC) tries to distinguish the images from which domains; and
output-space domain classifier (ODC) discriminates pixel label distributions
from which domains. PDC and ODC are considered as the discriminators, and SRS
is treated as the generator. By the adversarial learning, SRS tries to align
the source with target domains on pixel-level visual appearance and
output-space. Experiments are conducted on the two remote sensing datasets with
different resolutions. SRDA-Net performs favorably against the state-of-the-art
methods in terms of accuracy and visual quality. Code and models are available
at https://github.com/tangzhenjie/SRDA-Net
- …