2,010 research outputs found
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
In recent years, deep learning (DL), a re-branding of neural networks (NNs),
has risen to the top in numerous areas, namely computer vision (CV), speech
recognition, natural language processing, etc. Whereas remote sensing (RS)
possesses a number of unique challenges, primarily related to sensors and
applications, inevitably RS draws from many of the same theories as CV; e.g.,
statistics, fusion, and machine learning, to name a few. This means that the RS
community should be aware of, if not at the leading edge of, of advancements
like DL. Herein, we provide the most comprehensive survey of state-of-the-art
RS DL research. We also review recent new developments in the DL field that can
be used in DL for RS. Namely, we focus on theories, tools and challenges for
the RS community. Specifically, we focus on unsolved challenges and
opportunities as it relates to (i) inadequate data sets, (ii)
human-understandable solutions for modelling physical phenomena, (iii) Big
Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and
learning algorithms for spectral, spatial and temporal data, (vi) transfer
learning, (vii) an improved theoretical understanding of DL systems, (viii)
high barriers to entry, and (ix) training and optimizing the DL.Comment: 64 pages, 411 references. To appear in Journal of Applied Remote
Sensin
Deep Learning Meets Hyperspectral Image Analysis: A Multidisciplinary Review
Modern hyperspectral imaging systems produce huge datasets potentially conveying a great abundance of information; such a resource, however, poses many challenges in the analysis and interpretation of these data. Deep learning approaches certainly offer a great variety of opportunities for solving classical imaging tasks and also for approaching new stimulating problems in the spatial–spectral domain. This is fundamental in the driving sector of Remote Sensing where hyperspectral technology was born and has mostly developed, but it is perhaps even more true in the multitude of current and evolving application sectors that involve these imaging technologies. The present review develops on two fronts: on the one hand, it is aimed at domain professionals who want to have an updated overview on how hyperspectral acquisition techniques can combine with deep learning architectures to solve specific tasks in different application fields. On the other hand, we want to target the machine learning and computer vision experts by giving them a picture of how deep learning technologies are applied to hyperspectral data from a multidisciplinary perspective. The presence of these two viewpoints and the inclusion of application fields other than Remote Sensing are the original contributions of this review, which also highlights some potentialities and critical issues related to the observed development trends
Multi-level Feature Fusion-based CNN for Local Climate Zone Classification from Sentinel-2 Images: Benchmark Results on the So2Sat LCZ42 Dataset
As a unique classification scheme for urban forms and functions, the local
climate zone (LCZ) system provides essential general information for any
studies related to urban environments, especially on a large scale. Remote
sensing data-based classification approaches are the key to large-scale mapping
and monitoring of LCZs. The potential of deep learning-based approaches is not
yet fully explored, even though advanced convolutional neural networks (CNNs)
continue to push the frontiers for various computer vision tasks. One reason is
that published studies are based on different datasets, usually at a regional
scale, which makes it impossible to fairly and consistently compare the
potential of different CNNs for real-world scenarios. This study is based on
the big So2Sat LCZ42 benchmark dataset dedicated to LCZ classification. Using
this dataset, we studied a range of CNNs of varying sizes. In addition, we
proposed a CNN to classify LCZs from Sentinel-2 images, Sen2LCZ-Net. Using this
base network, we propose fusing multi-level features using the extended
Sen2LCZ-Net-MF. With this proposed simple network architecture and the highly
competitive benchmark dataset, we obtain results that are better than those
obtained by the state-of-the-art CNNs, while requiring less computation with
fewer layers and parameters. Large-scale LCZ classification examples of
completely unseen areas are presented, demonstrating the potential of our
proposed Sen2LCZ-Net-MF as well as the So2Sat LCZ42 dataset. We also
intensively investigated the influence of network depth and width and the
effectiveness of the design choices made for Sen2LCZ-Net-MF. Our work will
provide important baselines for future CNN-based algorithm developments for
both LCZ classification and other urban land cover land use classification
Classification of Arrhythmia by Using Deep Learning with 2-D ECG Spectral Image Representation
The electrocardiogram (ECG) is one of the most extensively employed signals
used in the diagnosis and prediction of cardiovascular diseases (CVDs). The ECG
signals can capture the heart's rhythmic irregularities, commonly known as
arrhythmias. A careful study of ECG signals is crucial for precise diagnoses of
patients' acute and chronic heart conditions. In this study, we propose a
two-dimensional (2-D) convolutional neural network (CNN) model for the
classification of ECG signals into eight classes; namely, normal beat,
premature ventricular contraction beat, paced beat, right bundle branch block
beat, left bundle branch block beat, atrial premature contraction beat,
ventricular flutter wave beat, and ventricular escape beat. The one-dimensional
ECG time series signals are transformed into 2-D spectrograms through
short-time Fourier transform. The 2-D CNN model consisting of four
convolutional layers and four pooling layers is designed for extracting robust
features from the input spectrograms. Our proposed methodology is evaluated on
a publicly available MIT-BIH arrhythmia dataset. We achieved a state-of-the-art
average classification accuracy of 99.11\%, which is better than those of
recently reported results in classifying similar types of arrhythmias. The
performance is significant in other indices as well, including sensitivity and
specificity, which indicates the success of the proposed method.Comment: 14 pages, 5 figures, accepted for future publication in Remote
Sensing MDPI Journa
Fair comparison of skin detection approaches on publicly available datasets
Skin detection is the process of discriminating skin and non-skin regions in
a digital image and it is widely used in several applications ranging from hand
gesture analysis to track body parts and face detection. Skin detection is a
challenging problem which has drawn extensive attention from the research
community, nevertheless a fair comparison among approaches is very difficult
due to the lack of a common benchmark and a unified testing protocol. In this
work, we investigate the most recent researches in this field and we propose a
fair comparison among approaches using several different datasets. The major
contributions of this work are an exhaustive literature review of skin color
detection approaches, a framework to evaluate and combine different skin
detector approaches, whose source code is made freely available for future
research, and an extensive experimental comparison among several recent methods
which have also been used to define an ensemble that works well in many
different problems. Experiments are carried out in 10 different datasets
including more than 10000 labelled images: experimental results confirm that
the best method here proposed obtains a very good performance with respect to
other stand-alone approaches, without requiring ad hoc parameter tuning. A
MATLAB version of the framework for testing and of the methods proposed in this
paper will be freely available from https://github.com/LorisNann
Texture features based microscopic image classification of liver cellular granuloma using artificial neural networks
Automated classification of Schistosoma mansoni granulomatous microscopic images of mice liver using Artificial Intelligence (AI) technologies is a key issue for accurate diagnosis and treatment. In this paper, three grey difference statistics-based features, namely three Gray-Level Co-occurrence Matrix (GLCM) based features and fifteen Gray Gradient Co-occurrence Matrix (GGCM) features were calculated by correlative analysis. Ten features were selected for three-level cellular granuloma classification using a Scaled Conjugate Gradient Back-Propagation Neural Network (SCG-BPNN) in the same performance. A cross-entropy is then calculated to evaluate the proposed Sigmoid input and the ten-hidden layer network. The results depicted that SCG-BPNN with texture features performs high recognition rate compared to using morphological features, such as shape, size, contour, thickness and other geometry-based features for the classification. The proposed method also has a high accuracy rate of 87.2% compared to the Back-Propagation Neural Network (BPNN), Back-Propagation Hopfield Neural Network (BPHNN) and Convolutional Neural Network (CNN)
- …