A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

Author: Ball John E.
Anderson Derek T.
Chan Chee Seng
Publication date: 01/01/2017

In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusion, and machine learning, to name a few. This means that the RS community should be aware of, if not at the leading edge of, advancements like DL. Herein, we provide the most comprehensive survey of state-of-the-art RS DL research. We also review recent new developments in the DL field that can be used in DL for RS. Namely, we focus on theories, tools and challenges for the RS community. Specifically, we focus on unsolved challenges and opportunities as they relate to (i) inadequate data sets, (ii) human-understandable solutions for modelling physical phenomena, (iii) Big Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and learning algorithms for spectral, spatial and temporal data, (vi) transfer learning, (vii) an improved theoretical understanding of DL systems, (viii) high barriers to entry, and (ix) training and optimizing the DL. Comment: 64 pages, 411 references. To appear in Journal of Applied Remote Sensin

arXiv.org e-Print Archive

FigShare

Learnable Reconstruction Methods from RGB Images to Hyperspectral Imaging: A Survey

Author: Felix Heide
Jingang Zhang
Qiang Fu
Runmu Su
Wenqi Ren
Yunfeng Nie
Publication date: 30/06/2021

Hyperspectral imaging enables versatile applications due to its competence in capturing abundant spatial and spectral information, which are crucial for identifying substances. However, the devices for acquiring hyperspectral images are expensive and complicated. Therefore, many alternative spectral imaging methods have been proposed by directly reconstructing the hyperspectral information from lower-cost, more available RGB images. We present a thorough investigation of these state-of-the-art spectral reconstruction methods from the widespread RGB images. A systematic study and comparison of more than 25 methods has revealed that most of the data-driven deep learning methods are superior to prior-based methods in terms of reconstruction accuracy and quality despite lower speeds. This comprehensive review can serve as a fruitful reference source for peer researchers, thus further inspiring future development directions in related domains

arXiv.org e-Print Archive

Directory of Open Access Journals

Multisource and Multitemporal Data Fusion in Remote Sensing

Author: Ghamisi Pedram
Rasti Behnood
Yokoya Naoto
Wang Qunming
Hofle Bernhard
Bruzzone Lorenzo
Bovolo Francesca
Chi Mingmin
Anders Katharina
Gloaguen Richard
Atkinson Peter M.
Benediktsson Jon Atli
Publication date: 19/12/2018

The sharp and recent increase in the availability of data captured by different sensors combined with their considerably heterogeneous natures poses a serious challenge for the effective and efficient processing of remotely sensed data. Such an increase in remote sensing and ancillary datasets, however, opens up the possibility of utilizing multimodal datasets in a joint manner to further improve the performance of the processing approaches with respect to the application at hand. Multisource data fusion has, therefore, received enormous attention from researchers worldwide for a wide variety of applications. Moreover, thanks to the revisit capability of several spaceborne sensors, the integration of the temporal information with the spatial and/or spectral/backscattering information of the remotely sensed data is possible and helps to move from a representation of 2D/3D data to 4D data structures, where the time variable adds new information as well as challenges for the information extraction algorithms. There are a huge number of research works dedicated to multisource and multitemporal data fusion, but the methods for the fusion of different modalities have expanded in different paths according to each research community. This paper brings together the advances of multisource and multitemporal data fusion approaches with respect to different research communities and provides a thorough and discipline-specific starting point for researchers at different levels (i.e., students, researchers, and senior researchers) willing to conduct novel investigations on this challenging topic by supplying sufficient detail and references

arXiv.org e-Print Archive

Leveraging Computer Vision for Applications in Biomedicine and Geoscience

Author: Johansen Thomas Haugland
Publication venue: Wiley
Publication date: 01/01/2021

Skin cancer is one of the most common types of cancer and is usually classified as either non-melanoma or melanoma skin cancer. Melanoma skin cancer accounts for about half of all skin cancer-related deaths. The 5-year survival rate is 99% when the cancer is detected early but drops to 25% once it becomes metastatic. In other words, the key to preventing death is early detection. Foraminifera are microscopic single-celled organisms that exist in marine environments and are classified as living a benthic or planktic lifestyle. In total, roughly 50,000 species are known to have existed, of which about 9,000 are still living today. Foraminifera are important proxies for reconstructing past ocean and climate conditions and as bio-indicators of anthropogenic pollution. Since the 1800s, the identification and counting of foraminifera have been performed manually. The process is resource-intensive. In this dissertation, we leverage recent advances in computer vision, driven by breakthroughs in deep learning methodologies and scale-space theory, to make progress towards both early detection of melanoma skin cancer and automation of the identification and counting of microscopic foraminifera. First, we investigate the use of hyperspectral images in skin cancer detection by performing a critical review of relevant, peer-reviewed research. Second, we present a novel scale-space methodology for detecting changes in hyperspectral images. Third, we develop a deep learning model for classifying microscopic foraminifera. Finally, we present a deep learning model for instance segmentation of microscopic foraminifera. The works presented in this dissertation are valuable contributions in the fields of biomedicine and geoscience, more specifically, towards the challenges of early detection of melanoma skin cancer and automation of the identification, counting, and picking of microscopic foraminifera.

Performance Enhancement of Hyperspectral Semantic Segmentation Leveraging Ensemble Networks

Author: Soucy Nicholas
Publication venue: DigitalCommons@UMaine
Publication date: 01/12/2022

Hyperspectral image (HSI) semantic segmentation is a growing field within computer vision, machine learning, and forestry. Due to the separate nature of these communities, research applying deep learning techniques to ground-type semantic segmentation needs improvement, along with working to bring the research and expectations of these three communities together. Semantic segmentation consists of classifying individual pixels within the image based on the features present. Many issues need to be resolved in HSI semantic segmentation including data preprocessing, feature reduction, semantic segmentation techniques, and adversarial training. In this thesis, we tackle these challenges by employing ensemble methods for HSI semantic segmentation. Deep neural networks (DNNs) for classification tasks have been employed in HSI semantic segmentation with great success. The ensemble method in traditional classification is often used to increase performance, but research into applying it to semantic segmentation in HSIs is relatively new. Instead of using a single network approach to classification, the ensemble method employs multiple networks to improve performance. Research into ensemble methods in HSI has seen increased accuracy, but often has higher computational complexity and relies on expensive preprocessing techniques. To showcase the performance increase the ensemble method has on semantic segmentation, we propose the novel flagship model Clustering Ensemble U-Net (CEU-Net). In CEU-Net we (1) use a bagging ensemble technique to reduce the computational complexity, (2) utilize clustering on class labels as an intelligent method of delineating which data goes to each network, thereby making each sub-network an expert on a particular cluster, and (3) implement with or without patching for better data flexibility. 
It is shown that CEU-Net outperforms existing hyperspectral semantic segmentation methods, achieving better performance with and without patching compared to baseline models. Semantic segmentation models are vulnerable to adversarial attacks and need adversarial training to counteract them. Adversarial attacks are often intelligent attacks that use the knowledge of a trained classifier to create imperceptible perturbations to hurt classification accuracy. Traditional approaches to adversarial robustness focus on training or retraining a single network on attacked data; however, in the presence of multiple attacks these approaches decrease the performance compared to networks trained individually on each attack. To combat adversarial attacks in HSI semantic segmentation, we propose the Adversarial Discriminator Ensemble Network (ADE-Net) which focuses on attack type detection and adversarial robustness under a unified model to preserve per data-type weight optimally while making the overall network robust. In the proposed method, a discriminator network is used to separate data by attack type into their specific attack-expert ensemble sub-network. The ensemble and discriminator networks are trained together using a unified novel loss function to share information between each network. Our approach allows for the presence of multiple attacks mixed together while also labeling attack types during testing. In this thesis, we experimentally show that ADE-Net outperforms the baseline, which is a single network adversarially trained under a mix of multiple attacks, for popular HSI datasets.

University of Maine