Broadband hyperspectral imaging for breast tumor detection using spectral and spatial information
Complete tumor removal during breast-conserving surgery remains challenging due to the lack of optimal intraoperative margin assessment techniques. Here, we use hyperspectral imaging for tumor detection in fresh breast tissue. We evaluated different wavelength ranges and two classification algorithms: a pixel-wise classification algorithm and a convolutional neural network that combines spectral and spatial information. The highest classification performance was obtained using the full wavelength range (450-1650 nm). Adding spatial information mainly improved the differentiation of tissue classes within the malignant and healthy classes. High sensitivity and specificity were achieved, which offers potential for hyperspectral imaging as a margin assessment technique to improve surgical outcome. (C) 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreement.
Exploring synergetic effects of dimensionality reduction and resampling tools on hyperspectral imagery data classification
The present paper addresses the problem of classifying hyperspectral images with multiple imbalanced classes and very high dimensionality. Class imbalance is handled by resampling the data set, whereas PCA and a supervised filter are applied to reduce the number of spectral bands. This preliminary study aims to investigate the benefits of combining several techniques to tackle the imbalance and high dimensionality problems, and to evaluate the order of application that leads to the best classification performance. Experimental results demonstrate the significance of using these two preprocessing tools together to improve the performance of hyperspectral imagery classification. Although the most effective order seems to be a resampling strategy first and then a feature selection (or extraction) algorithm, this question still needs a much more thorough investigation in the future. This work has partially been supported by the Spanish Ministry of Education and Science under grants CSD2007–00018, AYA2008–05965–0596 and TIN2009–14205, the Fundació Caixa Castelló–Bancaixa under grant P1–1B2009–04, and the Generalitat Valenciana under grant PROMETEO/2010/02
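The "resample first, then reduce the bands" order described above can be sketched in a few lines. This is only an illustration under stated assumptions: random oversampling stands in for the paper's unspecified resampling strategy, a Fisher-style score stands in for its supervised filter, and the function names and toy spectra are hypothetical.

```python
import random
from collections import Counter, defaultdict
from statistics import mean, pvariance


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen samples until every class matches the largest one."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    target = max(len(rows) for rows in by_class.values())
    X_out, y_out = [], []
    for label, rows in by_class.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for row in rows + extra:
            X_out.append(row)
            y_out.append(label)
    return X_out, y_out


def fisher_scores(X, y):
    """Score each band by between-class scatter divided by within-class scatter."""
    classes = sorted(set(y))
    scores = []
    for b in range(len(X[0])):
        overall = mean(row[b] for row in X)
        between = within = 0.0
        for c in classes:
            values = [row[b] for row, label in zip(X, y) if label == c]
            between += len(values) * (mean(values) - overall) ** 2
            within += len(values) * pvariance(values)
        scores.append(between / within if within > 0 else float("inf"))
    return scores


def select_bands(X, y, k):
    """Return the indices of the k highest-scoring bands, in band order."""
    scores = fisher_scores(X, y)
    ranked = sorted(range(len(scores)), key=lambda b: scores[b], reverse=True)
    return sorted(ranked[:k])


# Hypothetical toy data: 3 bands, 4 samples of class 0, 2 samples of class 1.
X = [[0.10, 1.0, 5.0], [0.20, 1.1, 5.2], [0.10, 0.9, 4.8],
     [0.20, 1.0, 5.0], [0.10, 3.0, 5.1], [0.20, 3.1, 4.9]]
y = [0, 0, 0, 0, 1, 1]

X_bal, y_bal = random_oversample(X, y)      # step 1: balance the classes
print(Counter(y_bal))
print(select_bands(X_bal, y_bal, k=1))      # step 2: keep the most discriminative band
```

Swapping the two steps (select bands on the imbalanced data, then resample) gives the alternative order that the paper compares against.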
Exploring issues of balanced versus imbalanced samples in mapping grass community in the telperion reserve using high resolution images and selected machine learning algorithms
ABSTRACT
Accurate vegetation mapping is essential for a number of reasons, one of which is conservation. The main objective of this research was to map different grass communities in the game reserve using RapidEye and Sentinel-2 MSI images and two machine learning classifiers, support vector machine (SVM) and random forest (RF). The study tested the impact of balanced and imbalanced training data on the performance and accuracy of SVM and RF in mapping the grass communities, and the sensitivity of pixel resolution to balanced and imbalanced training data in image classification. The imbalanced and balanced data sets were obtained through field data collection.
The results show that RF and SVM produce a high overall accuracy for Sentinel-2 imagery with both the balanced and the imbalanced data set. The RF classifier yielded an overall accuracy of 79.45% and a kappa of 74.38% using imbalanced training data, and an overall accuracy of 76.19% and a kappa of 73.21% using balanced training data. The SVM classifier yielded an overall accuracy of 82.54% and a kappa of 80.36% using imbalanced training data, and an overall accuracy of 82.21% and a kappa of 78.33% using balanced training data.
For the RapidEye imagery, balancing the training data reduced the overall accuracy of both RF and SVM. The RF accuracy dropped by about 5 percentage points (from 63.24% to 57.94%), while the SVM accuracy dropped by about 7 percentage points (from 57.31% to 50.79%). These results suggest that the imbalanced data set is a better option than the balanced data set for image classification of vegetation species.
The study recommends that future research implement ways of handling misclassification among the different grass species to improve classification. Further research can be carried out on other types of high-resolution multispectral imagery, using different advanced algorithms and different training sample sizes.
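For readers unfamiliar with the two metrics reported above, the following sketch shows how overall accuracy and Cohen's kappa are typically derived from a confusion matrix. The function name and the example matrix are hypothetical and are not taken from the study.

```python
def overall_accuracy_and_kappa(confusion):
    """Compute overall accuracy and Cohen's kappa.

    confusion[i][j] is the number of samples of reference class i
    that the classifier assigned to class j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix is empty")
    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(n)) / total
    # Chance agreement: product of row and column marginals per class.
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(confusion[i][j] for i in range(n)) for j in range(n)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical two-class confusion matrix (rows: reference, columns: predicted).
accuracy, kappa = overall_accuracy_and_kappa([[40, 10], [5, 45]])
print(f"overall accuracy = {accuracy:.2f}, kappa = {kappa:.2f}")
# prints "overall accuracy = 0.85, kappa = 0.70"
```

Kappa is lower than overall accuracy here because it discounts the agreement expected by chance, which is why abstracts commonly report both figures.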
Hyperspectral Image Analysis with Subspace Learning-based One-Class Classification
Hyperspectral image (HSI) classification is an important task in many
applications, such as environmental monitoring, medical imaging, and land
use/land cover (LULC) classification. Due to the significant amount of spectral
information from recent HSI sensors, analyzing the acquired images is
challenging using traditional Machine Learning (ML) methods. As the number of
frequency bands increases, the required number of training samples increases
exponentially to achieve a reasonable classification accuracy, also known as
the curse of dimensionality. Therefore, separate band selection or
dimensionality reduction techniques are often applied before performing any
classification task over HSI data. In this study, we investigate recently
proposed subspace learning methods for one-class classification (OCC). These
methods map high-dimensional data to a lower-dimensional feature space that is
optimized for one-class classification. In this way, there is no separate
dimensionality reduction or feature selection procedure needed in the proposed
classification framework. Moreover, one-class classifiers have the ability to
learn a data description from the category of a single class only. Considering
the imbalanced labels of the LULC classification problem and rich spectral
information (high number of dimensions), the proposed classification approach
is well-suited for HSI data. Overall, this is a pioneering study focusing on
subspace learning-based one-class classification for HSI data. We analyze the
performance of the proposed subspace learning one-class classifiers in the
proposed pipeline. Our experiments validate that the proposed approach helps
tackle the curse of dimensionality along with the imbalanced nature of HSI
data.
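The idea of learning a data description from a single class can be illustrated with a deliberately simple stand-in. The sketch below is not the subspace learning method studied in the paper: it performs no dimensionality reduction and only models the target class as a centroid with an acceptance radius. The class name, the `quantile` parameter, and the toy spectra are all hypothetical.

```python
import math


class CentroidOneClass:
    """Accept a sample if it lies within a learned radius of the target-class centroid."""

    def __init__(self, quantile=0.95):
        # Fraction of training distances that the radius should cover.
        self.quantile = quantile

    def fit(self, X):
        """Learn the centroid and radius from target-class samples only."""
        n_bands = len(X[0])
        self.center = [sum(row[b] for row in X) / len(X) for b in range(n_bands)]
        distances = sorted(math.dist(row, self.center) for row in X)
        index = min(len(distances) - 1, int(self.quantile * len(distances)))
        self.radius = distances[index]
        return self

    def predict(self, X):
        """Return True for samples accepted as the target class."""
        return [math.dist(row, self.center) <= self.radius for row in X]


# Hypothetical spectra of a single land-cover class (3 bands each).
target = [[0.20, 0.40, 0.60], [0.22, 0.38, 0.61],
          [0.19, 0.41, 0.59], [0.21, 0.40, 0.60]]
model = CentroidOneClass().fit(target)
print(model.predict([[0.20, 0.40, 0.60], [0.90, 0.10, 0.10]]))
# prints "[True, False]"
```

No samples from other classes are needed for training, which is the property that makes one-class approaches attractive when labels are heavily imbalanced.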
Using LUCAS survey and Recurrent Neural Networks to produce LCLU classification based on a Satellite Image time series of Sentinel-2
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business Intelligence. The need for timely and accurate information about the territory has increased over the years, making
Land Cover Land Use (LCLU) mapping one of the most common applications of remote sensing.
Recently, the advances in satellite technology and the open access policies for remote sensing data
increased the interest in exploring satellite image time series. In addition, the attention of
researchers has shifted from standard machine learning algorithms (e.g., Support Vector Machines
and Random Forest) to Recurrent Neural Networks due to their ability to exploit sequential
information. However, acquiring reference data to train these algorithms is still a hurdle. This study
aims to evaluate the capability of a Gated Recurrent Unit in performing pixel-level LCLU classification
of a satellite image time series, using Sentinel-2 imagery and having the LUCAS survey as reference
data. To assess the performance of our model we compared it to state-of-the-art classifiers (SVM and
RF). Due to the imbalanced nature of the LUCAS survey, we applied oversampling to this dataset to
increase the performance of our models, testing three different oversampling techniques. The results
attained showed that Recurrent Neural Networks did not outperform the other state-of-the-art
algorithms, when trained with a limited number of sampling units, and that oversampling the LUCAS
survey increased the performance of all the classifiers. Finally, we were able to demonstrate that it is
possible to produce LCLU classification of satellite image time series using only open-source data by
using Sentinel-2 imagery and the LUCAS survey as reference data.
Machine Learning for Classifying Marine Vegetation from Hyperspectral Drone Data in the Norwegian coast
Along the Norwegian coast, blue forests are key marine habitats. Due to increased anthropogenic activity and climate change, the health and extent of the blue forests are threatened. However, no low-cost, reliable system for monitoring blue forests exists in Norway at this time. This thesis studied machine learning methods to classify marine vegetation from hyperspectral data acquired in Norway. The study area is situated by Larvik at Ølbergholmen. The dataset consists of 12 hyperspectral images with 173 spectral bands in the region 390 nm - 749 nm and corresponding labels of the different classes. This dataset was used to train and evaluate the machine learning methods. In addition, an independent dataset from a different site was used for robustness evaluation. Three machine learning methods were studied: Random Forest (RF), Support Vector Machines (SVM) and Convolutional Neural Network (CNN). The results indicate that the CNN approach had the best performance during validation based on the computed statistical measures. However, when evaluated for robustness, RF performed best. The confusion matrices computed for the validation and robustness studies revealed that the presence of a so-called turf algae caused difficulties in distinguishing between the classes, which is an important finding with regard to future research. This thesis has shown that machine learning can be used for monitoring blue forests and various marine vegetation species using hyperspectral drone imaging along the Norwegian coast.