7 research outputs found

    Automated Remote Sensing Image Interpretation with Limited Labeled Training Data

    Get PDF
    Automated remote sensing image interpretation has been investigated for more than a decade. In early years, most work was based on the assumption that there are sufficient labeled samples to be used for training. However, ground-truth collection is a very tedious and time-consuming task and sometimes very expensive, especially in the field of remote sensing that usually relies on field surveys to collect ground truth. In recent years, as the development of advanced machine learning techniques, remote sensing image interpretation with limited ground-truth has caught the attention of researchers in the fields of both remote sensing and computer science. Three approaches that focus on different aspects of the interpretation process, i.e., feature extraction, classification, and segmentation, are proposed to deal with the limited ground truth problem. First, feature extraction techniques, which usually serve as a pre-processing step for remote sensing image classification are explored. Instead of only focusing on feature extraction, a joint feature extraction and classification framework is proposed based on ensemble local manifold learning. Second, classifiers in the case of limited labeled training data are investigated, and an enhanced ensemble learning method that outperforms state-of-the-art classification methods is proposed. Third, image segmentation techniques are investigated, with the aid of unlabeled samples and spatial information. A semi-supervised self-training method is proposed, which is capable of expanding the number of training samples by its own and hence improving classification performance iteratively. Experiments show that the proposed approaches outperform state-of-the-art techniques in terms of classification accuracy on benchmark remote sensing datasets.4 month

    Uncertainty quantification in graph-based classification of high dimensional data

    Get PDF
    Classification of high dimensional data finds wide-ranging applications. In many of these applications equipping the resulting classification with a measure of uncertainty may be as important as the classification itself. In this paper we introduce, develop algorithms for, and investigate the properties of, a variety of Bayesian models for the task of binary classification; via the posterior distribution on the classification labels, these methods automatically give measures of uncertainty. The methods are all based around the graph formulation of semi-supervised learning. We provide a unified framework which brings together a variety of methods which have been introduced in different communities within the mathematical sciences. We study probit classification in the graph-based setting, generalize the level-set method for Bayesian inverse problems to the classification setting, and generalize the Ginzburg-Landau optimization-based classifier to a Bayesian setting; we also show that the probit and level set approaches are natural relaxations of the harmonic function approach introduced in [Zhu et al 2003]. We introduce efficient numerical methods, suited to large data-sets, for both MCMC-based sampling as well as gradient-based MAP estimation. Through numerical experiments we study classification accuracy and uncertainty quantification for our models; these experiments showcase a suite of datasets commonly used to evaluate graph-based semi-supervised learning algorithms.Comment: 33 pages, 14 figure

    The remote sensing of papyrus vegetation (Cyperus papyrus L.) in swamp wetlands of South Africa.

    Get PDF
    Thesis (Ph.D.)-University of KwaZulu-Natal, Pietermaritzburg, 2010.Papyrus (Cyperus papyrus .L) swamp is the most species rich habitat that play vital hydrological, ecological, and economic roles in central tropical and western African wetlands. However, the existence of papyrus vegetation is endangered due to intensification of agricultural use and human encroachment. Techniques for modelling the distribution of papyrus swamps, quantity and quality are therefore critical for the rapid assessment and proactive management of papyrus vegetation. In this regard, remote sensing techniques provide rapid, potentially cheap, and relatively accurate strategies to accomplish this task. This study advocates the development of techniques based on hyperspectral remote sensing technology to accurately map and predict biomass of papyrus vegetation in a high mixed species environment of St Lucia- South Africa which has been overlooked in scientific research. Our approach was to investigate the potential of hyperspectral remote sensing at two levels of investigation: field level and airborne platform level. First, the study provides an overview of the current use of both multispectral and hyperspectral remote sensing techniques in mapping the quantity and the quality of wetland vegetation as well as the challenges and the need for further research. Second, the study explores whether papyrus can be discriminated from each one of its coexistence species (binary class). Our results showed that, at full canopy cover, papyrus vegetation can be accurately discriminated from its entire co-existing species using a new hierarchical method based on three integrated analysis levels and field spectrometry under natural field conditions. These positive results prompted the need to test the use of canopy hyperspectral data resampled to HYMAP resolution and two machine learning algorithms in identifying key spectral bands that allowed for better discrimination among papyrus and other co-existing species (n = 3) (multi-class classification). Results showed that the random forest algorithm (RF) simplified the process by identifying the minimum number of spectral bands that provided the best overall accuracies. Narrow band NDVI and SR-based vegetation indices calculated from hyperspectral data as well as some vegetation indices published in literature were investigated to test their potential in improving the classification accuracy of wetland plant species. The study also evaluated the robustness and reliability of RF as a variables selection method and as a classification algorithm in identifying key spectral bands that allowed for the successful classification of wetland species. Third, the focus was to upscale the results of field spectroscopy analysis to airborne hyperspectral sensor (AISA eagle) to discriminate papyrus and it co-existing species. The results indicated that specific wavelengths located in the visible, red-edge, and near-infrared region of the electromagnetic spectrum have the highest potential of discriminating papyrus from the other species. Finally, the study explored the ability of narrow NDVI-based vegetation indices calculated from hyperspectral data in predicting the green above ground biomass of papyrus. The results demonstrated that papyrus biomass can be modelled with relatively low error of estimates using a non-linear RF regression algorithm. This provided a basis for the algorithm to be used in mapping wetland biomass in highly complex environments. Overall, the study has demonstrated the potential of remote sensing techniques in discriminating papyrus swamps and its co-existing species as well as in predicting biomass. Compared to previous studies, the RF model applied in this study has proved to be a robust, accurate, and simple new method for variables selection, classification, and modelling of hyperspectral data. The results are important for establishing a baseline of the species distributions in South African swamp wetlands for future monitoring and control efforts

    Remote sensing of the distribution and quality of subtropical C3 and C4 grasses.

    Get PDF
    Thesis (Ph.D.)-University of KwaZulu-Natal, Durban, 2013.Global climate change is expected to be accompanied by changes in the composition of plant functional types. Such changes are predicted to follow shifts in the percentage cover and abundance of grass species, following the C3 and C4 photosynthetic pathways. These two groups differ in a number of physiological, structural and biochemical aspects. It is important to measure these characteristic properties because they affect ecosystem processes, such as nutrient cycling. High spectral and spatial resolution remote sensing systems have been proven to offer data, which can be used to accurately detect, classify and map plant species. The major challenge, however, is that the spectral reflectance data obtained over many narrow contiguous channels (i.e. hyperspectral data) represent multiple classes that are often mixed for a limited training-sample size. This is commonly referred to as the Hughes phenomenon or “the curse of dimensionality”. In the context of hyperspectral data analysis, the Hughes phenomenon often introduces a high degree of multicollinearity, which is caused by the use of highly-correlated spectral predictors. Multicollinearity is a prominent problem in processing hyperspectral data for vegetation applications, due to similarities in the spectral reflectance properties of biophysical and biochemical attributes. This study explored an innovative method to solve the problems associated with spectral dimensionality and the related multicollinearity, by developing a user-defined inter-band correlation filter function to resample hyperspectral data. The proposed resampling technique convolves the spectral dependence information between a chosen band-centre and its shorter and longer wavelength neighbours. The utility of the new resampling technique was assessed for discriminating C3 (Festuca costata) and C4 (Themeda triandra and Rendlia altera) grasses and for predicting their nutrient content (nitrogen, protein, moisture, and fibre), using partial least squares and random forest regressions. In general, results obtained showed that the user-defined inter-band correlation filter technique can mitigate the problem of multicollinearity in both classification and regression analyses. Wavebands in the shortwave infrared region were found to be very important in regression and classification analyses, using field spectra-only datasets. Next, the analyses were up-scaled from field spectra to the new generation multispectral satellite, WorldView-2 imagery, which was acquired for the Cathedral Peak region of the Drakensberg Mountains. The results obtained, showed that the WV2 image data contain useful information for classifying the C3 and C4 grasses and for predicting variability in their nitrogen and fibre concentrations. This study makes a contribution by developing a user-defined inter-band correlation filter to resample hyperspectral data, and thereby mitigating the high dimensionality and multicollinearity problems, in remote sensing applications involving C3 and C4 grass species or communities
    corecore