4 research outputs found

    Semi-supervised and unsupervised kernel-based novelty detection with application to remote sensing images

    Get PDF
    The main challenge of new information technologies is to retrieve intelligible information from the large volume of digital data gathered every day. Among the variety of existing data sources, the satellites continuously observing the surface of the Earth are key to the monitoring of our environment. The new generation of satellite sensors are tremendously increasing the possibilities of applications but also increasing the need for efficient processing methodologies in order to extract information relevant to the users' needs in an automatic or semi-automatic way. This is where machine learning comes into play to transform complex data into simplified products such as maps of land-cover changes or classes by learning from data examples annotated by experts. These annotations, also called labels, may actually be difficult or costly to obtain since they are established on the basis of ground surveys. As an example, it is extremely difficult to access a region recently flooded or affected by wildfires. In these situations, the detection of changes has to be done with only annotations from unaffected regions. In a similar way, it is difficult to have information on all the land-cover classes present in an image while being interested in the detection of a single one of interest. These challenging situations are called novelty detection or one-class classification in machine learning. In these situations, the learning phase has to rely only on a very limited set of annotations, but can exploit the large set of unlabeled pixels available in the images. This setting, called semi-supervised learning, allows significantly improving the detection. In this Thesis we address the development of methods for novelty detection and one-class classification with few or no labeled information. The proposed methodologies build upon the kernel methods, which take place within a principled but flexible framework for learning with data showing potentially non-linear feature relations. The thesis is divided into two parts, each one having a different assumption on the data structure and both addressing unsupervised (automatic) and semi-supervised (semi-automatic) learning settings. The first part assumes the data to be formed by arbitrary-shaped and overlapping clusters and studies the use of kernel machines, such as Support Vector Machines or Gaussian Processes. An emphasis is put on the robustness to noise and outliers and on the automatic retrieval of parameters. Experiments on multi-temporal multispectral images for change detection are carried out using only information from unchanged regions or none at all. The second part assumes high-dimensional data to lie on multiple low dimensional structures, called manifolds. We propose a method seeking a sparse and low-rank representation of the data mapped in a non-linear feature space. This representation allows us to build a graph, which is cut into several groups using spectral clustering. For the semi-supervised case where few labels of one class of interest are available, we study several approaches incorporating the graph information. The class labels can either be propagated on the graph, constrain spectral clustering or used to train a one-class classifier regularized by the given graph. Experiments on the unsupervised and oneclass classification of hyperspectral images demonstrate the effectiveness of the proposed approaches

    The Vezo communities and fisheries of the coral reef ecosystem in the Bay of Ranobe, Madagascar

    Get PDF
    Madagascar, a country whose extraordinary levels of endemism and biodiversity are celebrated globally by scientists and laymen alike, yet historically has received surprisingly little research attention, is the setting of the present dissertation. Here, I contribute to the need for applied research by: 1) focusing on the most intensely fished section of the Toliara Barrier Reef, the Bay of Ranobe; 2) characterizing the marine environment, the human population, and the fisheries; and 3) collecting the longest known time-series of data on fisheries of Madagascar, thereby providing a useful baseline for future analyses. In Chapter 1, the bathymetry of the Bay was characterized following a unique application of the boosted regression tree classifier to the RGB bands of IKONOS imagery. Derivation of water depths, based on DOS-corrected images, following a generic, log-transformed multiple linear regression approach produced a predictive accuracy of 1.28 m, whereas model fitting performed using the boosted regression tree classifier, allowing for interaction effects (tree complexity= 2), provided increased accuracy (RMSE= 1.01 m). Estimates of human population abundance, distribution, and dynamics were obtained following a dwelling-unit enumeration approach, using IKONOS Panchromatic and Google Earth images. Results indicated, in 2016, 31,850 people lived within 1 km of the shore, and 28,046 people lived within the 12 coastal villages of the Bay. Localized population growth rates within the villages, where birth rates and migration are combined, ranged from 2.96% - 6.83%, greatly exceeding official estimates of 2.78%. Annual pirogue counts demonstrated a shift in fishing effort from south to the north. Gear and boat (pirogue) profiles were developed, and the theoretical maximum number of fishermen predicted (n= 4,820), in 2013, from a regression model based on pirogue lengths (R2= 0.49). Spatial fishing effort distribution was mapped following a satellite-based enumeration of fishers-at-sea, resulting in a bay-wide estimate of intensity equaling 33.3 pirogue-meters km-2. Landings and CPUE were characterized, with respect to finfish, by family, species, gear, and village. Expansion of landings to bay-wide fisheries yields indicated 1,885.8 mt year-1 of mixed fisheries productivity, with an estimated wholesale value of 1.64 million USD per annum

    Untangling hotel industry’s inefficiency: An SFA approach applied to a renowned Portuguese hotel chain

    Get PDF
    The present paper explores the technical efficiency of four hotels from Teixeira Duarte Group - a renowned Portuguese hotel chain. An efficiency ranking is established from these four hotel units located in Portugal using Stochastic Frontier Analysis. This methodology allows to discriminate between measurement error and systematic inefficiencies in the estimation process enabling to investigate the main inefficiency causes. Several suggestions concerning efficiency improvement are undertaken for each hotel studied.info:eu-repo/semantics/publishedVersio
    corecore