2,958 research outputs found

    Hyper-Spectral Image Analysis with Partially-Latent Regression and Spatial Markov Dependencies

    Hyper-spectral data can be analyzed to recover physical properties at large planetary scales. This involves resolving inverse problems which can be addressed within machine learning, with the advantage that, once a relationship between physical parameters and spectra has been established in a data-driven fashion, the learned relationship can be used to estimate physical parameters for new hyper-spectral observations. Within this framework, we propose a spatially-constrained and partially-latent regression method which maps high-dimensional inputs (hyper-spectral images) onto low-dimensional responses (physical parameters such as the local chemical composition of the soil). The proposed regression model comprises two key features. Firstly, it combines a Gaussian mixture of locally-linear mappings (GLLiM) with a partially-latent response model. While the former makes high-dimensional regression tractable, the latter makes it possible to deal with physical parameters that cannot be observed or, more generally, with data contaminated by experimental artifacts that cannot be explained with noise models. Secondly, spatial constraints are introduced in the model through a Markov random field (MRF) prior which provides a spatial structure to the Gaussian-mixture hidden variables. Experiments conducted on a database composed of remotely sensed observations collected from the planet Mars by the Mars Express orbiter demonstrate the effectiveness of the proposed model. Comment: 12 pages, 4 figures, 3 tables

    High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables

    In this work we address the problem of approximating high-dimensional data with a low-dimensional representation. We make the following contributions. We propose an inverse regression method which exchanges the roles of input and response, such that the low-dimensional variable becomes the regressor, and which is tractable. We introduce a mixture of locally-linear probabilistic mappings that starts with estimating the parameters of inverse regression, and follows with inferring closed-form solutions for the forward parameters of the high-dimensional regression problem of interest. Moreover, we introduce a partially-latent paradigm, such that the vector-valued response variable is composed of both observed and latent entries, thus being able to deal with data contaminated by experimental artifacts that cannot be explained with noise models. The proposed probabilistic formulation could be viewed as a latent-variable augmentation of regression. We devise expectation-maximization (EM) procedures based on a data augmentation strategy which facilitates the maximum-likelihood search over the model parameters. We propose two augmentation schemes and we describe in detail the associated EM inference procedures that may well be viewed as generalizations of a number of EM regression, dimension reduction, and factor analysis algorithms. The proposed framework is validated with both synthetic and real data. We provide experimental evidence that our method outperforms several existing regression techniques.

    Prediction Models for Estimation of Soil Moisture Content

    This thesis introduces the implementation of different supervised learning techniques for producing accurate estimates of soil moisture content using empirical information, including meteorological and remotely sensed data. The models thus developed can be extended to be used by the personal remote sensing systems developed in the Center for Self-Organizing Intelligent Systems (CSOIS). The different models employed span a wide range of machine-learning techniques, from basic linear regression models to models based on a Bayesian framework. In addition, ensemble methods such as bagging and boosting are applied to all models for considerable improvements in accuracy. The main research objective is to understand, compare, and analyze the mathematical background of, and the results obtained from, the different models and the respective improvement techniques employed.

    Estimation of Mars surface physical properties from hyperspectral images using Sliced Inverse Regression

    Visible and near infrared imaging spectroscopy is a key remote sensing technique to study and monitor planet Mars. Indeed it allows the detection, mapping and characterization of minerals as well as volatile species that often constitute the first step toward the resolution of key climatic and geological issues. These tasks are carried out by the spectral analysis of the solar light reflected in different directions by the materials forming the top few millimeters or centimeters of the ground. The chemical composition, granularity, texture, physical state, etc. of the materials determine the morphology of the hundreds of thousands of spectra that typically constitute an image. Radiative transfer models simulating the propagation of solar light through the Martian atmosphere and surface and then to the sensor aim at evaluating numerically the direct and quantitative link between parameters and spectra. Then techniques must be applied in order to reverse the link and evaluate the properties of atmospheric and surface materials from the spectra. Processing all the pixels of an image finally provides physical and structural maps. We use a regularized version of the SIR method (K.C. Li, Sliced Inverse Regression for dimension reduction, Journal of the American Statistical Association, 86:316-327, 1991) combined with linear interpolation to reverse the previous numerical link. For that purpose we first generate numerous corresponding pairs of parameters and synthetic spectra by direct radiative transfer modeling in order to constitute a learning database. The SIR step reduces the dimension of the spectra (usually 184 wavelengths) in order to overcome the curse of dimensionality. Then, a linear interpolation is used to relate the reduced components of a spectrum to a given physical parameter value. This inverted link is applied to a real dataset of hyperspectral images collected by the OMEGA instrument (Mars Express mission).
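
The two-step inversion described in this abstract (SIR for dimension reduction, then linear interpolation on the reduced component) can be illustrated with a minimal Python sketch. It implements plain SIR as in Li (1991), not the regularized variant the authors use, and it assumes NumPy is available. The function names `sir_directions` and `interpolate_parameter` are hypothetical, and any data passed to them would stand in for the radiative-transfer learning database.

```python
import numpy as np


def sir_directions(X, y, n_slices=10, n_components=1):
    """Plain (unregularized) Sliced Inverse Regression.

    X: (n, p) array of spectra; y: (n,) array of one physical parameter.
    Returns a (p, n_components) array of unit-norm projection directions.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Whiten the inputs: Z = (X - mean) @ Sigma^{-1/2}.
    mean = X.mean(axis=0)
    cov = np.cov(X, rowvar=False)
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(1.0 / np.sqrt(evals)) @ evecs.T
    Z = (X - mean) @ inv_sqrt

    # Slice the samples by sorted response and average Z within each slice.
    order = np.argsort(y)
    M = np.zeros((p, p))
    for idx in np.array_split(order, n_slices):
        m = Z[idx].mean(axis=0)
        M += (len(idx) / n) * np.outer(m, m)

    # Leading eigenvectors of the between-slice covariance, mapped back
    # to the original (unwhitened) coordinates.
    _, vecs = np.linalg.eigh(M)  # eigenvalues in ascending order
    top = vecs[:, ::-1][:, :n_components]
    B = inv_sqrt @ top
    return B / np.linalg.norm(B, axis=0)


def interpolate_parameter(train_proj, train_y, new_proj):
    """Linear interpolation from a 1-D reduced component to the parameter."""
    order = np.argsort(train_proj)
    return np.interp(new_proj, train_proj[order], train_y[order])
```

In use, one would estimate `B` from the synthetic learning database, project both the training spectra and the observed spectra onto `B[:, 0]`, and pass the projections to `interpolate_parameter`. With strongly correlated wavelengths the covariance matrix is ill-conditioned, which is the usual motivation for a regularized SIR rather than the plain version sketched here.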

    Proceedings of the 2011 New York Workshop on Computer, Earth and Space Science

    The purpose of the New York Workshop on Computer, Earth and Space Sciences is to bring together the New York area's finest Astronomers, Statisticians, Computer Scientists, Space and Earth Scientists to explore potential synergies between their respective fields. The 2011 edition (CESS2011) was a great success, and we would like to thank all of the presenters and participants for attending. This year was also special as it included authors from the upcoming book titled "Advances in Machine Learning and Data Mining for Astronomy". Over two days, the latest advanced techniques used to analyze the vast amounts of information now available for the understanding of our universe and our planet were presented. These proceedings attempt to provide a small window into what the current state of research is in this vast interdisciplinary field and we'd like to thank the speakers who spent the time to contribute to this volume. Comment: Author lists modified. 82 pages. Workshop Proceedings from CESS 2011 in New York City, Goddard Institute for Space Studies

    Estimating the soil clay content and organic matter by means of different calibration methods of vis-NIR diffuse reflectance spectroscopy

    The selection of calibration method is one of the main factors influencing the accuracy of soil property estimation in visible and near infrared reflectance spectroscopy. In this study, the performance of three regression techniques, namely, partial least-squares regression (PLSR), support vector regression (SVR), and multivariate adaptive regression splines (MARS), was compared to identify the best method to assess organic matter (OM) and clay content in salt-affected soils. One hundred and two soil samples collected from Northern Sinai, Egypt, were used as the data set for the calibration and validation procedures. The dry samples were scanned using a FieldSpec Pro FR Portable Spectroradiometer (Analytical Spectral Devices, ASD) with a measurement range of 350–2500 nm. The spectra were subjected to seven pre-processing techniques: Savitzky–Golay (SG) smoothing, first derivative with SG smoothing (FD-SG), second derivative with SG smoothing (SD-SG), continuum removed reflectance (CR), standard normal variate and detrending (SNV-DT), multiplicative scatter correction (MSC) and extended MSC. The results of cross-validation showed that in most cases MARS models performed better than PLSR and SVR models. The best predictions were obtained using MARS calibration methods with CR pre-processing, yielding R², root mean squared error (RMSE), and ratio of performance to deviation (RPD) values of 0.85, 0.19%, and 2.63, respectively, for OM; and 0.90, 5.32%, and 3.15, respectively, for clay content.
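
For readers unfamiliar with the three figures of merit quoted in this abstract, the following Python sketch computes them under common definitions. The abstract does not state which variants the authors used, so these are assumptions: R² is taken as 1 − SS_res/SS_tot, RMSE as the root of the mean squared residual, and RPD as the sample standard deviation of the reference values divided by the RMSE. The function name `calibration_metrics` is hypothetical.

```python
import math
import statistics


def calibration_metrics(observed, predicted):
    """Return (R^2, RMSE, RPD) for paired reference and predicted values.

    Assumed definitions (not confirmed by the source):
      R^2  = 1 - SS_res / SS_tot
      RMSE = sqrt(SS_res / n)
      RPD  = sample standard deviation of `observed` / RMSE
    Perfect predictions (RMSE = 0) or constant reference values
    (SS_tot = 0) are not handled and raise ZeroDivisionError.
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")

    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    mean_obs = statistics.fmean(observed)
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)

    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / len(observed))
    rpd = statistics.stdev(observed) / rmse
    return r2, rmse, rpd
```

Because RPD divides the spread of the reference values by the RMSE, it depends on the variability of the sample set as well as on the model error, so values from different data sets are not directly comparable.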

    Wind-Tunnel Balance Characterization for Hypersonic Research Applications

    Wind-tunnel research was recently conducted at the NASA Langley Research Center's 31-Inch Mach 10 Hypersonic Facility in support of the Mars Science Laboratory's aerodynamic program. Researchers were interested in understanding the interaction between the freestream flow and the reaction control system onboard the entry vehicle. A five-component balance, designed for hypersonic testing with pressurized flow-through capability, was used. In addition to the aerodynamic forces, the balance was exposed to both thermal gradients and varying internal cavity pressures. Historically, the effect of these environmental conditions on the response of the balance has not been fully characterized due to the limitations in the calibration facilities. Through statistical design of experiments, thermal and pressure effects were strategically and efficiently integrated into the calibration of the balance. As a result of this new approach, researchers were able to use the balance continuously throughout the wide range of temperatures and pressures and obtain real-time results. Although this work focused on a specific application, the methodology shown can be applied more generally to any force measurement system calibration.

    Study of Mobile Robot Operations Related to Lunar Exploration

    Mobile robots extend the reach of exploration in environments unsuitable for, or unreachable by, humans. Far-reaching environments, such as the south lunar pole, exhibit lighting conditions that are challenging for optical imagery required for mobile robot navigation. Terrain conditions also impact the operation of mobile robots; distinguishing terrain types prior to physical contact can improve hazard avoidance. This thesis presents the conclusions of a trade-off that uses the results from two studies related to operating mobile robots at the lunar south pole. The lunar south pole presents engineering design challenges for both tele-operation and lidar-based autonomous navigation in the context of a near-term, low-cost, short-duration lunar prospecting mission. The conclusion is that direct-drive tele-operation may result in improved science data return. The first study demonstrates that lidar reflectance intensity and near-infrared spectroscopy can improve terrain classification over optical imagery alone. Two classification techniques, Naive Bayes and multi-class SVM, were compared for classification errors. Eight terrain types, including aggregate, loose sand and compacted sand, are classified using wavelet-transformed optical images and statistical values of lidar reflectance intensity. The addition of lidar reflectance intensity was shown to reduce classification errors for both classifiers. Four types of aggregate material are classified using statistical values of spectral reflectance. The addition of spectral reflectance was shown to reduce classification errors for both classifiers. The second study is on human performance in tele-operating a mobile robot over time-delay and in lighting conditions analogous to the south lunar pole. Round-trip time delay between operator and mobile robot leads to an increase in time to turn the mobile robot around obstacles or corners as operators tend to implement a 'wait and see' approach. A study on completion time for a cornering task through varying corridor widths shows that time-delayed performance fits a previously established cornering law, and that varying lighting conditions did not adversely affect human performance. The results of the cornering law are interpreted to quantify the additional time required to negotiate a corner under differing conditions, and this increase in time can be interpreted to be predictive when operating a mobile robot through a driving circuit.

    Chrome Layer Thickness Modelling in a Hard Chromium Plating Process Using a Hybrid PSO/ RBF–SVM–Based Model

    The purpose of chromium plating is the creation of a hard and wear-resistant layer of chromium over a metallic surface. The principal feature of chromium plating is its endurance in the face of wear and corrosion. This industrial process has a vast range of applications in many different areas. Several difficulties can arise when carrying out this process. Some of the most common are melt deposition, milky white chromium deposition, rough or sandy chromium deposition, lack of toughness or wear of the layer, and insufficient thickness of the deposited layer. This study builds a novel nonparametric method, based on statistical machine learning, that employs a hybrid support vector machines (SVMs) model to forecast the hard chromium layer thickness. The SVM hyperparameters were optimized with the Particle Swarm Optimizer (PSO). The outcomes indicate that the PSO/SVM-based model with a radial basis function (RBF) kernel predicts the thickness of the chromium layer created in this industrial process satisfactorily. Two kinds of outcomes have been obtained. Firstly, this model makes it possible to rank the relevance of the seven independent input variables investigated in this industrial process. Secondly, the high performance and low complexity of the model indicate that the PSO/SVM method compares favourably with other conventional forecasting techniques, since a coefficient of determination of 0.9952 is obtained.

    Representing complex data using localized principal components with application to astronomical data

    Often the relation between the variables constituting a multivariate data space might be characterized by one or more of the terms: "nonlinear", "branched", "disconnected", "bended", "curved", "heterogeneous", or, more generally, "complex". In these cases, simple principal component analysis (PCA) as a tool for dimension reduction can fail badly. Of the many alternative approaches proposed so far, local approximations of PCA are among the most promising. This paper will give a short review of localized versions of PCA, focusing on local principal curves and local partitioning algorithms. Furthermore we discuss projections other than the local principal components. When performing local dimension reduction for regression or classification problems it is important to focus not only on the manifold structure of the covariates, but also on the response variable(s). Local principal components only achieve the former, whereas localized regression approaches concentrate on the latter. Local projection directions derived from the partial least squares (PLS) algorithm offer an interesting trade-off between these two objectives. We apply these methods to several real data sets. In particular, we consider simulated astrophysical data from the future Galactic survey mission Gaia. Comment: 25 pages. In "Principal Manifolds for Data Visualization and Dimension Reduction", A. Gorban, B. Kegl, D. Wunsch, and A. Zinovyev (eds), Lecture Notes in Computational Science and Engineering, Springer, 2007, pp. 180--204, http://www.springer.com/dal/home/generic/search/results?SGWID=1-40109-22-173750210-