6,324 research outputs found

    Machine learning to generate soil information

    Get PDF
    This thesis is concerned with the novel use of machine learning (ML) methods in soil science research. ML adoption in soil science has increased considerably, especially in pedometrics (the use of quantitative methods to study the variation of soils). In parallel, the size of the soil datasets has also increased thanks to projects of global impact that aim to rescue legacy data or new large extent surveys to collect new information. While we have big datasets and global projects, currently, modelling is mostly based on "traditional" ML approaches which do not take full advantage of these large data compilations. This compilation of these global datasets is severely limited by privacy concerns and, currently, no solution has been implemented to facilitate the process. If we consider the performance differences derived from the generality of global models versus the specificity of local models, there is still a debate on which approach is better. Either in global or local DSM, most applications are static. Even with the large soil datasets available to date, there is not enough soil data to perform a fully-empirical, space-time modelling. Considering these knowledge gaps, this thesis aims to introduce advanced ML algorithms and training techniques, specifically deep neural networks, for modelling large datasets at a global scale and provide new soil information. The research presented here has been successful at applying the latest advances in ML to improve upon some of the current approaches for soil modelling with large datasets. It has also created opportunities to utilise information, such as descriptive data, that has been generally disregarded. ML methods have been embraced by the soil community and their adoption is increasing. In the particular case of neural networks, their flexibility in terms of structure and training makes them a good candidate to improve on current soil modelling approaches

    Rapid detection of soil carbonates by means of NIR spectroscopy, deep learning methods and phase quantification by powder Xray diffraction

    Full text link
    Soil NIR spectral absorbance/reflectance libraries are utilized towards improving agricultural production and analysis of soil properties which are key prerequisite for agroecological balance and environmental sustainability. Carbonates in particular, represent a soil property which is mostly affected even by mild, let alone extreme, changes of environmental conditions during climate change. In this study we propose a rapid and efficient way to predict carbonates content in soil by means of FT NIR reflectance spectroscopy and by use of deep learning methods. We exploited multiple machine learning methods, such as: 1) a MLP Regressor and 2) a CNN and compare their performance with other traditional ML algorithms such as PLSR, Cubist and SVM on the combined dataset of two NIR spectral libraries: KSSL (USDA), a dataset of soil samples reflectance spectra collected nationwide, and LUCAS TopSoil (European Soil Library) which contains soil sample absorbance spectra from all over the European Union, and use them to predict carbonate content on never before seen soil samples. Soil samples in KSSL and in TopSoil spectral libraries were acquired in the spectral region of visNIR, however in this study, only the NIR spectral region was utilized. Quantification of carbonates by means of Xray Diffraction is in good agreement with the volumetric method and the MLP prediction. Our work contributes to rapid carbonates content prediction in soil samples in cases where: 1) no volumetric method is available and 2) only NIR spectra absorbance data are available. Up till now and to the best of our knowledge, there exists no other study, that presents a prediction model trained on such an extensive dataset with such promising results on unseen data, undoubtedly supporting the notion that deep learning models present excellent prediction tools for soil carbonates content.Comment: 39 pages, 5 figure

    Spectroscopic Sensor Data Fusion to Improve the Prediction of Soil Nutrient Contents

    Get PDF
    This study aims to advance the understanding and application of spectroscopic sensor data fusion for improving soil nutrient content predictions. In addition to presenting an extensive review of studies on the spectroscopic sensor data fusion, a research investigation was conducted to assess the effectiveness of five fusion algorithms in predicting three primary nutrients (Nitrogen, Phosphorous, and Potassium) and two secondary nutrients (Calcium and Magnesium) in soil using Visible and Near-Infrared, Mid-Infrared, and X-ray Fluorescence data. Among the five fusion algorithms, one was a low-level fusion involving data concatenation. Two were mid-level fusions, incorporating feature extraction by applying (i) Principal Component reduction and (ii) Partial Least Squares reduction. The other two were high-level fusions, namely (i) Simple Averaging and (ii) Granger Ramanathan Averaging. The results indicate that Low-Level Fusion may not be suitable for inherently incompatible data. Mid-level fusion improved the R² by 0.1-18%, RMSE by 0-8%, and RPIQ by 0-11.5%, while high-level fusion enhanced the R² by 0-12.5%, RMSE by 0-12%, and RPIQ by 2.3-13.4%, depending on the nutrients and fusion algorithms. Despite these improvements, predictions were only satisfactory for primary nutrients, and none of the algorithms could notably enhance predictions for Phosphorous. The study also finds that fusion algorithms do not significantly improve bias. The study provided evidence of improvement in prediction accuracy with data fusion which can aid in delineating management zones for precision agriculture. It also encourages further research on novel approaches of sensor fusion and algorithms that can effectively handle non-linearity introduced due to the fusion of data. Advisor: Yufeng G

    Estimation of soluble solids content and fruit temperature in 'rocha' pear using Vis-NIR spectroscopy and the spectraNet–32 deep learning architecture

    Get PDF
    Spectra-based methods are becoming increasingly important in Precision Agriculture as they offer non-destructive, quick tools for measuring the quality of produce. This study introduces a novel approach for esti-mating the soluble solids content (SSC) of 'Rocha' pears using the SpectraNet-32 deep learning architecture, which operates on 1D fruit spectra in the visible to near-infrared region (Vis-NIRS). This method was also able to estimate fruit temperatures, which improved the SSC prediction performance. The dataset consisted of 3300 spectra from 1650 'Rocha' pears collected from local markets over several weeks during the 2010 and 2011 seasons, which had varying edaphoclimatic conditions. Two types of partial least squares (PLS) feature selection methods, under various configurations, were applied to the input spectra to identify the most significant wavelengths for training SpectraNet-32. The model's robustness was also compared to a similar state-of-the-art deep learning architecture, DeepSpectra, as well as four other classical machine learning algorithms: PLS, multiple linear regression (MLR), support vector machine (SVM), and multi-layer perceptron (MLP). In total, 23 different experimental method configurations were assessed, with 150 neural networks each. SpectraNet-32 consistently outperformed other methods in several metrics. On average, it was 6.1% better than PLS in terms of the root mean square error of prediction (RMSEP, 1.08 vs. 1.15%), 7.7% better in prediction gain (PG, 1.67 vs. 1.55), 3.6% better in the coefficient of determination (R2, 0.58 vs. 0.56) and 5.8% better in the coefficient of variation (CV%, 8.35 vs. 8.86).info:eu-repo/semantics/publishedVersio
    • …
    corecore