3 research outputs found

    Examining the Performance of PARACUDA-II Data-Mining Engine versus Selected Techniques to Model Soil Carbon from Reflectance Spectra

    No full text
    The monitoring and quantification of soil carbon provide a better understanding of soil and atmosphere dynamics. Visible-near-infrared-short-wave infrared (VIS-NIR-SWIR) reflectance spectroscopy can quantitatively estimate soil carbon content more rapidly and cost-effectively compared to traditional laboratory analysis. However, effective estimation of soil carbon using reflectance spectroscopy to a great extent depends on the selection of a suitable preprocessing sequence and data-mining algorithm. Many efforts have been dedicated to the comparison of conventional chemometric techniques and their optimization for soil properties prediction. Instead, the current study focuses on the potential of the new data-mining engine PARACUDA-II®, recently developed at Tel-Aviv University (TAU), by comparing its performance in predicting soil oxidizable carbon (Cox) against common data-mining algorithms including partial least squares regression (PLSR), random forests (RF), boosted regression trees (BRT), support vector machine regression (SVMR), and memory based learning (MBL). To this end, 103 soil samples from the Pokrok dumpsite in the Czech Republic were scanned with an ASD FieldSpec III Pro FR spectroradiometer in the laboratory under a strict protocol. Spectra preprocessing for conventional data-mining techniques was conducted using Savitzky-Golay smoothing and the first derivative method. PARACUDA-II®, on the other hand, operates based on the all possibilities approach (APA) concept, a conditional Latin hypercube sampling (cLHs) algorithm and parallel programming, to evaluate all of the potential combinations of eight different spectral preprocessing techniques against the original reflectance and chemical data prior to the model development. The comparison of results was made in terms of the coefficient of determination (R2) and root-mean-square error of prediction (RMSEp). Results showed that the PARACUDA-II® engine performed better than the other selected regular schemes with R2 value of 0.80 and RMSEp of 0.12; the PLSR was less predictive compared to other techniques with R2 = 0.63 and RMSEp = 0.29. This can be attributed to its capability to assess all the available options in an automatic way, which enables the hidden models to rise up and yield the best available model

    A multivariate statistical and GIS approach to estimate heavy metal(loid)s in contaminated surface soils

    Get PDF
    In recent decades, there has been a growing concern over the escalating pollution of soil with heavy metal(loid)s, which poses an immediate threat to human health, food safety, and the overall soil environment. This research aimed to assess the extent of contamination, spatial distribution, sources of contamination, potential ecological risks, and health hazards associated with heavy metal(loid)s (specifically As, Cd, Co, Cr, Cu, Fe, Mn, Ni, Pb, Sr, and Zn) by collecting soil samples from the surface soils in the mining region of Cerrito Blanco and Matehuala, San Luis Potosi in central Mexico. In addition to this, another study was conducted on rare trace metal(loid)s (B, Ba, Sb, Sn, and V) and other trace metals (Ca, Mg, Na, and K) in this selected region, which shows a level of contamination for those metals. The contamination levels of these heavy metal(loid)s were determined using various indices such as Igeo (geo-accumulation index), Cf (contamination factor), PLI (pollution load index), Cd (degree of contamination), mCd (modified degree of contamination), PIN (nemerow pollution index), EF (enrichment factor), and PERI (potential ecological risk index). Multivariate statistical techniques, such as principal component analysis (PCA), cluster analysis, or factor analysis, were used to identification of patterns and correlations among different heavy metal(loid)s and soil parameters. The findings indicated a significant degree of contamination in the surface soil due to heavy metal(loid)s. The integrated contamination indices and the potential ecological risk index revealed high levels of contamination and substantial ecological risks in the study areas, with particular emphasis on the need to control As in the surface soils surrounding Matehuala. Source identification of heavy metal(loid)s were performed using the APCS-MLR, PMF, and UNMIX receptor models, which detected three potential sources: mining and smelting activities, natural sources, and anthropogenic activities. The APCS-MLR model appeared to be more suitable for identifying complex contamination sources, demonstrating a better R2 coefficient and P/M (predicted/measured) ratio than the other models. Mining and smelting activities were identified as the primary factors influencing the distribution of heavy metal(loid)s in the surface soils. The most effective GIS interpolation technique was selected to analyse the spatial distribution patterns of heavy metal(loid) content, comparing five different GIS interpolation approaches such as Inverse Distance Weighting (IDW), Local Polynomial (LP), Ordinary Kriging (OK), Empirical Bayesian Kriging (EBK), and Radial Basis Functions (RBF). The results indicated regions of significant concentrations for all heavy metal(loid)s, with the northern, western, and central parts of the study area exhibiting particularly elevated levels. Ecological risk assessment based on PERI revealed considerable risk for As and moderate risk for the remaining metals. Moreover, a probabilistic evaluation of health risks indicated minimal non-carcinogenic risks (HI) for humans but significant carcinogenic risks (CR) for both adults and children. Notably, children were found to be more vulnerable to the health risks associated with exposure to these heavy metals compared to adults. Consequently, enhanced monitoring efforts are necessary to address the issue of heavy metal(loid)s contamination in the rapidly developing Matehuala regions.James Watt Scholarshi
    corecore