32 research outputs found

    Predicting uncertainty of machine learning models for modelling nitrate pollution of groundwater using quantile regression and UNNEC methods

    Get PDF
    Although estimating the uncertainty of models used for modelling nitrate contamination of groundwater is essential in groundwater management, it has been generally ignored. This issue motivates this research to explore the predictive uncertainty of machine-learning (ML) models in this field of study using two different residuals uncertainty methods: quantile regression (QR) and uncertainty estimation based on local errors and clustering (UNEEC). Prediction-interval coverage probability (PICP), the most important of the statistical measures of uncertainty, was used to evaluate uncertainty. Additionally, three state-of-the-art ML models including support vector machine (SVM), random forest (RF), and k-nearest neighbor (kNN) were selected to spatially model groundwater nitrate concentrations. The models were calibrated with nitrate concentrations from 80 wells (70% of the data) and then validated with nitrate concentrations from 34 wells (30% of the data). Both uncertainty and predictive performance criteria should be considered when comparing and selecting the best model. Results highlight that the kNN model is the best model because not only did it have the lowest uncertainty based on the PICP statistic in both the QR (0.94) and the UNEEC (in all clusters, 0.85–0.91) methods, but it also had predictive performance statistics (RMSE = 10.63, R2 = 0.71) that were relatively similar to RF (RMSE = 10.41, R2 = 0.72) and higher than SVM (RMSE = 13.28, R2 = 0.58). Determining the uncertainty of ML models used for spatially modelling groundwater-nitrate pollution enables managers to achieve better risk-based decision making and consequently increases the reliability and credibility of groundwater-nitrate predictions

    Assessment and correction of DEM generation from airborne and space borne radar systems with reference to geo-hazard identification in the Cameron Highlands, Malaysia.

    Full text link
    This research assesses the accuracy of SRTM and AIRSAR DEMs acquired over the mountainous-hillands of Cameron Highlandswith DEMs generated from Digital Aerial Photograph (DAP) with a fine (2 m) spatial resolution and height resolution of about 0.5 m. Theground control points used for generating stereo models from the DAP were acquired during field work using GPS which achievedaccuracy better than 2 cm in most cases.To overcome the difficulty of overlaying the DEMs with the DAP DEM as no features can be easily identified on both the images,therefore a technique of using transects and contours generated from the DEMs were used to correct the horizontal displacement. ForAIRSAR DEM, comparing the accompanying AIRSAR composite images was also employed. These then allowed an analysis of theheight accuracy to be undertaken.The height of both the AIRSAR and SRTM DEMs were also corrected by applying Linear Regression Models. These models wereproduced by comparing pixels obtained from points, profiles and an area. Once again the corrected DEMs were assessed. Finally theextracted profiles and contours from the corrected SRTM and AIRSAR were compared with the reference DEM.From the comparisons, the horizontal errors were found to be about one and the half pixels (138.72 m: for SRTM) to the east and 1pixel (10 m: for AIRSAR) to the south. The SD of height differences of the SRTM and AIRSAR DEMs using 90% data were 9.2 m and 5.2 mwith profiles comparison; 10.4 m and 5.4 m with area comparison; 10.8 m and 2.4 m with GPS GCPs comparison respectively. From thethree comparisons, the means of height differences are 5.2 m, 6.1 m and 15.2 m for SRTM and 8.1 m, 8.3 m and 7.9 m for AIRSAR DEM.The results suggest there is height offset in the AIRSAR DEM. When both heights of DEMs were corrected, the generated contours areclose to each other and to reference contours. Using contour colours images and height modelling, the corrected DEM was found tohave the potential to detect areas that prone to flash floods and mudslides

    Social Vulnerability Assessment Using Artificial Neural Network (ANN) Model for Earthquake Hazard in Tabriz City, Iran

    Get PDF
    This study presents the application of an artificial neural network (ANN) and geographic information system (GIS) for estimating the social vulnerability to earthquakes in the Tabriz city, Iran. Thereby, seven indicators were identified and used for earthquake vulnerability mapping, including population density, household density, employed density, unemployed density, and literate people. To obtain more accuracy in our analysis, all of the indicators were entered into a geographic information system (GIS). After the standardization of the data, an artificial neural network (ANN) model was applied for deriving a social vulnerability map (SVM) of different hazard classes for Tabriz city. The results showed that 0.77% of the total area was found to be very highly vulnerable. Very low vulnerability was recorded for 76.31% of the total study area. The comparison of data provided by (SVM) and the residential building vulnerability (RBV) of Tabriz city indicated the validity of the results obtained by ANN processes. Scatter plots are used to plot the data. These scatter plots indicate the existence of a strong positive relationship between the most vulnerable zones (1, 4, and 5) and the least (3, 7, and 9) of the SVM and RBV. The results highlight the importance of using social vulnerability study for defining seismic-risk mitigation policies, emergency management, and territorial planning in order to reduce the impacts of disasters

    SEVUCAS: A Novel GIS-Based Machine Learning Software for Seismic Vulnerability Assessment

    Get PDF
    Since it is not possible to determine the exact time of a natural disaster’s occurrence and the amount of physical and financial damage on humans or the environment resulting from their event, decision-makers need to identify areas with potential vulnerability in order to reduce future losses. In this paper, a GIS-based open source software entitled Seismic-Related Vulnerability Calculation Software (SEVUCAS), based on the Step-wise Weight Assessment Ratio Analysis (SWARA) method and geographic information system, has been developed to assess seismic vulnerability by considering four groups of criteria (i.e., geotechnical, structural, socio-economic, and physical distance to needed facilities and away from dangerous facilities). The software was developed in C# language using ArcGIS Engine functions, which provide enhanced visualization as well as user-friendly and automatic software for the seismic vulnerability assessment of buildings. Weighting of the criteria (indicators) and alternatives (sub-indicators) was done using SWARA. Also, two interpolation methods based on a radial basis function (RBF) and teaching–learning-based optimization (TLBO) were used to optimize the weights of the criteria and the classes of each alternative as well. After weighing the criteria and alternatives, the weighted overlay analysis was used to determine the final vulnerability map in the form of contours and statistical data. The difference between this software and similar ones is that people with a low level of knowledge in the area of earthquake crisis management can use it to determine and estimate the seismic vulnerabilities of their houses. This visualized operational forecasting software provides an applicable tool for both government and people to make quick and correct decisions to determine higher priority structures for seismic retrofitting implementation

    Mapping of Groundwater Spring Potential in Karst Aquifer System Using Novel Ensemble Bivariate and Multivariate Models

    Get PDF
    Groundwater is an important natural resource in arid and semi-arid environments, where discharge from karst springs is utilized as the principal water supply for human use. The occurrence of karst springs over large areas is often poorly documented, and interpolation strategies are often utilized to map the distribution and discharge potential of springs. This study develops a novel method to delineate karst spring zones on the basis of various hydrogeological factors. A case study of the Bojnourd Region, Iran, where spring discharge measurements are available for 359 sites, is used to demonstrate application of the new approach. Spatial mapping is achieved using ensemble modelling, which is based on certainty factors (CF) and logistic regression (LR). Maps of the CF and LR components of groundwater potential were generated individually, and then, combined to prepare an ensemble map of the study area. The accuracy (A) of the ensemble map was then assessed using area under the receiver operating characteristic curve. Results of this analysis show that LR (A = 78%) outperformed CF (A = 67%) in terms of the comparison between model predictions and known occurrences of karst springs (i.e., calibration data). However, combining the CF and LR results through ensemble modelling produced superior accuracy (A = 85%) in terms of spring potential mapping. By combining CF and LR statistical models through ensemble modelling, weaknesses in CF and LR methods are offset, and therefore, we recommend this ensemble approach for similar karst mapping projects. The methodology developed here offers an efficient method for assessing spring discharge and karst spring potentials over regional scales.Validerad;2020;Nivå 2;2020-03-31 (johcin)</p

    Landslide Detection and Susceptibility Mapping by AIRSAR Data Using Support Vector Machine and Index of Entropy Models in Cameron Highlands, Malaysia

    Get PDF
    Since landslide detection using the combination of AIRSAR data and GIS-based susceptibility mapping has been rarely conducted in tropical environments, the aim of this study is to compare and validate support vector machine (SVM) and index of entropy (IOE) methods for landslide susceptibility assessment in Cameron Highlands area, Malaysia. For this purpose, ten conditioning factors and observed landslides were detected by AIRSAR data, WorldView-1 and SPOT 5 satellite images. A spatial database was generated including a total of 92 landslide locations encompassing the same number of observed and detected landslides, which was divided into training (80%; 74 landslide locations) and validation (20%; 18 landslide locations) datasets. Results of the difference between observed and detected landslides using root mean square error (RMSE) indicated that only 16.3% error exists, which is fairly acceptable. The validation process was performed using statistical-based measures and the area under the receiver operating characteristic (AUROC) curves. Results of validation process indicated that the SVM model has the highest values of sensitivity (88.9%), specificity (77.8%), accuracy (83.3%), Kappa (0.663) and AUROC (84.5%), followed by the IOE model. Overall, the SVM model applied to detected landslides is considered to be a promising technique that could be tested and utilized for landslide susceptibility assessment in tropical environments
    corecore