354 research outputs found

    Novel Hybrid Integration Approach of Bagging-Based Fisher’s Linear Discriminant Function for Groundwater Potential Analysis

    Full text link
    © 2019, International Association for Mathematical Geosciences. Groundwater is a vital water source in the rural and urban areas of developing and developed nations. In this study, a novel hybrid integration approach of Fisher’s linear discriminant function (FLDA) with rotation forest (RFLDA) and bagging (BFLDA) ensembles was used for groundwater potential assessment at the Ningtiaota area in Shaanxi, China. A spatial database with 66 groundwater spring locations and 14 groundwater spring contributing factors was prepared; these factors were elevation, aspect, slope, plan and profile curvatures, sediment transport index, stream power index, topographic wetness index, distance to roads and streams, land use, lithology, soil and normalized difference vegetation index. The classifier attribute evaluation method based on the FLDA model was implemented to test the predictive competence of the mentioned contributing factors. The area under curve, confidence interval at 95%, standard error, Friedman test and Wilcoxon signed-rank test were used to compare and validate the success and prediction competence of the three applied models. According to the achieved results, the BFLDA model showed the most prediction competence, followed by the RFLDA and FLDA models, respectively. The resulting groundwater spring potential maps can be used for groundwater development plans and land use planning

    Evaluation of the landslide susceptibility and its spatial difference in the whole Qinghai-Tibetan Plateau region by five learning algorithms

    Get PDF
    AbstractLandslides are considered as major natural hazards that cause enormous property damages and fatalities in Qinghai-Tibetan Plateau (QTP). In this article, we evaluated the landslide susceptibility, and its spatial differencing in the whole Qinghai-Tibetan Plateau region using five state-of-the-art learning algorithms; deep neural network (DNN), logistic regression (LR), Naïve Bayes (NB), random forest (RF), and support vector machine (SVM), differing from previous studies only in local areas of QTP. The 671 landslide events were considered, and thirteen landslide conditioning factors (LCFs) were derived for database generation, including annual rainfall, distance to drainage (Dsd){(\mathrm{Ds}}_{\mathrm{d}}) ( Ds d ) , distance to faults (Dsf){(\mathrm{Ds}}_{\mathrm{f}}) ( Ds f ) , drainage density (Dd){D}_{d}) D d ) , elevation (Elev), fault density (Fd)({F}_{d}) ( F d ) , lithology, normalized difference vegetation index (NDVI), plan curvature (Plc){(\mathrm{Pl}}_{\mathrm{c}}) ( Pl c ) , profile curvature (Prc){(\mathrm{Pr}}_{\mathrm{c}}) ( Pr c ) , slope (S){(S}^{^\circ }) ( S ∘ ) , stream power index (SPI), and topographic wetness index (TWI). The multi-collinearity analysis and mean decrease Gini (MDG) were used to assess the suitability and predictability of these factors. Consequently, five landslide susceptibility prediction (LSP) maps were generated and validated using accuracy, area under the receiver operatic characteristic curve, sensitivity, and specificity. The MDG results demonstrated that the rainfall, elevation, and lithology were the most significant landslide conditioning factors ruling the occurrence of landslides in Qinghai-Tibetan Plateau. The LSP maps depicted that the north-northwestern and south-southeastern regions ( 45% of total area). Moreover, among the five models with a high goodness-of-fit, RF model was highlighted as the superior one, by which higher accuracy of landslide susceptibility assessment and better prone areas management in QTP can be achieved compared to previous results. Graphical Abstrac

    Novel GIS based machine learning algorithms for shallow landslide susceptibility mapping

    Get PDF
    © 2018 by the authors. Licensee MDPI, Basel, Switzerland. The main objective of this research was to introduce a novel machine learning algorithm of alternating decision tree (ADTree) based on the multiboost (MB), bagging (BA), rotation forest (RF) and random subspace (RS) ensemble algorithms under two scenarios of different sample sizes and raster resolutions for spatial prediction of shallow landslides around Bijar City, Kurdistan Province, Iran. The evaluation of modeling process was checked by some statistical measures and area under the receiver operating characteristic curve (AUROC). Results show that, for combination of sample sizes of 60%/40% and 70%/30% with a raster resolution of 10 m, the RS model, while, for 80%/20% and 90%/10% with a raster resolution of 20 m, the MB model obtained a high goodness-of-fit and prediction accuracy. The RS-ADTree and MB-ADTree ensemble models outperformed the ADTree model in two scenarios. Overall, MB-ADTree in sample size of 80%/20% with a resolution of 20 m (area under the curve (AUC) = 0.942) and sample size of 60%/40% with a resolution of 10 m (AUC = 0.845) had the highest and lowest prediction accuracy, respectively. The findings confirm that the newly proposed models are very promising alternative tools to assist planners and decision makers in the task of managing landslide prone areas

    Financial distress prediction using the hybrid associative memory with translation

    Get PDF
    This paper presents an alternative technique for financial distress prediction systems. The method is based on a type of neural network, which is called hybrid associative memory with translation. While many different neural network architectures have successfully been used to predict credit risk and corporate failure, the power of associative memories for financial decision-making has not been explored in any depth as yet. The performance of the hybrid associative memory with translation is compared to four traditional neural networks, a support vector machine and a logistic regression model in terms of their prediction capabilities. The experimental results over nine real-life data sets show that the associative memory here proposed constitutes an appropriate solution for bankruptcy and credit risk prediction, performing significantly better than the rest of models under class imbalance and data overlapping conditions in terms of the true positive rate and the geometric mean of true positive and true negative rates.This work has partially been supported by the Mexican CONACYT through the Postdoctoral Fellowship Program [232167], the Spanish Ministry of Economy [TIN2013-46522-P], the Generalitat Valenciana [PROMETEOII/2014/062] and the Mexican PRODEP [DSA/103.5/15/7004]. We would like to thank the Reviewers for their valuable comments and suggestions, which have helped to improve the quality of this paper substantially

    Multi-label prediction method for lithology, lithofacies and fluid classes based on data augmentation by cascade forest

    Get PDF
    Predicting the lithology, lithofacies and reservoir fluid classes of igneous rocks holds significant value in the domains of CO2 storage and reservoir evaluation. However, no precedent exists for research on the multi-label identification of igneous rocks. This study proposes a multi-label data augmented cascade forest method for the prediction of multilabel lithology, lithofacies and fluid using 9 conventional logging data features of cores collected from the eastern depression of the Liaohe Basin in northeastern China. Data augmentation is performed on an unbalanced multi-label training set using the multi-label synthetic minority over-sampling technique. Sample training is achieved by a multi-label cascade forest consisting of predictive clustering trees. These cascade structures possess adaptive feature selection and layer growth mechanisms. Given the necessity to focus on all possible outcomes and the generalization ability of the method, a simulated well model is built and then compared with 6 typical multi-label learning methods. The outperformance of this method in the evaluation metrics validates its superiority in terms of accuracy and generalization ability. The consistency of the predicted results and geological data of actual wells verifies the reliability of our method. Furthermore, the results show that it can be used as a reliable means of multi-label prediction of igneous lithology, lithofacies and reservoir fluids.Document Type: Original articleCited as: Han, R., Wang, Z., Guo, Y., Wang, X., A, R., Zhong, G. Multi-label prediction method for lithology, lithofacies and fluid classes based on data augmentation by cascade forest. Advances in Geo-Energy Research, 2023, 9(1): 25-37. https://doi.org/10.46690/ager.2023.07.0

    A novel integrated approach of relevance vector machine optimized by imperialist competitive algorithm for spatial modeling of shallow landslides

    Get PDF
    This research aims at proposing a new artificial intelligence approach (namely RVM-ICA) which is based on the Relevance Vector Machine (RVM) and the Imperialist Competitive Algorithm (ICA) optimization for landslide susceptibility modeling. A Geographic Information System (GIS) spatial database was generated from Lang Son city in Lang Son province (Vietnam). This GIS database includes a landslide inventory map and fourteen landslide conditioning factors. The suitability of these factors for landslide susceptibility modeling in the study area was verified by the Information Gain Ratio (IGR) technique. A landslide susceptibility prediction model based on RVM-ICA and the GIS database was established by training and prediction phases. The predictive capability of the new approach was evaluated by calculations of sensitivity, specificity, accuracy, and the area under the Receiver Operating Characteristic curve (AUC). In addition, to assess the applicability of the proposed model, two state-of-the-art soft computing techniques including the support vector machine (SVM) and logistic regression (LR) were used as benchmark methods. The results of this study show that RVM-ICA with AUC = 0.92 achieved a high goodness-of-fit based on both the training and testing datasets. The predictive capability of RVM-ICA outperformed those of SVM with AUC = 0.91 and LR with AUC = 0.87. The experimental results confirm that the newly proposed model is a very promising alternative to assist planners and decision makers in the task of managing landslide prone areas

    Novel ensemble approaches of machine learning techniques in modeling the gully erosion susceptibility

    Full text link
    © 2020 by the authors. Gully erosion has become one of the major environmental issues, due to the severity of its impact in many parts of the world. Gully erosion directly and indirectly affects agriculture and infrastructural development. The Golestan Dam basin, where soil erosion and degradation are very severe problems, was selected as the study area. This research maps gully erosion susceptibility (GES) by integrating four models: maximum entropy (MaxEnt), artificial neural network (ANN), support vector machine (SVM), and general linear model (GLM). Of 1042 gully locations, 729 (70%) and 313 (30%) gully locations were used for modeling and validation purposes, respectively. Fourteen effective gully erosion conditioning factors (GECFs) were selected for spatial gully erosion modeling. Tolerance and variance inflation factors (VIFs) were used to examine the collinearity among the GECFs. The random forest (RF) model was used to assess factors' effectiveness and significance in gully erosion modeling. An ensemble of techniques can provide more accurate results than can single, standalone models. Therefore, we compared two-, three-, and four-model ensembles (ANN-SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, ANN-SVM-GLM, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM, MaxEnt-ANN-SVM and GLM-ANN-SVM-MaxEnt) for GES modeling. The susceptibility zones of the GESMs were classified as very-low, low, medium, high, and very-high using Jenks' natural break classification method (NBM). Subsequently, the receiver operating characteristics (ROC) curve and the seed cell area index (SCAI) methods measured the reliability of the models. The success rate curve (SRC) and predication rate curve (PRC) and their area under the curve (AUC) values were obtained from the GES maps. The results show that the ANN model combined with two and three models are more accurate than the other combinations, but the ANN-SVM model had the highest accuracy. The rank of the others from best to worst accuracy is GLM, MaxEnt, SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, GLM-ANN-SVM-MaxEnt, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM and MaxEnt-ANN-SVM. The resulting gully erosion susceptibility models (GESMs) are efficient and powerful and could be used to improve soil and water conservation and management

    Landslide Susceptibility Analysis on the Vicinity of Bogotá-Villavicencio Road (Eastern Cordillera of the Colombian Andes)

    Get PDF
    Landslide occurrence in Colombia is very frequent due to its geographical location in the Andean mountain range, with a very pronounced orography, a significant geological complexity and an outstanding climatic variability. More specifically, the study area around the Bogotá-Villavicencio road in the central sector of the Eastern Cordillera is one of the regions with the highest concentration of phenomena, which makes its study a priority. An inventory and detailed analysis of 2506 landslides has been carried out, in which five basic typologies have been differentiated: avalanches, debris flows, slides, earth flows and creeping areas. Debris avalanches and debris flows occur mainly in metamorphic materials (phyllites, schists and quartz-sandstones), areas with sparse vegetation, steep slopes and lower sections of hillslopes; meanwhile, slides, earth flows and creep occur in Cretaceous lutites, crop/grass lands, medium and low slopes and lower-middle sections of the hillslopes. Based on this analysis, landslide susceptibility models have been made for the different typologies and with different methods (matrix, discriminant analysis, random forest and neural networks) and input factors. The results are generally quite good, with average AUC-ROC values above 0.7–0.8, and the machine learning methods are the most appropriate, especially random forest, with a selected number of factors (between 6 and 8). The degree of fit (DF) usually shows relative errors lower than 5% and success higher than 90%. Finally, an integrated landslide susceptibility map (LSM) has been made for shallower and deeper types of movements. All the LSM show a clear zonation as a consequence of the geological control of the susceptibility

    Landslide Susceptibility Analysis on the Vicinity of Bogotá-Villavicencio Road (Eastern Cordillera of the Colombian Andes)

    Get PDF
    first_pagesettingsOrder Article Reprints Open AccessArticle Landslide Susceptibility Analysis on the Vicinity of Bogotá-Villavicencio Road (Eastern Cordillera of the Colombian Andes) by María Camila Herrera-Coy 1ORCID,Laura Paola Calderón 2,Iván Leonardo Herrera-Pérez 1,3,Paul Esteban Bravo-López 1,4ORCID,Christian Conoscenti 2ORCID,Jorge Delgado 1ORCID,Mario Sánchez-Gómez 5,6ORCID andTomás Fernández 1,6,*ORCID 1 Department of Cartographic, Geodetic and Photogrammetric Engineering, University of Jaén, 23071 Jaén, Spain 2 Department of Earth and Marine Sciences (DiSTeM), University of Palermo, 90123 Palermo, Italy 3 Department of Geographic and Environmental Engineering, University of Applied and Environmental Sciences (U.D.C.A.), Bogotá 111166, Colombia 4 Institute for Studies of Sectional Regime of Ecuador (IERSE), University of Azuay, Cuenca 010107, Ecuador 5 Department of Geology, University of Jaén, 23071 Jaén, Spain 6 Natural Hazards Lab of the Centre for Advanced Studies in Earth Sciences, Energy and Environment (CEACTEMA), University of Jaén, 23071 Jaén, Spain * Author to whom correspondence should be addressed. Remote Sens. 2023, 15(15), 3870; https://doi.org/10.3390/rs15153870 Received: 11 June 2023 / Revised: 24 July 2023 / Accepted: 31 July 2023 / Published: 4 August 2023 (This article belongs to the Special Issue Remote Sensing Techniques for Landslides Studies and Their Hazards Assessment) Download Browse Figures Versions Notes Abstract Landslide occurrence in Colombia is very frequent due to its geographical location in the Andean mountain range, with a very pronounced orography, a significant geological complexity and an outstanding climatic variability. More specifically, the study area around the Bogotá-Villavicencio road in the central sector of the Eastern Cordillera is one of the regions with the highest concentration of phenomena, which makes its study a priority. An inventory and detailed analysis of 2506 landslides has been carried out, in which five basic typologies have been differentiated: avalanches, debris flows, slides, earth flows and creeping areas. Debris avalanches and debris flows occur mainly in metamorphic materials (phyllites, schists and quartz-sandstones), areas with sparse vegetation, steep slopes and lower sections of hillslopes; meanwhile, slides, earth flows and creep occur in Cretaceous lutites, crop/grass lands, medium and low slopes and lower-middle sections of the hillslopes. Based on this analysis, landslide susceptibility models have been made for the different typologies and with different methods (matrix, discriminant analysis, random forest and neural networks) and input factors. The results are generally quite good, with average AUC-ROC values above 0.7–0.8, and the machine learning methods are the most appropriate, especially random forest, with a selected number of factors (between 6 and 8). The degree of fit (DF) usually shows relative errors lower than 5% and success higher than 90%. Finally, an integrated landslide susceptibility map (LSM) has been made for shallower and deeper types of movements. All the LSM show a clear zonation as a consequence of the geological control of the susceptibility.Incluye referencias bibliográfica
    corecore