11,316 research outputs found

    Granular risk assessment of earthquake induced landslide via latent representations of stacked autoencoder

    Get PDF
    Earthquake-induced landslides are ubiquitous on slopes in terrestrial environments, which can pose a serious threat to local communities and infrastructures. Data-driven landslide assessments play a crucial role in preventing future landslide occurrences and recurrences. We present a novel granular computing approach that assesses landslide risk by combining fuzzy information granulation and a stacked autoencoder algorithm. The stacked autoencoder is trained using an end-to-end learning strategy to obtain a central latent vector with reduced dimensionality. The multivariate landslide dataset was used as both the input and output to train the stacked autoencoder algorithm. Subsequently, in the central latent vector of the stacked autoencoder, the Fuzzy C-means clustering algorithm was applied to cluster the landslides into various groups with different risk levels, and the intervals for each group were computed using the granular computing approach. An empirical case study in Wenchuan County, Sichuan, China, was conducted. A comparative analysis with other state-of-the-art approaches including Density-based spatial clustering of applications with noise (DBSCAN), K-means clustering, and Principal Component Analysis (PCA), is provided and discussed. The experimental results demonstrate that the proposed approach using a stacked autoencoder integrated with fuzzy information granulation provides superior performance compared to those by other state-of-the-art approaches, and is capable of studying deep patterns in earthquake-induced landslide datasets and provides sufficient interpretation for field engineers

    A systematic review of data quality issues in knowledge discovery tasks

    Get PDF
    Hay un gran crecimiento en el volumen de datos porque las organizaciones capturan permanentemente la cantidad colectiva de datos para lograr un mejor proceso de toma de decisiones. El desafío mas fundamental es la exploración de los grandes volúmenes de datos y la extracción de conocimiento útil para futuras acciones por medio de tareas para el descubrimiento del conocimiento; sin embargo, muchos datos presentan mala calidad. Presentamos una revisión sistemática de los asuntos de calidad de datos en las áreas del descubrimiento de conocimiento y un estudio de caso aplicado a la enfermedad agrícola conocida como la roya del café.Large volume of data is growing because the organizations are continuously capturing the collective amount of data for better decision-making process. The most fundamental challenge is to explore the large volumes of data and extract useful knowledge for future actions through knowledge discovery tasks, nevertheless many data has poor quality. We presented a systematic review of the data quality issues in knowledge discovery tasks and a case study applied to agricultural disease named coffee rust
    corecore