Search CORE

6 research outputs found

Support matrix machine: A review

Author: Akhtar Mushir
Kumari Anuradha
Shah Rupal
Tanveer M.
Publication venue
Publication date: 30/10/2023
Field of study

Support vector machine (SVM) is one of the most studied paradigms in the realm of machine learning for classification and regression problems. It relies on vectorized input data. However, a significant portion of the real-world data exists in matrix format, which is given as input to SVM by reshaping the matrices into vectors. The process of reshaping disrupts the spatial correlations inherent in the matrix data. Also, converting matrices into vectors results in input data with a high dimensionality, which introduces significant computational complexity. To overcome these issues in classifying matrix input data, support matrix machine (SMM) is proposed. It represents one of the emerging methodologies tailored for handling matrix input data. The SMM method preserves the structural information of the matrix data by using the spectral elastic net property which is a combination of the nuclear norm and Frobenius norm. This article provides the first in-depth analysis of the development of the SMM model, which can be used as a thorough summary by both novices and experts. We discuss numerous SMM variants, such as robust, sparse, class imbalance, and multi-class classification models. We also analyze the applications of the SMM model and conclude the article by outlining potential future research avenues and possibilities that may motivate academics to advance the SMM algorithm

arXiv.org e-Print Archive

Selection of contributing factors for predicting landslide susceptibility using machine learning and deep learning models

Author: Chen Cheng
Fan Lei
Publication venue
Publication date: 12/09/2023
Field of study

Landslides are a common natural disaster that can cause casualties, property safety threats and economic losses. Therefore, it is important to understand or predict the probability of landslide occurrence at potentially risky sites. A commonly used means is to carry out a landslide susceptibility assessment based on a landslide inventory and a set of landslide contributing factors. This can be readily achieved using machine learning (ML) models such as logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (Xgboost), or deep learning (DL) models such as convolutional neural network (CNN) and long short time memory (LSTM). As the input data for these models, landslide contributing factors have varying influences on landslide occurrence. Therefore, it is logically feasible to select more important contributing factors and eliminate less relevant ones, with the aim of increasing the prediction accuracy of these models. However, selecting more important factors is still a challenging task and there is no generally accepted method. Furthermore, the effects of factor selection using various methods on the prediction accuracy of ML and DL models are unclear. In this study, the impact of the selection of contributing factors on the accuracy of landslide susceptibility predictions using ML and DL models was investigated. Four methods for selecting contributing factors were considered for all the aforementioned ML and DL models, which included Information Gain Ratio (IGR), Recursive Feature Elimination (RFE), Particle Swarm Optimization (PSO), Least Absolute Shrinkage and Selection Operators (LASSO) and Harris Hawk Optimization (HHO). In addition, autoencoder-based factor selection methods for DL models were also investigated. To assess their performances, an exhaustive approach was adopted,...Comment: Stochastic Environmental Research and Risk Assessmen

arXiv.org e-Print Archive

Design of Multi-View Based Email Classification for IoT Systems via Semi-Supervised Learning

Author: Li Wenjuan
Meng Weizhi
Tan Zhiyuan
Xiang Yang
Publication venue: Elsevier
Publication date: 15/02/2019
Field of study

Suspicious emails are one big threat for Internet of Things (IoT) security, which aim to induce users to click and then redirect them to a phishing webpage. To protect IoT systems, email classification is an essential mechanism to classify spam and legitimate emails. In the literature, most email classification approaches adopt supervised learning algorithms that require a large number of labeled data for classifier training. However, data labeling is very time consuming and expensive, making only a very small set of data available in practice, which would greatly degrade the effectiveness of email classification. To mitigate this problem, in this work, we develop an email classification approach based on multi-view disagreement-based semi-supervised learning. The idea behind is that multi-view method can offer richer information for classification, which is often ignored by literature. The use of semi-supervised learning can help leverage both labeled and unlabeled data. In the evaluation, we investigate the performance of our proposed approach with datasets and in real network environments. Experimental results demonstrate that multi-view can achieve better classification performance than single view, and that our approach can achieve better performance as compared to the existing similar algorithms

Deakin Research Online

Repository@Napier

Online Research Database In Technology

Solución rápida y automática de parámetros hipocentrales para eventos sísmicos, mediante el empleo de técnicas de aprendizaje de máquina

Author: Ochoa Gutiérrez Luis Hernán
Publication venue
Publication date: 15/11/2016
Field of study

La generación de alertas tempranas para sismos es de gran utilidad, en particular para la ciudad de Bogotá-Colombia, dada su importancia social y económica para el país. Con base en la información de la estación sismológica de El Rosal, la cual es una estación de banda ancha y tres componentes, localizada muy cerca de la ciudad, perteneciente al Servicio Geológico Colombiano (SGC) se desarrolló un modelo de regresión basado en máquinas de vectores de soporte (SVM), con un kernel polinomial normalizado, usando como datos de entrada algunas características de la porción inicial de la onda P empleadas en trabajos anteriores tales como la amplitud máxima, los coeficientes de regresión lineal de los mismos, los parámetros de ajuste logarítmico de la envolvente y los valores propios de la relación de las tres componentes del sismograma. El modelo fue entrenado y evaluado aplicando correlación cruzada, permitiendo llevar a cabo el cálculo de la magnitud y la localización de un evento sísmico con una longitud de señal de tan solo cinco segundos. Con el modelo propuesto se logró la determinación de la magnitud local con una precisión de 0.19 unidades de magnitud, la distancia epicentral con una precisión de alrededor de 11 kilómetros, la profundidad hipocentral con una precisión de aproximadamente 40 kilómetros y el azimut de llegada con una precisión de 45°. Las precisiones obtenidas en magnitud y distancia epicentral son mejores que las encontradas en trabajos anteriores, donde se emplean gran número de eventos para la determinación del modelo y en los demás parámetros hipocentrales son del mismo orden. Este trabajo de investigación realiza un aporte considerable en la generación de alertas tempranas para sismos, no solamente para el país sino para cualquier otro lugar donde se deseen implementar los modelos aquí propuestos y es un excelente punto de partida para investigaciones futuras.Abstract. Earthquake early warning alerts generation is very useful, especially for the city of Bogotá-Colombia, given the social and economic importance of this city for the country. Based on the information from the seismological station “El Rosal”, which is a broadband and three components station, located very near the city that belongs to the Servicio Geológico Colombiano (SGC) a Support Vector Machine Regression (SVMR) model was developed, using a Normalized Polynomial Kernel, using as input some characteristics of the initial portion of the P wave used in earlier works such as the maximum amplitude, the linear regression coefficients of such amplitudes, the logarithmic adjustment parameters of the envelope of the waveform and the eigenvalues of the relationship between the three seismogram components of each band. The model was trained and evaluated by applying a cross-correlation strategy, allowing to calculate the magnitude and location of a seismic event with only five seconds of signal. With the proposed model it was possible to estimate local magnitude with an accuracy of 0.19 units of magnitude, epicentral distance with an accuracy of about 11 km, the hipocentral depth with a precision of approximately 40 km and the arrival back-azimut with a precision of 45°. Accuracies obtained in magnitude and epicentral distance are better that those found in earlier works, where a large number of events were used for model determination, and the other hipocentral parameters precisions obtained here are of the same order. This research work makes a considerable contribution in the generation of seismic early warning alerts, not only for the country but for any other place where proposed models here can be applied and is a very good starting point for future research.Doctorad

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Nacional De Colombia - Repositorio Institucional UN