6 research outputs found

    Smart Attacks Learning Machine Advisor System for Protecting Smart Cities from Smart Threats

    No full text
    The extensive use of Internet of Things (IoT) technology has recently enabled the development of smart cities. Smart cities operate in real-time to improve metropolitan areas’ comfort and efficiency. Sensors in these IoT devices are immediately linked to enormous servers, creating smart city traffic flow. This flow is rapidly increasing and is creating new cybersecurity concerns. Malicious attackers increasingly target essential infrastructure such as electricity transmission and other vital infrastructures. Software-Defined Networking (SDN) is a resilient connectivity technology utilized to address security concerns more efficiently. The controller, which oversees the flows of each appropriate forwarding unit in the SDN architecture, is the most critical component. The controller’s flow statistics are thought to provide relevant information for building an Intrusion Detection System (IDS). As a result, we propose a five-level classification approach based on SDN’s flow statistics to develop a Smart Attacks Learning Machine Advisor (SALMA) system for detecting intrusions and for protecting smart cities from smart threats. We use the Extreme Learning Machine (ELM) technique at all levels. The proposed system was implemented on the NSL-KDD and KDDCUP99 benchmark datasets, and achieved 95% and 99.2%, respectively. As a result, our approach provides an effective method for detecting intrusions in SDNs

    Evaluating Explainable Artificial Intelligence Methods Based on Feature Elimination: A Functionality-Grounded Approach

    No full text
    Although predictions based on machine learning are reaching unprecedented levels of accuracy, understanding the underlying mechanisms of a machine learning model is far from trivial. Therefore, explaining machine learning outcomes is gaining more interest with an increasing need to understand, trust, justify, and improve both the predictions and the prediction process. This, in turn, necessitates providing mechanisms to evaluate explainability methods as well as to measure their ability to fulfill their designated tasks. In this paper, we introduce a technique to extract the most important features from a data perspective. We propose metrics to quantify the ability of an explainability method to convey and communicate the underlying concepts available in the data. Furthermore, we evaluate the ability of an eXplainable Artificial Intelligence (XAI) method to reason about the reliance of a Machine Learning (ML) model on the extracted features. Through experiments, we further, prove that our approach enables differentiating explainability methods independent of the underlying experimental settings. The proposed metrics can be used to functionally evaluate the extent to which an explainability method is able to extract the patterns discovered by a machine learning model. Our approach provides a means to quantitatively differentiate global explainability methods in order to deepen user trust not only in the predictions generated but also in their explanations

    Multi-Label Active Learning-Based Machine Learning Model for Heart Disease Prediction

    No full text
    The rapid growth and adaptation of medical information to identify significant health trends and help with timely preventive care have been recent hallmarks of the modern healthcare data system. Heart disease is the deadliest condition in the developed world. Cardiovascular disease and its complications, including dementia, can be averted with early detection. Further research in this area is needed to prevent strokes and heart attacks. An optimal machine learning model can help achieve this goal with a wealth of healthcare data on heart disease. Heart disease can be predicted and diagnosed using machine-learning-based systems. Active learning (AL) methods improve classification quality by incorporating user–expert feedback with sparsely labelled data. In this paper, five (MMC, Random, Adaptive, QUIRE, and AUDI) selection strategies for multi-label active learning were applied and used for reducing labelling costs by iteratively selecting the most relevant data to query their labels. The selection methods with a label ranking classifier have hyperparameters optimized by a grid search to implement predictive modelling in each scenario for the heart disease dataset. Experimental evaluation includes accuracy and F-score with/without hyperparameter optimization. Results show that the generalization of the learning model beyond the existing data for the optimized label ranking model uses the selection method versus others due to accuracy. However, the selection method was highlighted in regards to the F-score using optimized settings

    Energy Efficiency through the Implementation of an AI Model to Predict Room Occupancy Based on Thermal Comfort Parameters

    No full text
    Room occupancy prediction based on indoor environmental quality may be the breakthrough to ensure energy efficiency and establish an interior ambience tailored to each user. Identifying whether temperature, humidity, lighting, and CO2 levels may be used as efficient predictors of room occupancy accuracy is needed to help designers better utilize the readings and data collected in order to improve interior design, in an effort to better suit users. It also aims to help in energy efficiency and saving in an ever-increasing energy crisis and dangerous levels of climate change. This paper evaluated the accuracy of room occupancy recognition using a dataset with diverse amounts of light, CO2, and humidity. As classification algorithms, K-nearest neighbors (KNN), hybrid Adam optimizer–artificial neural network–back-propagation network (AO–ANN (BP)), and decision trees (DT) were used. Furthermore, this research is based on machine learning interpretability methodologies. Shapley additive explanations (SHAP) improve interpretability by estimating the significance values for each feature for classifiers applied. The results indicate that the KNN performs better than the DT and AO-ANN (BP) classification models have 99.5%. Though the two classifiers are designed to evaluate variations in interpretations, we must ensure that they have accurate detection. The results show that SHAP provides successful implementation following these metrics, with differences detected amongst classifier models that support the assumption that model complexity plays a significant role when predictability is taken into account

    Energy Efficiency through the Implementation of an AI Model to Predict Room Occupancy Based on Thermal Comfort Parameters

    No full text
    Room occupancy prediction based on indoor environmental quality may be the breakthrough to ensure energy efficiency and establish an interior ambience tailored to each user. Identifying whether temperature, humidity, lighting, and CO2 levels may be used as efficient predictors of room occupancy accuracy is needed to help designers better utilize the readings and data collected in order to improve interior design, in an effort to better suit users. It also aims to help in energy efficiency and saving in an ever-increasing energy crisis and dangerous levels of climate change. This paper evaluated the accuracy of room occupancy recognition using a dataset with diverse amounts of light, CO2, and humidity. As classification algorithms, K-nearest neighbors (KNN), hybrid Adam optimizer–artificial neural network–back-propagation network (AO–ANN (BP)), and decision trees (DT) were used. Furthermore, this research is based on machine learning interpretability methodologies. Shapley additive explanations (SHAP) improve interpretability by estimating the significance values for each feature for classifiers applied. The results indicate that the KNN performs better than the DT and AO-ANN (BP) classification models have 99.5%. Though the two classifiers are designed to evaluate variations in interpretations, we must ensure that they have accurate detection. The results show that SHAP provides successful implementation following these metrics, with differences detected amongst classifier models that support the assumption that model complexity plays a significant role when predictability is taken into account

    Fine-Tuning Fuzzy KNN Classifier Based on Uncertainty Membership for the Medical Diagnosis of Diabetes

    No full text
    Diabetes, a metabolic disease in which the blood glucose level rises over time, is one of the most common chronic diseases at present. It is critical to accurately predict and classify diabetes to reduce the severity of the disease and treat it early. One of the difficulties that researchers face is that diabetes datasets are limited and contain outliers and missing data. Additionally, there is a trade-off between classification accuracy and operational law for detecting diabetes. In this paper, an algorithm for diabetes classification is proposed for pregnant women using the Pima Indians Diabetes Dataset (PIDD). First, a preprocessing step in the proposed algorithm includes outlier rejection, imputing missing values, the standardization process, and feature selection of the attributes, which enhance the dataset’s quality. Second, the classifier uses the fuzzy KNN method and modifies the membership function based on the uncertainty theory. Third, a grid search method is applied to achieve the best values for tuning the fuzzy KNN method based on uncertainty membership, as there are hyperparameters that affect the performance of the proposed classifier. In turn, the proposed tuned fuzzy KNN based on uncertainty classifiers (TFKNN) deals with the belief degree, handles membership functions and operation law, and avoids making the wrong categorization. The proposed algorithm performs better than other classifiers that have been trained and evaluated, including KNN, fuzzy KNN, naïve Bayes (NB), and decision tree (DT). The results of different classifiers in an ensemble could significantly improve classification precision. The TFKNN has time complexity O(kn2d), and space complexity O(n2d). The TFKNN model has high performance and outperformed the others in all tests in terms of accuracy, specificity, precision, and average AUC, with values of 90.63, 85.00, 93.18, and 94.13, respectively. Additionally, results of empirical analysis of TFKNN compared to fuzzy KNN, KNN, NB, and DT demonstrate the global superiority of TFKNN in precision, accuracy, and specificity
    corecore