1,047 research outputs found

    TSE-IDS: A Two-Stage Classifier Ensemble for Intelligent Anomaly-based Intrusion Detection System

    Get PDF
    Intrusion detection systems (IDS) play a pivotal role in computer security by discovering and repealing malicious activities in computer networks. Anomaly-based IDS, in particular, rely on classification models trained using historical data to discover such malicious activities. In this paper, an improved IDS based on hybrid feature selection and two-level classifier ensembles is proposed. An hybrid feature selection technique comprising three methods, i.e. particle swarm optimization, ant colony algorithm, and genetic algorithm, is utilized to reduce the feature size of the training datasets (NSL-KDD and UNSW-NB15 are considered in this paper). Features are selected based on the classification performance of a reduced error pruning tree (REPT) classifier. Then, a two-level classifier ensembles based on two meta learners, i.e., rotation forest and bagging, is proposed. On the NSL-KDD dataset, the proposed classifier shows 85.8% accuracy, 86.8% sensitivity, and 88.0% detection rate, which remarkably outperform other classification techniques recently proposed in the literature. Results regarding the UNSW-NB15 dataset also improve the ones achieved by several state of the art techniques. Finally, to verify the results, a two-step statistical significance test is conducted. This is not usually considered by IDS research thus far and, therefore, adds value to the experimental results achieved by the proposed classifier

    Air Quality Prediction in Smart Cities Using Machine Learning Technologies Based on Sensor Data: A Review

    Get PDF
    The influence of machine learning technologies is rapidly increasing and penetrating almost in every field, and air pollution prediction is not being excluded from those fields. This paper covers the revision of the studies related to air pollution prediction using machine learning algorithms based on sensor data in the context of smart cities. Using the most popular databases and executing the corresponding filtration, the most relevant papers were selected. After thorough reviewing those papers, the main features were extracted, which served as a base to link and compare them to each other. As a result, we can conclude that: (1) instead of using simple machine learning techniques, currently, the authors apply advanced and sophisticated techniques, (2) China was the leading country in terms of a case study, (3) Particulate matter with diameter equal to 2.5 micrometers was the main prediction target, (4) in 41% of the publications the authors carried out the prediction for the next day, (5) 66% of the studies used data had an hourly rate, (6) 49% of the papers used open data and since 2016 it had a tendency to increase, and (7) for efficient air quality prediction it is important to consider the external factors such as weather conditions, spatial characteristics, and temporal features

    Improved Genetic Algorithm Multilayer Perceptron Network For Data Classification

    Get PDF
    Secara umumnya, algoritma genetik (GA) konvensional mempunyai beberapa kelemahan seperti penumpuan pramatang, kecenderungan terperangkap pada penyelesaian optima setempat dan ketidakupayaan penalaan di sekitar kawasan berpotensi. Oleh itu, GA ditambahbaik dengan strategi pencarian, penghasilan semula dan elitisma baharu dicadangkan dalam kajian ini. Pernambahbaikan pertama melibatkan perubahan kepada struktur operasi GA yang mana ia menumpu pencarian di sekitar kawasan berpotensi tinggi. Kedua, teknik baharu penghasilan semula yang dinamakan Segmented Multi-Chromosome Crossover (SMCC) telah diperkenalkan. Teknik tersebut mengelak kemusnahan maklumat hampir optima yang terkandung dalam segmen genetik dan membolehkan generasi baharu mewarisi maklumat penting daripada berbilang induk. Ketiganya, tiga jenis variasi elitism dinamakan sebagai Best Among Normal and Improved Population (BANI), Best Between Similar Rank (BBSR) dan Equally Contributed (EQ) telah dibangunkan. Ia melibatkan pertandingan di kalangan individu terbaik daripada populasi normal dan ditambahbaik untuk kelangsungan pada generasi selepasnya. GA yang ditambahbaik kemudiannya digunakan untuk mengoptimasi dan merekabentuk rangkaian perseptron berbilang lapisan (MLP) secara automatik bagi penyelesaian masalah pengkelasan corak. Bilangan nod terlindung, nilai pemberat sambungan awalan dan pemilihan ciri MLP yang memainkan peranan penting dalam menentukan prestasi pengkelasan dipilih untuk dioptimasi secara automatik oleh GA ditambahbaik. Prestasi GA ditambahbaik telah dinilai menggunakan fungsi ujian penanda aras yangv rumit serta berbilang mod dan dibandingkan dengan GA piawai. Berdasarkan kekerapan sesuatu algoritma menghasilkan keputusan terbaik terhadap fungsi ujian yang berbeza; ianya telah terbukti bahawa prestasi teknik yang dicadangkan mengatasi GA piawai. BANI, BBSR dan EQ mencatatkan 30, 18 dan 17 kekerapan keputusan terbaik masing-masing berbanding GA piawai yang hanya mencatatkan 3 keputusan terbaik. Manakala, prestasi pengkelasan GA-MLP yang ditambahbaik telah dinilai menggunakan set-set data yang berbeza dari segi saiz ciri kemasukan dan bilangan kelas keluaran. Keputusan menunjukkan keberkesanan algoritma baharu daripada segi peratusan ujian kejituan. Peratus peningkatan keseluruhan sebanyak 0.6%, 0.1% dan 0.3% bagi ujian kejituan dicatatkan oleh BANI, BBSR dan EQ berbanding dengan GA-MLP piawai. ____________________________________________________________ In general, conventional genetic algorithm (GA) has several drawbacks such as premature convergence, high tendency to get trapped in local optima solution and incapable of fine tuning around potential region. Thus, new improved GA that focuses on new search, reproduction and elitism strategy is proposed in this study. The first improvement involves changes in the operational structure of GA in which it concentrates the search in highly potential area in the search region. Secondly, a novel reproduction technique called Segmented Multi-Chromosome Crossover (SMCC) is introduced. The proposed technique avoids the destruction of nearly optimal information contained in the gene segment and allows offspring to inherit highly important information among multiple parents. Thirdly, three new variations of elitism scheme namely Best Among Normal and Improved Population (BANI), Best Between Similar Rank (BBSR) and Equally Contributed (EQ) are developed. It involves competition among best individuals from normal and improved population to ensure survival in the next generation. The improved GA is then applied for optimization and automatic design of multilayer perceptron (MLP) neural network in solving pattern classification problem. Hidden node size, initial weights and feature selection of the MLP that play significant role in the classification performance are selected to be automatically optimized by the improved GA. The performance of improved GA has been evaluated using highly complicated and multimodal benchmark test functions and compared with the standard GA. Based on the occurrences of the best result obtained by an algorithm across different test functions; it is proven that the proposed method outperforms standard GA. BANI, BBSR and EQ scores 30, 18 and 17 occurrences respectively compared to the standard GA that only scores 3 occurrences. Meanwhile, the improved GA-MLP classification performance has been evaluated using datasets that vary in input features and output sizes. The results demonstrate the effectiveness of the new algorithms in term of test accuracy percentage. There is an overall improvement of 0.6%, 0.1% and 0.3% in test accuracy of BANI, BBSR and EQ compared to the standard GA-MLP

    Effect of nano black rice husk ash on the chemical and physical properties of porous concrete pavement

    Get PDF
    Black rice husk is a waste from this agriculture industry. It has been found that majority inorganic element in rice husk is silica. In this study, the effect of Nano from black rice husk ash (BRHA) on the chemical and physical properties of concrete pavement was investigated. The BRHA produced from uncontrolled burning at rice factory was taken. It was then been ground using laboratory mill with steel balls and steel rods. Four different grinding grades of BRHA were examined. A rice husk ash dosage of 10% by weight of binder was used throughout the experiments. The chemical and physical properties of the Nano BRHA mixtures were evaluated using fineness test, X-ray Fluorescence spectrometer (XRF) and X-ray diffraction (XRD). In addition, the compressive strength test was used to evaluate the performance of porous concrete pavement. Generally, the results show that the optimum grinding time was 63 hours. The result also indicated that the use of Nano black rice husk ash ground for 63hours produced concrete with good strengt

    English character recognition algorithm by improving the weights of MLP neural network with dragonfly algorithm

    Get PDF
    Character Recognition (CR) is taken into consideration for years. Meanwhile, the neural network plays an important role in recognizing handwritten characters. Many character identification reports have been publishing in English, but still the minimum training timing and high accuracy of handwriting English symbols and characters by utilizing a method of neural networks are represents as open problems. Therefore, creating a character recognition system manually and automatically is very important. In this research, an attempt has been done to incubate an automatic symbols and character system for recognition for English with minimum training and a very high recognition accuracy and classification timing. In the proposed idea for improving the weights of the MLP neural network method in the process of teaching and learning character recognition, the dragonfly optimization algorithm has been used. The innovation of the proposed detection system is that with a combination of dragonfly optimization technique and MLP neural networks, the precisions of the system are recovered, and the computing time is minimized. The approach which was used in this study to identify English characters has high accuracy and minimum training time

    Parameter optimization of evolving spiking neural networks using improved firefly algorithm for classification tasks

    Get PDF
    Evolving Spiking Neural Network (ESNN) is the third generation of artificial neural network that has been widely used in numerous studies in recent years. However, there are issues of ESSN that need to be improved; one of which is its parameters namely the modulation factor (Mod), similarity factor (Sim) and threshold factor (C) that have to be manually tuned for optimal values that are suitable for any particular problem. The objective of the proposed work is to automatically determine the optimum values of the ESNN parameters for various datasets by integrating the Firefly Algorithm (FA) optimizer into the ESNN training phase and adaptively searching for the best parameter values. In this study, FA has been modified and improved, and was applied to improve the accuracy of ESNN structure and rates of classification accuracy. Five benchmark datasets from University of California, Irvine (UCI) Machine Learning Repository, have been used to measure the effectiveness of the integration model. Performance analysis of the proposed work was conducted by calculating classification accuracy, and compared with other parameter optimisation methods. The results from the experimentation have proven that the proposed algorithms have attained the optimal parameters values for ESNN
    corecore