Search CORE

20 research outputs found

ServeNet: A Deep Neural Network for Web Services Classification

Author: Grolinger Katarina
Li Zhi
Liao Zhifang
Liu Peng
Qamar Nafees
Wang Weiru
Yang Yilong
Publication venue
Publication date: 06/08/2020
Field of study

Automated service classification plays a crucial role in service discovery, selection, and composition. Machine learning has been widely used for service classification in recent years. However, the performance of conventional machine learning methods highly depends on the quality of manual feature engineering. In this paper, we present a novel deep neural network to automatically abstract low-level representation of both service name and service description to high-level merged features without feature engineering and the length limitation, and then predict service classification on 50 service categories. To demonstrate the effectiveness of our approach, we conduct a comprehensive experimental study by comparing 10 machine learning methods on 10,000 real-world web services. The result shows that the proposed deep neural network can achieve higher accuracy in classification and more robust than other machine learning methods.Comment: Accepted by ICWS'2

arXiv.org e-Print Archive

Crossref

Web server load prediction and anomaly detection from hypertext transfer protocol logs

Author: Benova Lenka
Hudec Ladislav
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/10/2023
Field of study

As network traffic increases and new intrusions occur, anomaly detection solutions based on machine learning are necessary to detect previously unknown intrusion patterns. Most of the developed models require a labelled dataset, which can be challenging owing to a shortage of publicly available datasets. These datasets are often too small to effectively train machine learning models, which further motivates the use of real unlabeled traffic. By using real traffic, it is possible to more accurately simulate the types of anomalies that might occur in a real-world network and improve the performance of the detection model. We present a method able to predict and categorize anomalies without the aid of a labelled dataset, demonstrating the model’s usability while also gathering a dataset from real noisy network traffic. The proposed long short-term memory (LTSM) based intrusion detection system was tested in a real-world setting of an antivirus company and was successful in detecting various intrusions using 5-minute windowing over both the predicted and real update curves thereby demonstrating its usefulness. Our contribution was the development of a robust model generally applicable to any hypertext transfer protocol (HTTP) traffic with almost real-time anomaly detection, while also outperforming earlier studies in terms of prediction accuracy

Institute of Advanced Engineering and Science

Deep Anomaly Detection for Time-series Data in Industrial IoT: A Communication-Efficient On-device Federated Learning Approach

Author: Garg Sahil
Hossain M. Shamim
Kang Jiawen
Liu Yi
Nie Jiangtian
Xiong Zehui
Zhang Yang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Since edge device failures (i.e., anomalies) seriously affect the production of industrial products in Industrial IoT (IIoT), accurately and timely detecting anomalies is becoming increasingly important. Furthermore, data collected by the edge device may contain the user's private data, which is challenging the current detection approaches as user privacy is calling for the public concern in recent years. With this focus, this paper proposes a new communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT. Specifically, we first introduce a FL framework to enable decentralized edge devices to collaboratively train an anomaly detection model, which can improve its generalization ability. Second, we propose an Attention Mechanism-based Convolutional Neural Network-Long Short Term Memory (AMCNN-LSTM) model to accurately detect anomalies. The AMCNN-LSTM model uses attention mechanism-based CNN units to capture important fine-grained features, thereby preventing memory loss and gradient dispersion problems. Furthermore, this model retains the advantages of LSTM unit in predicting time series data. Third, to adapt the proposed framework to the timeliness of industrial anomaly detection, we propose a gradient compression mechanism based on Top-\textit{k} selection to improve communication efficiency. Extensive experiment studies on four real-world datasets demonstrate that the proposed framework can accurately and timely detect anomalies and also reduce the communication overhead by 50\% compared to the federated learning framework that does not use a gradient compression scheme.Comment: IEEE Internet of Things Journa

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Forecasting energy consumption of wastewater treatment plants with a transfer learning approach for sustainable cities

Author: Analide Cesar
Fernandes Bruno
Novais Paulo
Oliveira Pedro
Publication venue: 'MDPI AG'
Publication date: 12/05/2021
Field of study

A major challenge of today’s society is to make large urban centres more sustainable. Improving the energy efficiency of the various infrastructures that make up cities is one aspect being considered when improving their sustainability, with Wastewater Treatment Plants (WWTPs) being one of them. Consequently, this study aims to conceive, tune, and evaluate a set of candidate deep learning models with the goal being to forecast the energy consumption of a WWTP, following a recursive multi-step approach. Three distinct types of models were experimented, in particular, Long Short-Term Memory networks (LSTMs), Gated Recurrent Units (GRUs), and uni-dimensional Convolutional Neural Networks (CNNs). Uni- and multi-variate settings were evaluated, as well as different methods for handling outliers. Promising forecasting results were obtained by CNN-based models, being this difference statistically significant when compared to LSTMs and GRUs, with the best model presenting an approximate overall error of 630 kWh when on a multi-variate setting. Finally, to overcome the problem of data scarcity in WWTPs, transfer learning processes were implemented, with promising results being achieved when using a pre-trained uni-variate CNN model, with the overall error reducing to 325 kWh.The work of Paulo Novais and Cesar Analide has been supported by FCT-Fundação para a Ciência e Tecnologia within the R&D Units Project Scope: UIDB/00319/2020. The work of Pedro Oliveria and Bruno Fernandes is also supported by National Funds through the Portuguese funding agency, FCT-Fundação para a Ciência e a Tecnologia within project DSAIPA/AI/0099/2019

Multidisciplinary Digital Publishing Institute

Universidade do Minho: RepositoriUM

Monitoring the misalignment of machine tools with autoencoders after they are trained with transfer learning data

Author: Demetgul Mustafa
Fleischer Jürgen
Tansel Ibrahim Nur
Zheng Qi
Publication venue: Springer
Publication date: 29/08/2023
Field of study

CNC machines have revolutionized manufacturing by enabling high-quality and high-productivity production. Monitoring the condition of these machines during production would reduce maintenance cost and avoid manufacturing defective parts. Misalignment of the linear tables in CNCs can directly affect the quality of the manufactured parts, and the components of the linear tables wear out over time due to the heavy and fluctuating loads. To address these challenges, an intelligent monitoring system was developed to identify normal operation and misalignments. Since damaging a CNC machine for data collection is too expensive, transfer learning was used in two steps. First, a specially designed experimental feed axis test platform (FATP) was used to sample the current signal at normal and five levels of left-side misalignment conditions ranging from 0.05 to 0.25 mm. Four different algorithm combinations were trained to detect misalignments. These combinations included a 1D convolution neural network (CNN) and autoencoder (AE) combination, a temporal convolutional network (TCN) and AE combination, a long short-term memory neural network (LSTM) and AE combination, and a CNN, LSTM, and AE combination. At the second step, Wasserstein deep convolutional generative adversarial network (W-DCGAN) was used to generate data by integrating the observed characteristics of the FATP at different misalignment levels and collected limited data from the actual CNC machines. To evaluate the similarity and limited diversity of generated and real signals, t-distributed stochastic neighbor embedding (T-SNE) method was used. The hyperparameters of the model were optimized by random and grid search. The CNN, LSTM, and AE combination demonstrated the best performance, which provides a practical way to detect misalignments without stopping production or cluttering the work area with sensors. The proposed intelligent monitoring system can detect misalignments of the linear tables of CNCs, thus enhancing the quality of manufactured parts and reducing production costs

KITopen

SuperB: Superior Behavior-based Anomaly Detection Defining Authorized Users\u27 Traffic Patterns

Author: Karasek Daniel
Publication venue: DigitalCommons@Kennesaw State University
Publication date: 06/05/2020
Field of study

Network anomalies are correlated to activities that deviate from regular behavior patterns in a network, and they are undetectable until their actions are defined as malicious. Current work in network anomaly detection includes network-based and host-based intrusion detection systems. However, network anomaly detection schemes can suffer from high false detection rates due to the base rate fallacy. When the detection rate is less than the false positive rate, which is found in network anomaly detection schemes working with live data, a high false detection rate can occur. To overcome such a drawback, this paper proposes a superior behavior-based anomaly detection system (SuperB) that defines legitimate network behaviors of authorized users in order to identify unauthorized accesses. I define the network behaviors of the authorized users by training the proposed deep learning model with time series data extracted from network packets of each of the users. Then, the trained model is used to classify all other behaviors (I define these as anomalies) from the defined legitimate behaviors. As a result, SuperB effectively detects all anomalies of network behaviors. The simulation results show that SuperB needs at least five end-to-end network conversations to achieve over 95% accuracy and over 93% true positive rate. Some simulations achieved 100% accuracy and true positive rate. The simulations use live network data combined with the CICIDS2017 data set. The performance has an average of less than 1.1% false positive rate, with some simulations showing 0%. The execution time to process each conversation is 85.20 ± 0.60 milliseconds (ms), and thus it takes about only 426 ms to process five conversations to identify an anomaly

DigitalCommons@Kennesaw State University

Ensemble Deep Learning for Detecting Onset of Abnormal Operation in Industrial Multi-component Systems

Author: Balaji Selvanathan
Saurabh Jaywant Desai
Shashank Agarwal
Sri Harsha Nistala
Venkataramana Runkana
Publication venue: 'PHM Society'
Publication date: 01/07/2022
Field of study

Breakdowns and unplanned shutdowns in industrial processes and equipment can lead to significant loss of availability and revenue. It is imperative to perform optimal maintenance of such systems when signs of abnormal behavior are detected and before they propagate and lead to catastrophic failure. This is particularly challenging in systems with interconnected multiple components as it is difficult to isolate the effect of one component on the operation of other components in the system. In this work, an ensemble approach based on Cascaded Convolutional neural network and Long Short-term Memory (CC-LSTM) network models is proposed for detecting and predicting the time of onset of faults in interconnected multicomponent systems. The performance of the ensemble CC-LSTM model was demonstrated on an industrial 4-component system and was found to improve the accuracy of onset time predictions by ~15% compared to individual CC-LSTM models and ~25-40% compared to commonly used deep learning techniques such as dense neural networks, convolutional neural networks and LSTMs. The CC-LSTM and the ensemble models also had the lowest missed detection rates and zero false positive rates making them ideal for real-time monitoring and fault detection in multicomponent systems

Directory of Open Access Journals