959 research outputs found
Air Quality Prediction in Smart Cities Using Machine Learning Technologies Based on Sensor Data: A Review
The influence of machine learning technologies is rapidly increasing and penetrating almost in every field, and air pollution prediction is not being excluded from those fields. This paper covers the revision of the studies related to air pollution prediction using machine learning algorithms based on sensor data in the context of smart cities. Using the most popular databases and executing the corresponding filtration, the most relevant papers were selected. After thorough reviewing those papers, the main features were extracted, which served as a base to link and compare them to each other. As a result, we can conclude that: (1) instead of using simple machine learning techniques, currently, the authors apply advanced and sophisticated techniques, (2) China was the leading country in terms of a case study, (3) Particulate matter with diameter equal to 2.5 micrometers was the main prediction target, (4) in 41% of the publications the authors carried out the prediction for the next day, (5) 66% of the studies used data had an hourly rate, (6) 49% of the papers used open data and since 2016 it had a tendency to increase, and (7) for efficient air quality prediction it is important to consider the external factors such as weather conditions, spatial characteristics, and temporal features
Ensemble Reinforcement Learning: A Survey
Reinforcement Learning (RL) has emerged as a highly effective technique for
addressing various scientific and applied problems. Despite its success,
certain complex tasks remain challenging to be addressed solely with a single
model and algorithm. In response, ensemble reinforcement learning (ERL), a
promising approach that combines the benefits of both RL and ensemble learning
(EL), has gained widespread popularity. ERL leverages multiple models or
training algorithms to comprehensively explore the problem space and possesses
strong generalization capabilities. In this study, we present a comprehensive
survey on ERL to provide readers with an overview of recent advances and
challenges in the field. First, we introduce the background and motivation for
ERL. Second, we analyze in detail the strategies that have been successfully
applied in ERL, including model averaging, model selection, and model
combination. Subsequently, we summarize the datasets and analyze algorithms
used in relevant studies. Finally, we outline several open questions and
discuss future research directions of ERL. By providing a guide for future
scientific research and engineering applications, this survey contributes to
the advancement of ERL.Comment: 42 page
The State-of-the-Art in Air Pollution Monitoring and Forecasting Systems using IoT, Big Data, and Machine Learning
The quality of air is closely linked with the life quality of humans,
plantations, and wildlife. It needs to be monitored and preserved continuously.
Transportations, industries, construction sites, generators, fireworks, and
waste burning have a major percentage in degrading the air quality. These
sources are required to be used in a safe and controlled manner. Using
traditional laboratory analysis or installing bulk and expensive models every
few miles is no longer efficient. Smart devices are needed for collecting and
analyzing air data. The quality of air depends on various factors, including
location, traffic, and time. Recent researches are using machine learning
algorithms, big data technologies, and the Internet of Things to propose a
stable and efficient model for the stated purpose. This review paper focuses on
studying and compiling recent research in this field and emphasizes the Data
sources, Monitoring, and Forecasting models. The main objective of this paper
is to provide the astuteness of the researches happening to improve the various
aspects of air polluting models. Further, it casts light on the various
research issues and challenges also.Comment: 30 pages, 11 figures, Wireless Personal Communications. Wireless Pers
Commun (2023
Review and Comparison of Intelligent Optimization Modelling Techniques for Energy Forecasting and Condition-Based Maintenance in PV Plants
Within the field of soft computing, intelligent optimization modelling techniques include
various major techniques in artificial intelligence. These techniques pretend to generate new business
knowledge transforming sets of "raw data" into business value. One of the principal applications of
these techniques is related to the design of predictive analytics for the improvement of advanced
CBM (condition-based maintenance) strategies and energy production forecasting. These advanced
techniques can be used to transform control system data, operational data and maintenance event data
to failure diagnostic and prognostic knowledge and, ultimately, to derive expected energy generation.
One of the systems where these techniques can be applied with massive potential impact are the
legacy monitoring systems existing in solar PV energy generation plants. These systems produce a
great amount of data over time, while at the same time they demand an important e ort in order to
increase their performance through the use of more accurate predictive analytics to reduce production
losses having a direct impact on ROI. How to choose the most suitable techniques to apply is one of
the problems to address. This paper presents a review and a comparative analysis of six intelligent
optimization modelling techniques, which have been applied on a PV plant case study, using the
energy production forecast as the decision variable. The methodology proposed not only pretends
to elicit the most accurate solution but also validates the results, in comparison with the di erent
outputs for the di erent techniques
Effect of traffic dataset on various machine-learning algorithms when forecasting air quality
© Emerald Publishing Limited. This is the accepted manuscript version of an article which has been published in final form at https://10.1108/JEDT-10-2021-0554Purpose (limit 100 words) Road traffic emissions are generally believed to contribute immensely to air pollution, but the effect of road traffic datasets on air quality predictions has not been clearly investigated. This research investigates the effects traffic dataset have on the performance of Machine Learning (ML) predictive models in air quality prediction. Design/methodology/approach (limit 100 words) To achieve this, we have set up an experiment with the control dataset having only the Air Quality (AQ) dataset and Meteorological (Met) dataset. While the experimental dataset is made up of the AQ dataset, Met dataset and Traffic dataset. Several ML models (such as Extra Trees Regressor, eXtreme Gradient Boosting Regressor, Random Forest Regressor, K-Neighbors Regressor, and five others) were trained, tested, and compared on these individual combinations of datasets to predict the volume of PM2.5, PM10, NO2, and O3 in the atmosphere at various time of the day. Findings (limit 100 words) The result obtained showed that various ML algorithms react differently to the traffic dataset despite generally contributing to the performance improvement of all the ML algorithms considered in this study by at least 20% and an error reduction of at least 18.97%. Research limitations/implications (limit 100 words) This research is limited in terms of the study area and the result cannot be generalized outside of the UK as many conditions may not be similar elsewhere. Additionally, only the ML algorithms commonly used in literature are considered in this research. Therefore, leaving out a few other ML algorithms. Practical implications (limit 100 words) This study reinforces the belief that the traffic dataset has a significant effect on improving the performance of air pollution ML prediction models. Hence, there is an indication that ML algorithms behave differently when trained with a form traffic dataset in the development of an air quality prediction model. This implies that developers and researchers in air quality prediction need to identify the ML algorithms that behave in their best interest before implementation. Originality/value (limit 100 words) This will enable researchers to focus more on algorithms of benefit when using traffic datasets in air quality prediction.Peer reviewe
Toward Building an Intelligent and Secure Network: An Internet Traffic Forecasting Perspective
Internet traffic forecast is a crucial component for the proactive management of self-organizing networks (SON) to ensure better Quality of Service (QoS) and Quality of Experience (QoE). Given the volatile and random nature of traffic data, this forecasting influences strategic development and investment decisions in the Internet Service Provider (ISP) industry. Modern machine learning algorithms have shown potential in dealing with complex Internet traffic prediction tasks, yet challenges persist. This thesis systematically explores these issues over five empirical studies conducted in the past three years, focusing on four key research questions: How do outlier data samples impact prediction accuracy for both short-term and long-term forecasting? How can a denoising mechanism enhance prediction accuracy? How can robust machine learning models be built with limited data? How can out-of-distribution traffic data be used to improve the generalizability of prediction models? Based on extensive experiments, we propose a novel traffic forecast/prediction framework and associated models that integrate outlier management and noise reduction strategies, outperforming traditional machine learning models. Additionally, we suggest a transfer learning-based framework combined with a data augmentation technique to provide robust solutions with smaller datasets. Lastly, we propose a hybrid model with signal decomposition techniques to enhance model generalization for out-of-distribution data samples. We also brought the issue of cyber threats as part of our forecast research, acknowledging their substantial influence on traffic unpredictability and forecasting challenges. Our thesis presents a detailed exploration of cyber-attack detection, employing methods that have been validated using multiple benchmark datasets. Initially, we incorporated ensemble feature selection with ensemble classification to improve DDoS (Distributed Denial-of-Service) attack detection accuracy with minimal false alarms. Our research further introduces a stacking ensemble framework for classifying diverse forms of cyber-attacks. Proceeding further, we proposed a weighted voting mechanism for Android malware detection to secure Mobile Cyber-Physical Systems, which integrates the mobility of various smart devices to exchange information between physical and cyber systems. Lastly, we employed Generative Adversarial Networks for generating flow-based DDoS attacks in Internet of Things environments. By considering the impact of cyber-attacks on traffic volume and their challenges to traffic prediction, our research attempts to bridge the gap between traffic forecasting and cyber security, enhancing proactive management of networks and contributing to resilient and secure internet infrastructure
Mecanismos para controlo e gestão de redes 5G: redes de operador
In 5G networks, time-series data will be omnipresent for the monitoring of network
metrics. With the increase in the number of Internet of Things (IoT) devices
in the next years, it is expected that the number of real-time time-series
data streams increases at a fast pace. To be able to monitor those streams,
test and correlate different algorithms and metrics simultaneously and in a
seamless way, time-series forecasting is becoming essential for the pro-active
successful management of the network.
The objective of this dissertation is to design, implement and test a prediction
system in a communication network, that allows integrating various networks,
such as a vehicular network and a 4G operator network, to improve the network
reliability and Quality-of-Service (QoS). To do that, the dissertation has
three main goals: (1) the analysis of different network datasets and implementation
of different approaches to forecast network metrics, to test different
techniques; (2) the design and implementation of a real-time distributed
time-series forecasting architecture, to enable the network operator to make
predictions about the network metrics; and lastly, (3) to use the forecasting
models made previously and apply them to improve the network performance
using resource management policies.
The tests done with two different datasets, addressing the use cases of congestion
management and resource splitting in a network with a limited number
of resources, show that the network performance can be improved with proactive
management made by a real-time system able to predict the network
metrics and act on the network accordingly.
It is also done a study about what network metrics can cause reduced accessibility
in 4G networks, for the network operator to act more efficiently and
pro-actively to avoid such eventsEm redes 5G, séries temporais serão omnipresentes para a monitorização
de métricas de rede. Com o aumento do número de dispositivos da Internet
das Coisas (IoT) nos próximos anos, é esperado que o número de fluxos de
séries temporais em tempo real cresça a um ritmo elevado. Para monitorizar
esses fluxos, testar e correlacionar diferentes algoritmos e métricas simultaneamente
e de maneira integrada, a previsão de séries temporais está a
tornar-se essencial para a gestão preventiva bem sucedida da rede.
O objetivo desta dissertação é desenhar, implementar e testar um sistema
de previsão numa rede de comunicações, que permite integrar várias redes
diferentes, como por exemplo uma rede veicular e uma rede 4G de operador,
para melhorar a fiabilidade e a qualidade de serviço (QoS). Para isso,
a dissertação tem três objetivos principais: (1) a análise de diferentes datasets
de rede e subsequente implementação de diferentes abordagens para
previsão de métricas de rede, para testar diferentes técnicas; (2) o desenho
e implementação de uma arquitetura distribuÃda de previsão de séries temporais
em tempo real, para permitir ao operador de rede efetuar previsões
sobre as métricas de rede; e finalmente, (3) o uso de modelos de previsão
criados anteriormente e sua aplicação para melhorar o desempenho da rede
utilizando polÃticas de gestão de recursos.
Os testes efetuados com dois datasets diferentes, endereçando os casos de
uso de gestão de congestionamento e divisão de recursos numa rede com
recursos limitados, mostram que o desempenho da rede pode ser melhorado
com gestão preventiva da rede efetuada por um sistema em tempo real capaz
de prever métricas de rede e atuar em conformidade na rede.
Também é efetuado um estudo sobre que métricas de rede podem causar
reduzida acessibilidade em redes 4G, para o operador de rede atuar mais
eficazmente e proativamente para evitar tais acontecimentos.Mestrado em Engenharia de Computadores e Telemátic
- …