Search CORE

126 research outputs found

Unraveling the Complexity of Splitting Sequential Data: Tackling Challenges in Video and Time Series Analysis

Author: Botache Diego
Dingel Kristina
Ehresmann Arno
Huhnstock Rico
Sick Bernhard
Publication venue
Publication date: 26/07/2023
Field of study

Splitting of sequential data, such as videos and time series, is an essential step in various data analysis tasks, including object tracking and anomaly detection. However, splitting sequential data presents a variety of challenges that can impact the accuracy and reliability of subsequent analyses. This concept article examines the challenges associated with splitting sequential data, including data acquisition, data representation, split ratio selection, setting up quality criteria, and choosing suitable selection strategies. We explore these challenges through two real-world examples: motor test benches and particle tracking in liquids

arXiv.org e-Print Archive

Evaluation of Machine Learning Techniques for Inflow Prediction in Lake Como, Italy

Author: Andrea Scalvini
Ivan Serina
Michele Pini
Muhammad Usman Liaqat
Roberto Ranzi
Tahir Mehmood
Publication venue
Publication date: 01/01/2020
Field of study

Abstract Accurate streamflow prediction is a fundamental task for integrated water resources management and flood risk mitigation. The purpose of this study is to forecast the water inflow to lake Como, (Italy) using different machine learning algorithms. The forecast is done for different days ranging from one day to three days. These models are evaluated by three statistical measures including Mean Absolute Error, Root Mean Squared Error, and the Nash-Sutcliffe Efficiency Coefficient. The experimental results show that Neural Network performs better for streamflow estimation with MAE and RMSE followed by Support Vector Regression and Random Forest

Open Access Repository

Representativeness: A new criterion for selecting forecasts

Author: Petropoulos Fotios
Siemsen Enno
Publication venue
Publication date: 31/12/2022
Field of study

OPUS

Towards the prediction of renewable energy unbalance in smart grids

Author: A. Genovese
F. Scotti
R. Donida Labati
V. Piuri
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/11/2018
Field of study

The production of renewable energy is increasing worldwide. To integrate renewable sources in electrical smart grids able to adapt to changes in power usage in heterogeneous local zones, it is necessary to accurately predict the power production that can be achieved from renewable energy sources. By using such predictions, it is possible to plan the power production from non-renewable energy plants to properly allocate the produced power and compensate possible unbalances. In particular, it is important to predict the unbalance between the power produced and the actual power intake at a local level (zones). In this paper, we propose a novel method for predicting the sign of the unbalance between the power produced by renewable sources and the power intake at the local level, considering zones composed of multiple power plants and with heterogeneous characteristics. The method uses a set of historical features and is based on Computational Intelligence techniques able to learn the relationship between historical data and the power unbalance in heterogeneous geographical regions. As a case study, we evaluated the proposed method using data collected by a player in the energy market over a period of seven months. In this preliminary study, we evaluated different configurations of the proposed method, achieving results considered as satisfactory by a player in the energy market

Crossref

AIR Universita degli studi di Milano

Multilingual Twitter Sentiment Classification: The Role of Human Annotators

Author: Grcar Miha
Mozetic Igor
Smailovic Jasmina
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 23/02/2016
Field of study

What are the limits of automated Twitter sentiment classification? We analyze a large set of manually labeled tweets in different languages, use them as training data, and construct automated classification models. It turns out that the quality of classification models depends much more on the quality and size of training data than on the type of the model trained. Experimental results indicate that there is no statistically significant difference between the performance of the top classification models. We quantify the quality of training data by applying various annotator agreement measures, and identify the weakest points of different datasets. We show that the model performance approaches the inter-annotator agreement when the size of the training set is sufficiently large. However, it is crucial to regularly monitor the self- and inter-annotator agreements since this improves the training datasets and consequently the model performance. Finally, we show that there is strong evidence that humans perceive the sentiment classes (negative, neutral, and positive) as ordered

arXiv.org e-Print Archive

Common Language Resources and Technology Infrastructure - Slovenia

Directory of Open Access Journals

PubMed Central

Digital repository of Slovenian research organizations