8,873 research outputs found
Predicting recurring concepts on data-streams by me ans of a meta-model and a fuzzy similarity function
Meta-models can be used in the process of enhancing the drift detection mechanisms used by data stream algorithms, by representing and predicting when the change will occur. There are some real-world situations where a concept reappears, as in the case of intrusion detection systems(IDS), where the same incidents or an adaptation of them usually reappear over time. In these environments the early prediction of drift by means of a better knowledge of past models can help to anticipate to the change, thus improving efficiency of the model regarding the training instances needed. In this paper we present MM-PRec, a meta-model for predicting recurring concepts on data-streams which main goal is to predict when the drift is going to occur together with the best model to be used in case of a recurring concept. To fulfill this goal, MM-PRec trains a Hidden Markov Model (HMM) from the instances that appear during the concept drift. The learning process of the base classification learner feeds the meta-model with all the information needed to predict recurrent or similar situations. Thus, the models predicted together with the associated contextual information are stored. In our approach we also propose to use a fuzzy similarity function to decide which is the best model to represent a particular context when drift is detected. The experiments performed show that MM-PRec outperforms the behaviour of other context-aware algorithms in terms of training instances needed, specially in environments characterized by the presence of gradual drifts
Stream Learning in Energy IoT Systems: A Case Study in Combined Cycle Power Plants
The prediction of electrical power produced in combined cycle power plants is a key challenge in the electrical power and energy systems field. This power production can vary depending on environmental variables, such as temperature, pressure, and humidity. Thus, the business problem is how to predict the power production as a function of these environmental conditions, in order to maximize the profit. The research community has solved this problem by applying Machine Learning techniques, and has managed to reduce the computational and time costs in comparison with the traditional thermodynamical analysis. Until now, this challenge has been tackled from a batch learning perspective, in which data is assumed to be at rest, and where models do not continuously integrate new information into already constructed models. We present an approach closer to the Big Data and Internet of Things paradigms, in which data are continuously arriving and where models learn incrementally, achieving significant enhancements in terms of data processing (time, memory and computational costs), and obtaining competitive performances. This work compares and examines the hourly electrical power prediction of several streaming regressors, and discusses about the best technique in terms of time processing and predictive performance to be applied on this streaming scenario.This work has been partially supported by the EU project iDev40. This project has received funding
from the ECSEL Joint Undertaking (JU) under grant agreement No 783163. The JU receives support from the
European Union’s Horizon 2020 research and innovation programme and Austria, Germany, Belgium, Italy,
Spain, Romania. It has also been supported by the Basque Government (Spain) through the project VIRTUAL
(KK-2018/00096), and by Ministerio de Economía y Competitividad of Spain (Grant Ref. TIN2017-85887-C2-2-P)
Assessing the suitable habitat for reintroduction of brown trout (Salmo trutta forma fario) in a lowland river : a modeling approach
Huge efforts have been made during the past decades to improve the water quality and to restore the physical habitat of rivers and streams in western Europe. This has led to an improvement in biological water quality and an increase in fish stocks in many countries. However, several rheophilic fish species such as brown trout are still categorized as vulnerable in lowland streams in Flanders (Belgium). In order to support cost-efficient restoration programs, habitat suitability modeling can be used. In this study, we developed an ensemble of habitat suitability models using metaheuristic algorithms to explore the importance of a large number of environmental variables, including chemical, physical, and hydromorphological characteristics to determine the suitable habitat for reintroduction of brown trout in the Zwalm River basin (Flanders, Belgium), which is included in the Habitats Directive. Mean stream velocity, water temperature, hiding opportunities, and presence of pools or riffles were identified as the most important variables determining the habitat suitability. Brown trout mainly preferred streams with a relatively high mean reach stream velocity (0.2-1m/s), a low water temperature (7-15 degrees C), and the presence of pools. The ensemble of models indicated that most of the tributaries and headwaters were suitable for the species. Synthesis and applications. Our results indicate that this modeling approach can be used to support river management, not only for brown trout but also for other species in similar geographical regions. Specifically for the Zwalm River basin, future restoration of the physical habitat, removal of the remaining migration barriers and the development of suitable spawning grounds could promote the successful restoration of brown trout
Recommended from our members
Simulating California reservoir operation using the classification and regression-tree algorithm combined with a shuffled cross-validation scheme
The controlled outflows from a reservoir or dam are highly dependent on the decisions made by the reservoir operators, instead of a natural hydrological process. Difference exists between the natural upstream inflows to reservoirs and the controlled outflows from reservoirs that supply the downstream users. With the decision maker's awareness of changing climate, reservoir management requires adaptable means to incorporate more information into decision making, such as water delivery requirement, environmental constraints, dry/wet conditions, etc. In this paper, a robust reservoir outflow simulation model is presented, which incorporates one of the well-developed data-mining models (Classification and Regression Tree) to predict the complicated human-controlled reservoir outflows and extract the reservoir operation patterns. A shuffled cross-validation approach is further implemented to improve CART's predictive performance. An application study of nine major reservoirs in California is carried out. Results produced by the enhanced CART, original CART, and random forest are compared with observation. The statistical measurements show that the enhanced CART and random forest overperform the CART control run in general, and the enhanced CART algorithm gives a better predictive performance over random forest in simulating the peak flows. The results also show that the proposed model is able to consistently and reasonably predict the expert release decisions. Experiments indicate that the release operation in the Oroville Lake is significantly dominated by SWP allocation amount and reservoirs with low elevation are more sensitive to inflow amount than others
A machine learning-based framework for preventing video freezes in HTTP adaptive streaming
HTTP Adaptive Streaming (HAS) represents the dominant technology to deliver videos over the Internet, due to its ability to adapt the video quality to the available bandwidth. Despite that, HAS clients can still suffer from freezes in the video playout, the main factor influencing users' Quality of Experience (QoE). To reduce video freezes, we propose a network-based framework, where a network controller prioritizes the delivery of particular video segments to prevent freezes at the clients. This framework is based on OpenFlow, a widely adopted protocol to implement the software-defined networking principle. The main element of the controller is a Machine Learning (ML) engine based on the random undersampling boosting algorithm and fuzzy logic, which can detect when a client is close to a freeze and drive the network prioritization to avoid it. This decision is based on measurements collected from the network nodes only, without any knowledge on the streamed videos or on the clients' characteristics. In this paper, we detail the design of the proposed ML-based framework and compare its performance with other benchmarking HAS solutions, under various video streaming scenarios. Particularly, we show through extensive experimentation that the proposed approach can reduce video freezes and freeze time with about 65% and 45% respectively, when compared to benchmarking algorithms. These results represent a major improvement for the QoE of the users watching multimedia content online
Recommended from our members
Multimodel combination techniques for analysis of hydrological simulations: Application to distributed model intercomparison project results
This paper examines several multimodel combination techniques that are used for streamflow forecasting: the simple model average (SMA), the multimodel superensemble (MMSE), modified multimodel superensemble (M3SE), and the weighted average method (WAM). These model combination techniques were evaluated using the results from the Distributed Model Intercomparison Project (DMIP), an international project sponsored by the National Weather Service (NWS) Office of Hydrologic Development (OHD). All of the multimodel combination results were obtained using uncalibrated DMIP model simulations and were compared against the best-uncalibrated as well as the best-calibrated individual model results. The purpose of this study is to understand how different combination techniques affect the accuracy levels of the multimodel simulations. This study revealed that the multimodel simulations obtained from uncalibrated single-model simulations are generally better than any single-member model simulations, even the best-calibrated single-model simulations. Furthermore, more sophisticated multimodel combination techniques that incorporated bias correction step work better than simple multimodel average simulations or multimodel simulations without bias correction. © 2006 American Meteorological Society
Gating Artificial Neural Network Based Soft Sensor
This work proposes a novel approach to Soft Sensor modelling,
where the Soft Sensor is built by a set of experts which are artificial
neural networks with randomly generated topology. For each of
the experts a meta neural network is trained, the gating Artificial Neural
Network. The role of the gating network is to learn the performance of the
experts in dependency on the input data samples. The final prediction
of the Soft Sensor is a weighted sum of the individual experts predictions.
The proposed meta-learning method is evaluated on two different
process industry data sets
- …