2 research outputs found

Automating concept-drift detection by self-evaluating predictive model degradation

Authors: Apiletti Daniele, Baralis Elena, Cerquitelli Tania, Proto Stefano, Ventura Francesco
Publication date: 01/01/2019

A key aspect of automating predictive machine learning entails the capability of properly triggering the update of the trained model. To this aim, suitable automatic solutions to self-assess the prediction quality and the data distribution drift between the original training set and the new data have to be devised. In this paper, we propose a novel methodology to automatically detect prediction-quality degradation of machine learning models due to class-based concept drift, i.e., when new data contains samples that do not fit the set of class labels known by the currently trained predictive model. Experiments on synthetic and real-world public datasets show the effectiveness of the proposed methodology in automatically detecting and describing concept drift caused by changes in the class-label data distributions.

Comment: 5 pages, 4 figures

arXiv.org e-Print Archive

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)
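
The idea in the abstract above, self-assessing whether new data still fits the classes a model was trained on, can be illustrated with a minimal sketch. This is not the methodology of the paper: the nearest-centroid model, the mean-distance fit score, the function names, and the `ratio_threshold` value are all assumptions chosen for illustration.

```python
import math
from collections import defaultdict

def centroids(samples, labels):
    """Mean vector per class label (hypothetical stand-in for a trained model)."""
    groups = defaultdict(list)
    for x, y in zip(samples, labels):
        groups[y].append(x)
    return {y: tuple(sum(c) / len(xs) for c in zip(*xs)) for y, xs in groups.items()}

def nearest(x, cents):
    """Return (label, distance) of the centroid closest to x."""
    return min(((y, math.dist(x, c)) for y, c in cents.items()), key=lambda t: t[1])

def mean_fit(samples, cents):
    """Average distance from each sample to its assigned centroid (lower = better fit)."""
    return sum(nearest(x, cents)[1] for x in samples) / len(samples)

def drift_detected(train, train_labels, new, ratio_threshold=2.0):
    """Flag drift when new data fits the known classes much worse than the training data.

    Assumes the training baseline is non-zero; ratio_threshold is an arbitrary choice.
    """
    cents = centroids(train, train_labels)
    baseline = mean_fit(train, cents)
    return mean_fit(new, cents) > ratio_threshold * baseline

train = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
train_labels = ["a", "a", "a", "b", "b", "b"]
same_classes = [(0.2, 0.3), (10.5, 10.2)]   # resembles the training classes
unseen_class = [(5, -5), (5, -6), (6, -5)]  # cluster far from both known classes
```

In this toy setup, samples from a class the model has never seen are forced onto the nearest known class and fit it poorly, which is what raises the flag.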

On the Optimization of Iterative Programming with Distributed Data Collections

Authors: Chlyah Sarah, Genevès Pierre, Gesbert Nils, Layaïda Nabil
Publication venue: HAL CCSD
Publication date: 16/10/2020

Big data programming frameworks are becoming increasingly important for the development of applications for which performance and scalability are critical. In those complex frameworks, optimizing code by hand is hard and time-consuming, making automated optimization particularly necessary. In order to automate optimization, a prerequisite is to find suitable abstractions to represent programs; for instance, algebras based on monads or monoids to represent distributed data collections. Currently, however, such algebras do not represent recursive programs in a way that allows for analyzing or rewriting them. In this paper, we extend a monoid algebra with a fixpoint operator for representing recursion as a first-class citizen and show how it enables new optimizations. Experiments with the Spark platform illustrate performance gains brought by these systematic optimizations.

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server
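
To make the notion of a fixpoint operator over a collection monoid more concrete, here is a minimal sketch using plain Python sets, with set union as the monoid operation. It is not the algebra or the optimizations defined in the paper: `fixpoint`, `fixpoint_delta`, and `make_step` are hypothetical names, and the delta-based variant is the classic semi-naive evaluation, shown only as an example of a rewrite that becomes possible once recursion is an explicit operator. The delta variant assumes `step` distributes over union.

```python
def fixpoint(step, seed):
    """Naive evaluation: reapply step to the whole result until nothing new appears."""
    result = set(seed)
    while True:
        extended = result | step(result)
        if extended == result:
            return result
        result = extended

def fixpoint_delta(step, seed):
    """Semi-naive evaluation: apply step only to the elements found in the last round.

    Gives the same result as fixpoint when step distributes over set union.
    """
    result = set(seed)
    delta = set(seed)
    while delta:
        delta = step(delta) - result
        result |= delta
    return result

def make_step(edges):
    """Build a step that extends each known path (a, b) with every edge (b, c)."""
    def step(paths):
        return {(a, c) for (a, b) in paths for (b2, c) in edges if b == b2}
    return step

edges = {(1, 2), (2, 3), (3, 4)}
closure_naive = fixpoint(make_step(edges), edges)
closure_delta = fixpoint_delta(make_step(edges), edges)
```

Both calls compute the transitive closure of `edges`; the delta variant avoids re-deriving paths that were already known in earlier rounds.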