Search CORE

14 research outputs found

OBOE: Collaborative Filtering for AutoML Model Selection

Author: Akimoto Yuji
Kim Dae Won
Udell Madeleine
Yang Chengrun
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/05/2019
Field of study

Algorithm selection and hyperparameter tuning remain two of the most challenging tasks in machine learning. Automated machine learning (AutoML) seeks to automate these tasks to enable widespread use of machine learning by non-experts. This paper introduces OBOE, a collaborative filtering method for time-constrained model selection and hyperparameter tuning. OBOE forms a matrix of the cross-validated errors of a large number of supervised learning models (algorithms together with hyperparameters) on a large number of datasets, and fits a low rank model to learn the low-dimensional feature vectors for the models and datasets that best predict the cross-validated errors. To find promising models for a new dataset, OBOE runs a set of fast but informative algorithms on the new dataset and uses their cross-validated errors to infer the feature vector for the new dataset. OBOE can find good models under constraints on the number of models fit or the total time budget. To this end, this paper develops a new heuristic for active learning in time-constrained matrix completion based on optimal experiment design. Our experiments demonstrate that OBOE delivers state-of-the-art performance faster than competing approaches on a test bed of supervised learning problems. Moreover, the success of the bilinear model used by OBOE suggests that AutoML may be simpler than was previously understood

arXiv.org e-Print Archive

Crossref

SUNNY: a Lazy Portfolio Approach for Constraint Solving

Author: Amadini Roberto
Gabbrielli Maurizio
Mauro Jacopo
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2014
Field of study

*** To appear in Theory and Practice of Logic Programming (TPLP) *** Within the context of constraint solving, a portfolio approach allows one to exploit the synergy between different solvers in order to create a globally better solver. In this paper we present SUNNY: a simple and flexible algorithm that takes advantage of a portfolio of constraint solvers in order to compute --- without learning an explicit model --- a schedule of them for solving a given Constraint Satisfaction Problem (CSP). Motivated by the performance reached by SUNNY vs. different simulations of other state of the art approaches, we developed sunny-csp, an effective portfolio solver that exploits the underlying SUNNY algorithm in order to solve a given CSP. Empirical tests conducted on exhaustive benchmarks of MiniZinc models show that the actual performance of SUNNY conforms to the predictions. This is encouraging both for improving the power of CSP portfolio solvers and for trying to export them to fields such as Answer Set Programming and Constraint Logic Programming

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

ActivMetaL: Algorithm Recommendation with Active Meta Learning

Author: Guyon Isabelle
Sebag Michèle
Sun-Hosoya Lisheng
Publication venue: HAL CCSD
Publication date: 10/09/2018
Field of study

International audienceWe present an active meta learning approach to model selection or algorithm recommendation. We adopt the point of view "collab-orative filtering" recommender systems in which the problem is brought back to a missing data problem: given a sparsely populated matrix of performances of algorithms on given tasks, predict missing performances; more particularly, predict which algorithm will perform best on a new dataset (empty row). In this work, we propose and study an active learning version of the recommender algorithm CofiRank algorithm and compare it with baseline methods. Our benchmark involves three real-world datasets (from StatLog, OpenML, and AutoML) and artificial data. Our results indicate that CofiRank rapidly finds well performing algorithms on new datasets at reasonable computational cost

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

OSCAR: Online selection of algorithm portfolios with case study on memetic algorithms

Author: A Nareyek
B Huberman
E Burke
E Lawler
E Taillard
F Hutter
I Borg
I Guyon
J Rice
JR Quinlan
L Breiman
L Xu
P Moscato
RE Burkard
S Kadioglu
SD Handoko
TP Runarsson
Z Yuan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2015
Field of study

Crossref

Institutional Knowledge at Singapore Management University

Algorithm Selection via Ranking

Author: LAU Hoong Chuin
Richard Jayadi Oentaryo
Stephanus Daniel Handoko
Publication venue: AAAI Press
Publication date: 01/01/2015
Field of study

Institutional Knowledge at Singapore Management University

MetaREVEAL: RL-based Meta-learning from Learning Curves

Author: Grinsztajn Nathan
Guyon Isabelle
Nguyen Manh Hung
Sun-Hosoya Lisheng
Publication venue: HAL CCSD
Publication date: 13/09/2021
Field of study

International audienceThis paper addresses a cornerstone of Automated Machine Learning: the problem of rapidly uncovering which machine learning algorithm performs best on a new dataset. Our approach leverages performances of such algorithms on datasets to which they have been previously exposed, i.e., implementing a form of meta-learning. More specifically, the problem is cast as a REVEAL Reinforcement Learning (RL) game: the meta-learning problem is wrapped into a RL environment in which an agent can start, pause, or resume training various machine learning algorithms to progressively "reveal" their learning curves. The learned policy is then applied to quickly uncover the best algorithm on a new dataset. While other similar approaches, such as Freeze-Thaw, were proposed in the past, using Bayesian optimization, our methodology is, to the best of our knowledge, the first that trains a RL agent to do this task on previous datasets. Using real and artificial data, we show that our new RL-based meta-learning paradigm outperforms Free-Thaw and other baseline methods, with respect to the Area under the Learning curve metric, a form of evaluation of Anytime learning (i.e., the capability of interrupting the algorithm at any time while obtaining good performance)

INRIA a CCSD electronic archive server

Learning Meta-features for AutoML

Author: Milijaona Louisot
Rakotoarison Herilalaina
Rasoanaivo Andry
Schoenauer Marc
Sebag Michèle
Publication venue: HAL CCSD
Publication date: 26/04/2022
Field of study

International audienceThis paper tackles the AutoML problem, aimed to automatically select an ML algorithm and its hyper-parameter configuration most appropriate to the dataset at hand. The proposed approach, MetaBu, learns new meta-features via an Optimal Transport procedure, aligning the manually designed meta-features with the space of distributions on the hyper-parameter configurations. MetaBu meta-features, learned once and for all, induce a topology on the set of datasets that is exploited to define a distribution of promising hyper-parameter configurations amenable to AutoML. Experiments on the OpenML CC-18 benchmark demonstrate that using MetaBu meta-features boosts the performance of state of the art AutoML systems, (Feurer et al. 2015) and Probabilistic Matrix Factorization (Fusi et al. 2018). Furthermore, the inspection of MetaBu meta-features gives some hints into when an ML algorithm does well. Finally, the topology based on MetaBu meta-features enables to estimate the intrinsic dimensionality of the OpenML benchmark w.r.t. a given ML algorithm or pipeline

INRIA a CCSD electronic archive server

Algorithm Selection in Auction-based Allocation of Cloud Computing Resources

Author: Gudu Diana
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2019
Field of study

KITopen