3,266 research outputs found

Exploration vs Exploitation vs Safety: Risk-averse Multi-Armed Bandits

Author: Galichet Nicolas
Sebag Michèle
Teytaud Olivier
Publication date: 13/11/2013

Motivated by applications in energy management, this paper presents the Multi-Armed Risk-Aware Bandit (MARAB) algorithm. With the goal of limiting the exploration of risky arms, MARAB takes as arm quality its conditional value at risk. When the user-supplied risk level goes to 0, the arm quality tends toward the essential infimum of the arm distribution density, and MARAB tends toward the MIN multi-armed bandit algorithm, aimed at the arm with maximal minimal value. As a first contribution, this paper presents a theoretical analysis of the MIN algorithm under mild assumptions, establishing its robustness compared with UCB. The analysis is supported by extensive experimental validation of MIN and MARAB compared to UCB and state-of-the-art risk-aware MAB algorithms on artificial and real-world problems. Comment: 16 pages
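
To make the arm-quality idea in this abstract concrete, here is a minimal Python sketch of an empirical conditional value at risk computed from observed rewards. It is an illustration under stated assumptions, not the MARAB algorithm itself: the function names `empirical_cvar` and `pick_arm` are hypothetical, the estimator (mean of the worst `ceil(alpha * n)` rewards) is one common choice, and any exploration term the paper may use is omitted.

```python
import math


def empirical_cvar(rewards, alpha):
    """Mean of the worst ceil(alpha * n) observed rewards (lower tail).

    Assumption: rewards are "higher is better", so risk lives in the low tail.
    """
    if not rewards:
        raise ValueError("rewards must be non-empty")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    k = max(1, math.ceil(alpha * len(rewards)))
    worst = sorted(rewards)[:k]
    return sum(worst) / k


def pick_arm(history, alpha):
    """Index of the arm with the highest empirical CVaR.

    Hypothetical greedy rule with no exploration bonus; ties go to the
    lowest index.
    """
    scores = [empirical_cvar(r, alpha) for r in history]
    return max(range(len(scores)), key=scores.__getitem__)


# Arm 0 has the higher mean but a bad worst case; arm 1 is safer.
history = [[0.0, 1.0, 1.0, 1.0], [0.4, 0.5, 0.5, 0.6]]
risk_neutral_choice = pick_arm(history, alpha=1.0)   # compares plain means
risk_averse_choice = pick_arm(history, alpha=0.25)   # compares worst rewards
```

With `alpha = 1.0` the score is the plain empirical mean; as `alpha` shrinks, the score approaches the smallest observed reward, which matches the abstract's remark that the criterion tends toward MIN when the risk level goes to 0.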

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Black-box optimization benchmarking of IPOP-saACM-ES on the BBOB-2012 noisy testbed

Author: Loshchilov Ilya
Schoenauer Marc
Sebag Michèle
Publication date: 01/01/2012

In this paper, we study the performance of IPOP-saACM-ES, a recently proposed self-adaptive surrogate-assisted Covariance Matrix Adaptation Evolution Strategy. The algorithm was tested using restarts until a total of 10^6 D function evaluations was reached, where D is the dimension of the function search space. The experiments show that the surrogate model control allows IPOP-saACM-ES to be as robust as the original IPOP-aCMA-ES and to outperform the latter by a factor of 2 to 3 on 6 benchmark problems with moderate noise. On 15 out of 30 benchmark problems in dimension 20, IPOP-saACM-ES exceeds the records observed during BBOB-2009 and BBOB-2010. Comment: Genetic and Evolutionary Computation Conference (GECCO 2012) (2012)

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Self-Adaptive Surrogate-Assisted Covariance Matrix Adaptation Evolution Strategy

Author: Loshchilov Ilya
Schoenauer Marc
Sebag Michèle
Publication date: 01/01/2012

This paper presents a novel mechanism to adapt surrogate-assisted population-based algorithms. This mechanism is applied to ACM-ES, a recently proposed surrogate-assisted variant of CMA-ES. The resulting algorithm, saACM-ES, adjusts online the lifelength of the current surrogate model (the number of CMA-ES generations before learning a new surrogate) and the surrogate hyper-parameters. Both heuristics significantly improve the quality of the surrogate model, yielding a significant speed-up of saACM-ES compared to the ACM-ES and CMA-ES baselines. The empirical validation of saACM-ES on the BBOB-2012 noiseless testbed demonstrates the efficiency of the proposed approach and its scalability w.r.t. the problem dimension and the population size; the approach reaches new best results on some of the benchmark problems. Comment: Genetic and Evolutionary Computation Conference (GECCO 2012) (2012)

arXiv.org e-Print Archive

HAL-CentraleSupelec

Infoscience - École polytechnique fédérale de Lausanne

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

KL-based Control of the Learning Schedule for Surrogate Black-Box Optimization

Author: Loshchilov Ilya
Schoenauer Marc
Sebag Michèle
Publication date: 03/07/2013

This paper investigates the control of an ML component within the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) devoted to black-box optimization. The known CMA-ES weakness is its sample complexity, the number of evaluations of the objective function needed to approximate the global optimum. This weakness is commonly addressed through surrogate optimization: learning an estimate of the objective function, a.k.a. the surrogate model, and replacing most evaluations of the true objective function with the (inexpensive) evaluation of the surrogate model. This paper presents a principled control of the learning schedule (when to relearn the surrogate model), based on the Kullback-Leibler divergence between the current search distribution and the training distribution of the former surrogate model. The experimental validation of the proposed approach shows significant performance gains on a comprehensive set of ill-conditioned benchmark problems, compared to the best state of the art, including the quasi-Newton high-precision BFGS method.
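
The scheduling idea in this abstract can be sketched as a divergence check between two Gaussian distributions. The Python sketch below is an illustration only, under several assumptions not stated in the abstract: both distributions are simplified to diagonal-covariance Gaussians (CMA-ES maintains a full covariance matrix), the divergence is taken in the direction KL(current || training), and the fixed-threshold rule in `should_relearn` is hypothetical. All function names are invented for the example.

```python
import math


def kl_diag_gauss(mean0, var0, mean1, var1):
    """KL( N(mean0, diag(var0)) || N(mean1, diag(var1)) ) in closed form."""
    if not (len(mean0) == len(var0) == len(mean1) == len(var1)):
        raise ValueError("all inputs must have the same dimension")
    total = 0.0
    for m0, v0, m1, v1 in zip(mean0, var0, mean1, var1):
        if v0 <= 0.0 or v1 <= 0.0:
            raise ValueError("variances must be positive")
        total += v0 / v1 + (m1 - m0) ** 2 / v1 - 1.0 + math.log(v1 / v0)
    return 0.5 * total


def should_relearn(current, training, threshold):
    """Hypothetical rule: relearn the surrogate once the search distribution
    has drifted more than `threshold` (in KL) from the distribution the
    previous surrogate was trained under.

    `current` and `training` are (mean, variance) pairs of equal dimension.
    """
    (m0, v0), (m1, v1) = current, training
    return kl_diag_gauss(m0, v0, m1, v1) > threshold


training_dist = ([0.0, 0.0], [1.0, 1.0])
same_dist = ([0.0, 0.0], [1.0, 1.0])
shifted_dist = ([1.0, 0.0], [1.0, 1.0])  # mean moved by 1 along one axis

no_drift = kl_diag_gauss(*same_dist, *training_dist)
drift = kl_diag_gauss(*shifted_dist, *training_dist)
relearn_now = should_relearn(shifted_dist, training_dist, threshold=0.25)
```

The threshold value here is arbitrary; how the divergence actually drives the schedule in the paper is not described in the abstract.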

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Vitreoschisis

Author: A Kakehashi
EA Balazs
I Krebs
J Sebag
N Ueno
SD Schwartz
SS Badrinath
TG Chu
WR Green
Publication venue: Springer-Verlag
Publication date: 01/01/2008

Crossref

Springer - Publisher Connector

PubMed Central