6 research outputs found

Algorithm Portfolios for Noisy Optimization

Author: Baptiste Rozière
Cauwet Marie-Liesse
Liu Jialin
Teytaud Olivier
Publication date: 02/11/2015

Noisy optimization is the optimization of objective functions corrupted by noise. A portfolio of solvers is a set of solvers equipped with an algorithm selection tool for distributing the computational power among them. Portfolios are widely and successfully used in combinatorial optimization. In this work, we study portfolios of noisy optimization solvers. We obtain mathematically proved performance (in the sense that the portfolio performs nearly as well as the best of its solvers) by an ad hoc portfolio algorithm dedicated to noisy optimization. A somewhat surprising result is that it is better to compare solvers with some lag, i.e., to base the current recommendation of the best solver on the solvers' performance earlier in the run. An additional finding is a principled method for distributing the computational power among solvers in the portfolio. Comment: in Annals of Mathematics and Artificial Intelligence, Springer Verlag, 201
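The lagged comparison described in this abstract can be illustrated with a minimal sketch. It is not the paper's portfolio algorithm: the toy objective, the solver model `scale / n**rate`, the lag `n // 2`, and the number of resamplings are all assumptions made for illustration. The sketch only shows the mechanics of choosing a solver from its earlier recommendation and then returning that solver's current recommendation.

```python
import random
from statistics import fmean


def noisy_f(x, rng, sigma=0.5):
    """Toy objective x^2 corrupted by additive Gaussian noise (assumption)."""
    return x * x + rng.gauss(0.0, sigma)


def recommendation(solver, n):
    """Hypothetical solver model: after n iterations it recommends scale / n**rate."""
    scale, rate = solver
    return scale / n ** rate


def lagged_choice(solvers, n, rng, resamples=200):
    """Select a solver from its lagged recommendation; return its current one."""
    m = max(1, n // 2)  # lag function: an arbitrary illustrative choice
    scores = []
    for solver in solvers:
        x_old = recommendation(solver, m)
        # Average repeated noisy evaluations of the *earlier* recommendation.
        scores.append(fmean([noisy_f(x_old, rng) for _ in range(resamples)]))
    best = min(range(len(solvers)), key=lambda i: scores[i])
    # The portfolio outputs the selected solver's *current* recommendation.
    return best, recommendation(solvers[best], n)


rng = random.Random(0)
solvers = [(1.0, 1.0), (5.0, 0.1)]  # (scale, rate) pairs, both made up
best, x = lagged_choice(solvers, n=100, rng=rng)
print(best, x)
```

In this toy setting the two solvers are far apart in quality, so the selection is easy; the sketch makes no claim about when or why the lag helps beyond what the abstract states.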

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

Algorithm Portfolios for Noisy Optimization: Compare Solvers Early

Author: A Conn
B Sendhoff
DH Wolpert
DV Arnold
E Nudelman
H Chen
H-G Beyer
L Pulina
M Gagliolo
M Jebalia
MD Grigoriadis
P Auer
RS Sutton
S Bubeck
S Kadioglu
T Lai
V Fabian
Y Jin
Publication venue: HAL CCSD
Publication date: 01/01/2014

Noisy optimization is the optimization of objective functions corrupted by noise. A portfolio of algorithms is a set of algorithms equipped with an algorithm selection tool for distributing the computational power among them. We study portfolios of noisy optimization solvers, show that different settings lead to dramatically different performances, and obtain mathematically proved adaptivity by an ad hoc selection algorithm dedicated to noisy optimization. A somewhat surprising result is that it is better to compare solvers with some lag; i.e., to output the current recommendation of the best solver, where the best solver is selected by comparing the solvers' recommendations from earlier in the run.

HAL-CentraleSupelec

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Algorithm Portfolio for Individual-based Surrogate-Assisted Evolutionary Algorithms

Author: Liu Jialin
Tong Hao
Yao Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/10/2019

Surrogate-assisted evolutionary algorithms (SAEAs) are powerful optimisation tools for computationally expensive problems (CEPs). However, a randomly selected algorithm may fail to solve unknown problems because of the no free lunch theorems, and re-running the algorithm or trying other algorithms to get a better solution consumes more computational resources, which is a more serious issue in CEPs. In this paper, we consider an algorithm portfolio for SAEAs to reduce the risk of choosing an inappropriate algorithm for CEPs. We propose two portfolio frameworks for very expensive problems in which the maximal number of fitness evaluations is only 5 times the problem's dimension. One framework, named Par-IBSAEA, runs all algorithm candidates in parallel, and a more sophisticated framework, named UCB-IBSAEA, employs the Upper Confidence Bound (UCB) policy from reinforcement learning to help select the most appropriate algorithm at each iteration. An effective reward definition is proposed for the UCB policy. We consider three state-of-the-art individual-based SAEAs on different problems and compare them to the portfolios built from their instances on several benchmark problems given limited computation budgets. Our experimental studies demonstrate that our proposed portfolio frameworks significantly outperform any single algorithm on the set of benchmark problems.
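A UCB policy for choosing among candidate algorithms can be sketched as follows. This is the standard UCB1 rule, not the UCB-IBSAEA framework itself: the abstract does not give the paper's reward definition, so the Bernoulli rewards, the function names, and the large budget below are assumptions chosen purely for illustration.

```python
import math
import random


def ucb_select(counts, means, t, c=math.sqrt(2)):
    """Return the index of the algorithm to run at round t (standard UCB1)."""
    for i, n in enumerate(counts):
        if n == 0:
            return i  # try every algorithm once before using the bound
    return max(
        range(len(counts)),
        key=lambda i: means[i] + c * math.sqrt(math.log(t) / counts[i]),
    )


def run_portfolio(reward_fns, budget, rng):
    """Allocate `budget` rounds among algorithms; reward_fns are hypothetical."""
    k = len(reward_fns)
    counts = [0] * k
    means = [0.0] * k
    for t in range(1, budget + 1):
        i = ucb_select(counts, means, t)
        reward = reward_fns[i](rng)
        counts[i] += 1
        means[i] += (reward - means[i]) / counts[i]  # incremental mean update
    return counts, means


def bernoulli(p):
    """Made-up reward source: 1.0 with probability p, otherwise 0.0."""
    return lambda rng: 1.0 if rng.random() < p else 0.0


counts, means = run_portfolio(
    [bernoulli(0.2), bernoulli(0.5), bernoulli(0.8)], 2000, random.Random(0)
)
print(counts, means)
```

The budget of 2000 rounds is far larger than the very small evaluation budgets the abstract targets; it is used here only so that the allocation pattern is visible.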

arXiv.org e-Print Archive

Crossref

Sorting by Swaps with Noisy Comparisons

Author: Gavenčiak Tomáš
Geissmann Barbara
Lengler Johannes
Publication date: 12/03/2018

We study sorting of permutations by random swaps if each comparison gives the wrong result with some fixed probability p < 1/2. We use this process as a prototype for the behaviour of randomized, comparison-based optimization heuristics in the presence of noisy comparisons. As a quality measure, we compute the expected fitness of the stationary distribution. To measure the runtime, we compute the minimal number of steps after which the average fitness approximates the expected fitness of the stationary distribution. We study the process where in each round a random pair of elements at distance at most r is compared. We give theoretical results for the extreme cases r = 1 and r = n, and experimental results for the intermediate cases. We find a trade-off between faster convergence (for large r) and better quality of the solution after convergence (for small r). Comment: An extended abstract of this paper has been presented at Genetic and Evolutionary Computation Conference (GECCO 2017)
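The process described in this abstract can be simulated with a short sketch. Several details are assumptions, since the abstract does not specify them: the way the pair is sampled (a uniform left position, then a uniform partner within distance r, which is not exactly uniform over all admissible pairs), the reversed starting permutation, and the use of the inversion count as a stand-in disorder measure.

```python
import random


def noisy_less(a, b, p, rng):
    """Compare a < b, returning the wrong answer with probability p."""
    truth = a < b
    return truth if rng.random() >= p else not truth


def step(perm, r, p, rng):
    """Compare one random pair at distance at most r; swap if it looks unsorted."""
    n = len(perm)
    i = rng.randrange(n - 1)                  # left position
    j = rng.randint(i + 1, min(n - 1, i + r))  # right position, distance <= r
    if noisy_less(perm[j], perm[i], p, rng):
        perm[i], perm[j] = perm[j], perm[i]


def inversions(perm):
    """Number of out-of-order pairs (assumed disorder measure; 0 means sorted)."""
    n = len(perm)
    return sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])


def simulate(n, r, p, steps, seed):
    """Run the swap process from the reversed permutation and return the result."""
    rng = random.Random(seed)
    perm = list(range(n - 1, -1, -1))
    for _ in range(steps):
        step(perm, r, p, rng)
    return perm


for r in (1, 4, 16):
    final = simulate(n=16, r=r, p=0.2, steps=5000, seed=0)
    print(r, inversions(final))
```

Varying `r` and `p` in the loop gives a rough feel for the trade-off the abstract reports, though a single run per setting is only anecdotal.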

arXiv.org e-Print Archive

Repository for Publications and Research Data

Competitive and cooperative heterogeneous deep reinforcement learning

Author: Jiang J
Long G
Wei P
Zhang C
Zheng H
Publication date: 11/05/2021

Numerous deep reinforcement learning methods have been proposed, including deterministic, stochastic, and evolutionary-based hybrid methods. However, among these various methodologies, there is no clear winner that consistently outperforms the others in every task in terms of effective exploration, sample efficiency, and stability. In this work, we present a competitive and cooperative heterogeneous deep reinforcement learning framework called C2HRL. C2HRL aims to learn a superior agent that exceeds the capabilities of each individual agent in an agent pool through two agent management mechanisms: one competitive, the other cooperative. The competitive mechanism forces agents to compete for computing resources and to explore and exploit diverse regions of the solution space. To support this strategy, resources are distributed to the most suitable agent for that specific task and random seed setting, which results in better sample efficiency and stability. The other mechanism, cooperation, asks heterogeneous agents to share their exploration experiences so that all agents can learn from a diverse set of policies. The experiences are stored in a two-level replay buffer, and the result is an overall more effective exploration strategy. We evaluated C2HRL on a range of continuous control tasks from the MuJoCo benchmark. The experimental results demonstrate that C2HRL has better sample efficiency and greater stability than three state-of-the-art DRL baselines.

OPUS - University of Technology Sydney

Algorithm portfolios for noisy optimization

Author: A Conn
A Prügel-Bennett
Baptiste Rozière
DH Wolpert
DV Arnold
H Chen
HG Beyer
J Spall
Jialin Liu
L Pulina
L Xu
M Gagliolo
Marie-Liesse Cauwet
MD Grigoriadis
Olivier Teytaud
P Auer
R Storn
RS Sutton
T Lai
V Fabian
Y Jin
Publication venue: 'Springer Science and Business Media LLC'

Crossref