Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents

Author: Conti Edoardo
Madhavan Vashisht
Such Felipe Petroski
Lehman Joel
Stanley Kenneth O.
Clune Jeff
Publication date: 29/10/2018

Evolution strategies (ES) are a family of black-box optimization algorithms able to train deep neural networks roughly as well as Q-learning and policy gradient methods on challenging deep reinforcement learning (RL) problems, but are much faster (e.g. hours vs. days) because they parallelize better. However, many RL problems require directed exploration because they have reward functions that are sparse or deceptive (i.e. contain local optima), and it is unknown how to encourage such exploration with ES. Here we show that algorithms that have been invented to promote directed exploration in small-scale evolved neural networks via populations of exploring agents, specifically novelty search (NS) and quality diversity (QD) algorithms, can be hybridized with ES to improve its performance on sparse or deceptive deep RL tasks, while retaining scalability. Our experiments confirm that the resultant new algorithms, NS-ES and two QD algorithms, NSR-ES and NSRA-ES, avoid local optima encountered by ES to achieve higher performance on Atari and simulated robots learning to walk around a deceptive trap. This paper thus introduces a family of fast, scalable algorithms for reinforcement learning that are capable of directed exploration. It also adds this new family of exploration algorithms to the RL toolbox and raises the interesting possibility that analogous algorithms with multiple simultaneous paths of exploration might also combine well with existing RL algorithms outside ES.
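
The idea of hybridizing ES with a novelty signal can be illustrated with a minimal sketch. It is not the authors' implementation: the mirrored-sampling gradient estimate, the k-nearest-neighbour novelty measure, the fixed blending weight, and all names (`novelty`, `es_step`, `blended_score`, `reward_fn`, `behavior_fn`) are assumptions chosen for illustration.

```python
import math
import random


def novelty(behavior, archive, k=3):
    """Mean Euclidean distance from `behavior` to its k nearest archive entries.

    Assumed novelty measure; returns 0.0 for an empty archive.
    """
    if not archive:
        return 0.0
    dists = sorted(math.dist(behavior, other) for other in archive)
    nearest = dists[:k]
    return sum(nearest) / len(nearest)


def es_step(theta, score_fn, rng, sigma=0.1, lr=0.05, pairs=20):
    """One ES update using mirrored (antithetic) Gaussian perturbations.

    `score_fn` maps a parameter list to a scalar to be maximized.
    """
    grad = [0.0] * len(theta)
    for _ in range(pairs):
        eps = [rng.gauss(0.0, 1.0) for _ in theta]
        plus = score_fn([t + sigma * e for t, e in zip(theta, eps)])
        minus = score_fn([t - sigma * e for t, e in zip(theta, eps)])
        scale = (plus - minus) / (2.0 * sigma)
        for i, e in enumerate(eps):
            grad[i] += scale * e / pairs
    return [t + lr * g for t, g in zip(theta, grad)]


def blended_score(reward_fn, behavior_fn, archive, weight=0.5, k=3):
    """Build a score that mixes reward and novelty (hypothetical weighting)."""
    def score(params):
        nov = novelty(behavior_fn(params), archive, k)
        return (1.0 - weight) * reward_fn(params) + weight * nov
    return score


if __name__ == "__main__":
    rng = random.Random(0)
    theta = [0.0]
    # Reward-only ES on a toy objective with its maximum at 3.0.
    for _ in range(100):
        theta = es_step(theta, lambda p: -(p[0] - 3.0) ** 2, rng)
    print(theta)
```

Passing `blended_score(...)` to `es_step` in place of a pure reward function pushes the update toward parameters whose behavior is far from previously archived behaviors, which is the general mechanism the abstract describes. How the archive is maintained and how the weight is set or adapted are left open here.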

arXiv.org e-Print Archive

FigShare

Stochastic optimization of a cold atom experiment using a genetic algorithm

Author: A. Perrin
Baker J. E.
Ch. Koller
Haupt R. L.
J. Schmiedmayer
Ketterle W.
M. Göbel
Mühlenein H.
Pohlheim H.
R. Bücker
S. Manz
T. Betz
T. Schumm
W. Rohringer
Weibull J. W.
Weise T.
Publication venue: 'AIP Publishing'
Publication date: 15/01/2009

We employ an evolutionary algorithm to automatically optimize different stages of a cold atom experiment without human intervention. This approach closes the loop between computer-based experimental control systems and automatic real-time analysis and can be applied to a wide range of experimental situations. The genetic algorithm quickly and reliably converges to the best-performing parameter set independent of the starting population. Especially in many-dimensional or connected parameter spaces the automatic optimization outperforms a manual search. Comment: 4 pages, 3 figures
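
A closed optimization loop of this kind can be sketched as a small genetic algorithm. The sketch below is an illustration under stated assumptions, not the procedure used in the experiment: truncation selection, uniform crossover, Gaussian mutation, elitism, and every name and default value (`evolve`, `fitness`, `bounds`, `mutation_scale`, `elite`) are hypothetical. The `fitness` callable stands in for whatever figure of merit the real-time analysis would return.

```python
import random


def evolve(fitness, bounds, rng, pop_size=20, generations=40,
           mutation_scale=0.1, elite=2):
    """Maximize `fitness` over box-bounded parameters with a simple GA.

    `bounds` is a list of (low, high) pairs, one per parameter.
    """
    # Random starting population inside the allowed parameter ranges.
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]             # truncation selection
        children = [list(p) for p in ranked[:elite]]  # elitism: keep the best
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Uniform crossover: each parameter comes from either parent.
            child = [rng.choice(pair) for pair in zip(a, b)]
            # Gaussian mutation, clipped back into the allowed range.
            child = [
                min(hi, max(lo, g + rng.gauss(0.0, mutation_scale * (hi - lo))))
                for g, (lo, hi) in zip(child, bounds)
            ]
            children.append(child)
        pop = children
    return max(pop, key=fitness)


if __name__ == "__main__":
    # Toy stand-in for a measured figure of merit, peaked at (0.3, 0.7).
    def toy_fitness(p):
        return -((p[0] - 0.3) ** 2 + (p[1] - 0.7) ** 2)

    for seed in (1, 2):
        best = evolve(toy_fitness, [(0.0, 1.0), (0.0, 1.0)], random.Random(seed))
        print(seed, best)
```

Running the toy example with different seeds gives different starting populations that end near the same optimum. In a real apparatus each fitness evaluation is a noisy, costly measurement, so a practical version would cache or average measurements rather than re-evaluate as freely as this sketch does.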

arXiv.org e-Print Archive