Search CORE

18,992 research outputs found

Progressive growing of self-organized hierarchical representations for exploration

Author: Etcheverry Mayalen
Oudeyer Pierre-Yves
Reinke Chris
Publication venue
Publication date: 13/05/2020
Field of study

Designing agent that can autonomously discover and learn a diversity of structures and skills in unknown changing environments is key for lifelong machine learning. A central challenge is how to learn incrementally representations in order to progressively build a map of the discovered structures and re-use it to further explore. To address this challenge, we identify and target several key functionalities. First, we aim to build lasting representations and avoid catastrophic forgetting throughout the exploration process. Secondly we aim to learn a diversity of representations allowing to discover a "diversity of diversity" of structures (and associated skills) in complex high-dimensional environments. Thirdly, we target representations that can structure the agent discoveries in a coarse-to-fine manner. Finally, we target the reuse of such representations to drive exploration toward an "interesting" type of diversity, for instance leveraging human guidance. Current approaches in state representation learning rely generally on monolithic architectures which do not enable all these functionalities. Therefore, we present a novel technique to progressively construct a Hierarchy of Observation Latent Models for Exploration Stratification, called HOLMES. This technique couples the use of a dynamic modular model architecture for representation learning with intrinsically-motivated goal exploration processes (IMGEPs). The paper shows results in the domain of automated discovery of diverse self-organized patterns, considering as testbed the experimental framework from Reinke et al. (2019)

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Empowerment for Continuous Agent-Environment Systems

Author: Anthony T.
Anthony T.
Brafman R.
Daniel Polani
Der R.
Der R.
Dietterich T.G.
Ernst D.
Girard A.
Kaplan F.
Klyubin A.S.
Klyubin A.S.
Lagoudakis M.G.
Lungarella M.
Peter Stone
Prokopenko M.
Rasmussen C.E.
Schmidhuber J.
Singh S.
Sutton R.
Tishby N.
Tobias Jung
Publication venue
Publication date: 14/01/2011
Field of study

This paper develops generalizations of empowerment to continuous states. Empowerment is a recently introduced information-theoretic quantity motivated by hypotheses about the efficiency of the sensorimotor loop in biological organisms, but also from considerations stemming from curiosity-driven learning. Empowemerment measures, for agent-environment systems with stochastic transitions, how much influence an agent has on its environment, but only that influence that can be sensed by the agent sensors. It is an information-theoretic generalization of joint controllability (influence on environment) and observability (measurement by sensors) of the environment by the agent, both controllability and observability being usually defined in control theory as the dimensionality of the control/observation spaces. Earlier work has shown that empowerment has various interesting and relevant properties, e.g., it allows us to identify salient states using only the dynamics, and it can act as intrinsic reward without requiring an external reward. However, in this previous work empowerment was limited to the case of small-scale and discrete domains and furthermore state transition probabilities were assumed to be known. The goal of this paper is to extend empowerment to the significantly more important and relevant case of continuous vector-valued state spaces and initially unknown state transition probabilities. The continuous state space is addressed by Monte-Carlo approximation; the unknown transitions are addressed by model learning and prediction for which we apply Gaussian processes regression with iterated forecasting. In a number of well-known continuous control tasks we examine the dynamics induced by empowerment and include an application to exploration and online model learning

arXiv.org e-Print Archive

Crossref

University of Hertfordshire Research Archive

Curiosity in exploring chemical spaces: Intrinsic rewards for deep molecular reinforcement learning

Author: Aspuru-Guzik A.
Krenn M.
Nigam A.
Thiede L.
Publication venue: 'IOP Publishing'
Publication date: 25/07/2022
Field of study

Computer-aided design of molecules has the potential to disrupt the field of drug and material discovery. Machine learning, and deep learning, in particular, have been topics where the field has been developing at a rapid pace. Reinforcement learning is a particularly promising approach since it allows for molecular design without prior knowledge. However, the search space is vast and efficient exploration is desirable when using reinforcement learning agents. In this study, we propose an algorithm to aid efficient exploration. The algorithm is inspired by a concept known in the literature as curiosity. We show on three benchmarks that a curious agent finds better performing molecules. This indicates an exciting new research direction for reinforcement learning agents that can explore the chemical space out of their own motivation. This has the potential to eventually lead to unexpected new molecules that no human has thought about so far

MPG.PuRe

Changing the Environment Based on Empowerment as Intrinsic Motivation

Author: Anthony
Arimoto
Ay
Ay
Blahut
Browne
Capdepuy
Christoph Salge
Cornelius Glackin
Cover
Csikszentmihalyi
Daniel Polani
Der
Der
Gallagher
Gibson James
Gordon
Kaplan
Klyubin
Oudeyer
Pearl
Persson
Pfeifer
Ryan
Salge
Salge
Salge
Schmidhuber
Scott-Phillips
Seligman
Shalizi
Shannon
Steels
Sutton
Telatar
Varela
Von Foerster
Von Uexku¨ll
Williams
Wissner-Gross
Wright
Zahedi
Publication venue: 'MDPI AG'
Publication date: 01/05/2014
Field of study

This is an open access article distributed under the Creative Commons Attribution License CC BY 3.0 which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.One aspect of intelligence is the ability to restructure your own environment so that the world you live in becomes more beneficial to you. In this paper we investigate how the information-theoretic measure of agent empowerment can provide a task-independent, intrinsic motivation to restructure the world. We show how changes in embodiment and in the environment change the resulting behaviour of the agent and the artefacts left in the world. For this purpose, we introduce an approximation of the established empowerment formalism based on sparse sampling, which is simpler and significantly faster to compute for deterministic dynamics. Sparse sampling also introduces a degree of randomness into the decision making process, which turns out to beneficial for some cases. We then utilize the measure to generate agent behaviour for different agent embodiments in a Minecraft-inspired three dimensional block world. The paradigmatic results demonstrate that empowerment can be used as a suitable generic intrinsic motivation to not only generate actions in given static environments, as shown in the past, but also to modify existing environmental conditions. In doing so, the emerging strategies to modify an agent’s environment turn out to be meaningful to the specific agent capabilities, i.e., de facto to its embodiment.Peer reviewedFinal Published versio

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

University of Hertfordshire Research Archive