1,039 research outputs found
Symmetry-Based Disentangled Representation Learning requires Interaction with Environments
Finding a generally accepted formal definition of a disentangled representation in the context of an agent behaving in an environment is an important challenge towards the construction of data-efficient autonomous agents. Higgins et al. (2018) recently proposed Symmetry-Based Disentangled Representation Learning, a definition based on a characterization of symmetries in the environment using group theory. We build on their work and make observations, theoretical and empirical, that lead us to argue that Symmetry-Based Disentangled Representation Learning cannot only be based on static observations: agents should interact with the environment to discover its symmetries. Our experiments can be reproduced in Colab and the code is available on GitHub
Recommended from our members
Continual State Representation Learning for Reinforcement Learning using Generative Replay
We consider the problem of building a state representation model in a continual fashion. As the environment changes, the aim is to efficiently compress the sensory state's information without losing past knowledge. The learned features are then fed to a Reinforcement Learning algorithm to learn a policy. We propose to use Variational Auto-Encoders for state representation, and Generative Replay, i.e. the use of generated samples, to maintain past knowledge. We also provide a general and statistically sound method for automatic environment change detection. Our method provides efficient state representation as well as forward transfer, and avoids catastrophic forgetting. The resulting model is capable of incrementally learning information without using past data and with a bounded system size
Recommended from our members
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning
Flatlandis a simple, lightweight environment for fastprototyping and testing of reinforcement learning agents. It is oflower complexity compared to similar 3D platforms (e.g. Deep-Mind Lab or VizDoom), but emulates physical properties of thereal world, such as continuity, multi-modal partially-observablestates with first-person view and coherent physics. We proposeto use it as an intermediary benchmark for problems related toLifelong Learning.Flatlandis highly customizable and offers awide range of task difficulty to extensively evaluate the propertiesof artificial agents. We experiment with three reinforcementlearning baseline agents and show that they can rapidly solvea navigation task inFlatland. A video of an agent acting inFlatlandis available here: https://youtu.be/I5y6Y2ZypdA
Recommended from our members
Generative Models from the perspective of Continual Learning
Which generative model is the most suitablefor Continual Learning? This paper aims at evaluating andcomparing generative models on disjoint sequential imagegeneration tasks. We investigate how several models learn andforget, considering various strategies: rehearsal, regularization,generative replay and fine-tuning. We used two quantitativemetrics to estimate the generation quality and memory ability.We experiment with sequential tasks on three commonly usedbenchmarks for Continual Learning (MNIST, Fashion MNISTand CIFAR10). We found that among all models, the originalGAN performs best and among Continual Learning strategies,generative replay outperforms all other methods. Even ifwe found satisfactory combinations on MNIST and FashionMNIST, training generative models sequentially on CIFAR10is particularly instable, and remains a challenge. Our code isavailable online
NUV/Blue spectral observations of sprites in the 320-460 nm region: (2PG) Emissions
A near-ultraviolet (NUV) spectrograph (320-460 nm) was flown on the EXL98
aircraft sprite observation campaign during July 1998. In this wavelength range
video rate (60 fields/sec) spectrographic observations found the NUV/blue
emissions to be predominantly N2 (2PG). The negligible level of N2+ (1NG)
present in the spectrum is confirmed by observations of a co-aligned, narrowly
filtered 427.8 nm imager and is in agreement with previous ground-based
filtered photometer observations. The synthetic spectral fit to the
observations indicates a characteristic energy of ~1.8 eV, in agreement with
our other NUV observations.Comment: 7 pages, 2 figures, 1 table, JGR Space Physics "Effects of
Thunderstorms and Lightning in the Upper Atmosphere" Special Sectio
Recommended from our members
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning
Flatlandis a simple, lightweight environment for fastprototyping and testing of reinforcement learning agents. It is oflower complexity compared to similar 3D platforms (e.g. Deep-Mind Lab or VizDoom), but emulates physical properties of thereal world, such as continuity, multi-modal partially-observablestates with first-person view and coherent physics. We proposeto use it as an intermediary benchmark for problems related toLifelong Learning.Flatlandis highly customizable and offers awide range of task difficulty to extensively evaluate the propertiesof artificial agents. We experiment with three reinforcementlearning baseline agents and show that they can rapidly solvea navigation task inFlatland. A video of an agent acting inFlatlandis available here: https://youtu.be/I5y6Y2ZypdA
Grounding knowledge and normative valuation in agent-based action and scientific commitment
Philosophical investigation in synthetic biology has focused on the knowledge-seeking questions pursued, the kind of engineering techniques used, and on the ethical impact of the products produced. However, little work has been done to investigate the processes by which these epistemological, metaphysical, and ethical forms of inquiry arise in the course of synthetic biology research. An attempt at this work relying on a particular area of synthetic biology will be the aim of this chapter. I focus on the reengineering of metabolic pathways through the manipulation and construction of small DNA-based devices and systems synthetic biology. Rather than focusing on the engineered products or ethical principles that result, I will investigate the processes by which these arise. As such, the attention will be directed to the activities of practitioners, their manipulation of tools, and the use they make of techniques to construct new metabolic devices. Using a science-in-practice approach, I investigate problems at the intersection of science, philosophy of science, and sociology of science. I consider how practitioners within this area of synthetic biology reconfigure biological understanding and ethical categories through active modelling and manipulation of known functional parts, biological pathways for use in the design of microbial machines to solve problems in medicine, technology, and the environment. We might describe this kind of problem-solving as relying on what Helen Longino referred to as “social cognition” or the type of scientific work done within what Hasok Chang calls “systems of practice”. My aim in this chapter will be to investigate the relationship that holds between systems of practice within metabolic engineering research and social cognition. I will attempt to show how knowledge and normative valuation are generated from this particular network of practitioners. In doing so, I suggest that the social nature of scientific inquiry is ineliminable to both knowledge acquisition and ethical evaluations
- …