Learning Control of Quantum Systems

Author: A Acín
C Altafini
C Brif
C Chen
C Wu
CC Shu
D Dong
D D’Alessandro
E Zahedinejad
G Jäger
H Ma
H Rabitz
HM Wiseman
JS Li
M Bukov
N Khaneja
R Chakrabarti
R Sutton
R Wu
RB Wu
RS Judson
S Kuang
SJ Glaser
T Fösel
T Schulte-Herbrüggen
X Xing
Y Guo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/01/2021

This paper provides a brief introduction to learning control of quantum systems. It outlines four approaches: gradient-based learning for optimal control of quantum systems, evolutionary computation for learning control of quantum systems, learning-based quantum robust control, and reinforcement learning for quantum control. Comment: 9 pages

arXiv.org e-Print Archive

Crossref

Generating Interpretable Fuzzy Controllers using Particle Swarm Optimization and Genetic Programming

Author: Alba E.
Hein D.
Kennedy J.
Koshiyama A.S.
Koza J.R.
Le N.
Ramos L.S.
Tesmer M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 29/04/2018

Autonomously training interpretable control strategies, called policies, using pre-existing plant trajectory data is of great interest in industrial applications. Fuzzy controllers have been used in industry for decades as interpretable and efficient system controllers. In this study, we introduce a fuzzy genetic programming (GP) approach called fuzzy GP reinforcement learning (FGPRL) that can select the relevant state features, determine the size of the required fuzzy rule set, and automatically adjust all the controller parameters simultaneously. Each GP individual's fitness is computed using model-based batch reinforcement learning (RL), which first trains a model using available system samples and subsequently performs Monte Carlo rollouts to predict each policy candidate's performance. We compare FGPRL to an extended version of a related method called fuzzy particle swarm reinforcement learning (FPSRL), which uses swarm intelligence to tune the fuzzy policy parameters. Experiments using an industrial benchmark show that FGPRL is able to autonomously learn interpretable fuzzy policies with high control performance. Comment: Accepted at Genetic and Evolutionary Computation Conference 2018 (GECCO '18)
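The fitness evaluation described in this abstract (fit a model to recorded samples, then score each policy candidate by Monte Carlo rollouts on that model) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: a one-dimensional linear system stands in for the plant, a least-squares fit stands in for the learned model, and plain Python functions stand in for fuzzy policies. The names `fit_linear_model`, `rollout_fitness`, `damping_policy`, and `idle_policy` are hypothetical and do not come from the paper.

```python
import random


def fit_linear_model(samples):
    """Least-squares fit of s_next ~ a*s + b*u from (s, u, s_next) samples."""
    s_ss = sum(s * s for s, u, y in samples)
    s_su = sum(s * u for s, u, y in samples)
    s_uu = sum(u * u for s, u, y in samples)
    s_sy = sum(s * y for s, u, y in samples)
    s_uy = sum(u * y for s, u, y in samples)
    det = s_ss * s_uu - s_su ** 2  # determinant of the 2x2 normal equations
    a = (s_sy * s_uu - s_uy * s_su) / det
    b = (s_uy * s_ss - s_sy * s_su) / det
    return a, b


def rollout_fitness(policy, model, n_rollouts=20, horizon=30, noise=0.01, seed=0):
    """Average return of a policy over noisy rollouts on the learned model."""
    a, b = model
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_rollouts):
        s = rng.uniform(-1.0, 1.0)  # random start state
        ret = 0.0
        for _ in range(horizon):
            u = policy(s)
            s = a * s + b * u + rng.gauss(0.0, noise)
            ret -= s * s  # assumed reward: stay close to zero
        total += ret
    return total / n_rollouts


# Batch of pre-recorded transitions from an assumed toy plant s' = 0.9*s + 0.5*u.
data_rng = random.Random(42)
samples = []
for _ in range(200):
    s = data_rng.uniform(-1.0, 1.0)
    u = data_rng.uniform(-1.0, 1.0)
    samples.append((s, u, 0.9 * s + 0.5 * u))

model = fit_linear_model(samples)


def damping_policy(s):
    """Candidate that pushes the state back towards zero."""
    return -1.8 * s


def idle_policy(s):
    """Candidate that never acts."""
    return 0.0


fitness_damping = rollout_fitness(damping_policy, model)
fitness_idle = rollout_fitness(idle_policy, model)
```

In an evolutionary loop, a value such as `fitness_damping` would play the role of an individual's fitness; no interaction with the real plant is needed once the model is fitted.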

arXiv.org e-Print Archive

Crossref

Learning to Generate Genotypes with Neural Networks

Author: Churchill AW
Fernando C
Sigtia S
Publication date: 14/04/2016

Neural networks and evolutionary computation have a rich intertwined history. They most commonly appear together when an evolutionary algorithm optimises the parameters and topology of a neural network for reinforcement learning problems, or when a neural network is applied as a surrogate fitness function to aid the evolutionary optimisation of expensive fitness functions. In this paper we take a different approach, asking whether a neural network can be used to provide a mutation distribution for an evolutionary algorithm, and what advantages this approach may offer. Two modern neural network models are investigated: a Denoising Autoencoder modified to produce stochastic outputs, and the Neural Autoregressive Distribution Estimator. Results show that the neural network approach to learning genotypes is able to solve many difficult discrete problems, such as MaxSat and HIFF, and regularly outperforms other evolutionary techniques.
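The general idea of sampling offspring from a model learned on the selected parents can be sketched as follows. This is not the method of the paper: independent per-bit probabilities (a much simpler, UMDA-style model) are swapped in for the Denoising Autoencoder and NADE, and OneMax is swapped in for MaxSat and HIFF. All function names and parameter values are hypothetical.

```python
import random


def onemax(bits):
    """Toy fitness: number of ones in the genotype."""
    return sum(bits)


def fit_marginals(parents, eps=0.05):
    """Learn one Bernoulli probability per bit from the selected parents."""
    n_bits = len(parents[0])
    probs = []
    for i in range(n_bits):
        p = sum(ind[i] for ind in parents) / len(parents)
        probs.append(min(1.0 - eps, max(eps, p)))  # keep some exploration
    return probs


def sample_genotype(probs, rng):
    """Draw a new genotype from the learned distribution."""
    return [1 if rng.random() < p else 0 for p in probs]


def evolve(n_bits=20, pop_size=100, n_parents=30, generations=30, seed=1):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    best = max(pop, key=onemax)
    for _ in range(generations):
        # Truncation selection, then model fitting, then sampling of offspring.
        parents = sorted(pop, key=onemax, reverse=True)[:n_parents]
        probs = fit_marginals(parents)
        pop = [sample_genotype(probs, rng) for _ in range(pop_size)]
        best = max(pop + [best], key=onemax)
    return best


best = evolve()
```

Replacing `fit_marginals` and `sample_genotype` with a trained generative network is where an approach like the one in the abstract would differ; a model that captures dependencies between bits matters on problems such as HIFF, where independent marginals are not expected to suffice.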

arXiv.org e-Print Archive

Queen Mary Research Online

Evolving Inborn Knowledge For Fast Adaptation in Dynamic POMDP Problems

Author: Bengio Samy
Blynel Jesper
Duan Yan
Floreano Dario
Mishra Nikhil
Rakelly Kate
Rothfuss Jonas
Soltoggio Andrea
Thrun Sebastian
Werbos Paul J
Zintgraf Luisa
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 28/04/2020

Rapid online adaptation to changing tasks is an important problem in machine learning and, recently, a focus of meta-reinforcement learning. However, reinforcement learning (RL) algorithms struggle in POMDP environments because the state of the system, essential in an RL framework, is not always visible. Additionally, hand-designed meta-RL architectures may not include suitable computational structures for specific learning problems. The evolution of online learning mechanisms, by contrast, can incorporate learning strategies into an agent that can (i) evolve memory when required and (ii) optimize adaptation speed to specific online learning problems. In this paper, we exploit the highly adaptive nature of neuromodulated neural networks to evolve a controller that uses the latent space of an autoencoder in a POMDP. The analysis of the evolved networks reveals the ability of the proposed algorithm to acquire inborn knowledge in a variety of aspects, such as the detection of cues that reveal implicit rewards and the ability to evolve location neurons that help with navigation. The integration of inborn knowledge and online plasticity enabled fast adaptation and better performance in comparison to some non-evolutionary meta-reinforcement learning algorithms. The algorithm also proved successful in the 3D gaming environment Malmo Minecraft. Comment: 9 pages. Accepted as a full paper in the Genetic and Evolutionary Computation Conference (GECCO 2020)

arXiv.org e-Print Archive

Crossref

Loughborough University Institutional Repository

The University of Manchester - Institutional Repository

From the social learning theory to a social learning algorithm for global optimization

Author: Gong Yue-Jiao
Li Yun
Zhang Jun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014

Traditionally, the Evolutionary Computation (EC) paradigm is inspired by Darwinian evolution or the swarm intelligence of animals. Bandura's Social Learning Theory pointed out that the social learning behavior of humans indicates a high level of intelligence in nature. We found that such intelligence of human society can be implemented by numerical computing and utilized in computational algorithms for solving optimization problems. In this paper, we design a novel and generic optimization approach that mimics the social learning process of humans. Emulating the observational learning and reinforcement behaviors, a virtual society deployed in the algorithm seeks the strongest behavioral patterns with the best outcome. This corresponds to searching for the best solution in solving optimization problems. Experimental studies in this paper showed the appealing search behavior of this human intelligence-inspired approach, which can reach the global optimum even in ill conditions. The effectiveness and high efficiency of the proposed algorithm have further been verified by comparison with some representative EC algorithms and variants on a set of benchmarks.

Crossref

Enlighten

Evolving Plasticity for Autonomous Learning under Changing Environmental Conditions

Author: Coler Matt
Fletcher George
Iacca Giovanni
Mocanu Decebal Constantin
Pechenizkiy Mykola
Yaman Anil
Publication venue: 'MIT Press - Journals'
Publication date: 07/12/2020

A fundamental aspect of learning in biological neural networks is the plasticity property, which allows them to modify their configurations during their lifetime. Hebbian learning is a biologically plausible mechanism for modeling the plasticity property in artificial neural networks (ANNs), based on the local interactions of neurons. However, the emergence of a coherent global learning behavior from local Hebbian plasticity rules is not very well understood. The goal of this work is to discover interpretable local Hebbian learning rules that can provide autonomous global learning. To achieve this, we use a discrete representation to encode the learning rules in a finite search space. These rules are then used to perform synaptic changes, based on the local interactions of the neurons. We employ genetic algorithms to optimize these rules to allow learning on two separate tasks (a foraging and a prey-predator scenario) in online lifetime learning settings. The resulting evolved rules converged into a set of well-defined interpretable types, which are thoroughly discussed. Notably, the performance of these rules, while adapting the ANNs during the learning tasks, is comparable to that of offline learning methods such as hill climbing. Comment: Evolutionary Computation Journal
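One way to picture a discretely encoded local plasticity rule is a small lookup table that maps the activity of the two neurons at a synapse to a direction of weight change. The sketch below assumes binary activations, a three-valued change (-1, 0, +1), a fixed learning rate, and weight clipping; the abstract does not specify the encoding, so all of these, together with the names `HEBB_RULE` and `apply_rule`, are illustrative assumptions.

```python
ETA = 0.1  # assumed learning rate

# Hypothetical rule: (pre_active, post_active) -> direction of weight change.
# With 4 activity pairs and 3 possible directions, this encoding spans
# 3**4 = 81 candidate rules, a finite space a genetic algorithm could search.
HEBB_RULE = {
    (0, 0): 0,   # both silent: no change
    (0, 1): -1,  # only post active: weaken
    (1, 0): -1,  # only pre active: weaken
    (1, 1): 1,   # both active: strengthen (classic Hebbian case)
}


def apply_rule(weights, pre, post, rule=HEBB_RULE, eta=ETA, w_min=-1.0, w_max=1.0):
    """Return updated weights; weights[i][j] links pre-neuron i to post-neuron j."""
    updated = []
    for i, row in enumerate(weights):
        new_row = []
        for j, w in enumerate(row):
            w = w + eta * rule[(pre[i], post[j])]
            new_row.append(max(w_min, min(w_max, w)))  # clip to the allowed range
        updated.append(new_row)
    return updated


# One pre-synaptic neuron (active) and two post-synaptic neurons (active, silent).
new_weights = apply_rule([[0.0, 0.0]], pre=[1], post=[1, 0])
```

Each update uses only the two activations at the synapse, which is what makes the rule local; an outer evolutionary loop would score a rule by how well a network adapts under it during its lifetime.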

arXiv.org e-Print Archive

Proceedings - University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen