Search CORE

2,413 research outputs found

Short-term plasticity as cause-effect hypothesis testing in distal reward learning

Author: Soltoggio Andrea
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/09/2014
Field of study

Asynchrony, overlaps and delays in sensory-motor signals introduce ambiguity as to which stimuli, actions, and rewards are causally related. Only the repetition of reward episodes helps distinguish true cause-effect relationships from coincidental occurrences. In the model proposed here, a novel plasticity rule employs short and long-term changes to evaluate hypotheses on cause-effect relationships. Transient weights represent hypotheses that are consolidated in long-term memory only when they consistently predict or cause future rewards. The main objective of the model is to preserve existing network topologies when learning with ambiguous information flows. Learning is also improved by biasing the exploration of the stimulus-response space towards actions that in the past occurred before rewards. The model indicates under which conditions beliefs can be consolidated in long-term memory, it suggests a solution to the plasticity-stability dilemma, and proposes an interpretation of the role of short-term plasticity.Comment: Biological Cybernetics, September 201

arXiv.org e-Print Archive

Loughborough University Institutional Repository

Learning with Delayed Synaptic Plasticity

Author: Fletcher George
Iacca Giovanni
Mocanu Decebal Constantin
Pechenizkiy Mykola
Yaman Anil
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 17/04/2019
Field of study

The plasticity property of biological neural networks allows them to perform learning and optimize their behavior by changing their configuration. Inspired by biology, plasticity can be modeled in artificial neural networks by using Hebbian learning rules, i.e. rules that update synapses based on the neuron activations and reinforcement signals. However, the distal reward problem arises when the reinforcement signals are not available immediately after each network output to associate the neuron activations that contributed to receiving the reinforcement signal. In this work, we extend Hebbian plasticity rules to allow learning in distal reward cases. We propose the use of neuron activation traces (NATs) to provide additional data storage in each synapse to keep track of the activation of the neurons. Delayed reinforcement signals are provided after each episode relative to the networks' performance during the previous episode. We employ genetic algorithms to evolve delayed synaptic plasticity (DSP) rules and perform synaptic updates based on NATs and delayed reinforcement signals. We compare DSP with an analogous hill climbing algorithm that does not incorporate domain knowledge introduced with the NATs, and show that the synaptic updates performed by the DSP rules demonstrate more effective training performance relative to the HC algorithm.Comment: GECCO201

arXiv.org e-Print Archive

Crossref

Pure OAI Repository

Rare neural correlations implement robotic conditioning with delayed rewards and disturbances

Author: Andre Lemme (7167692)
Andrea Soltoggio (1248822)
Felix Reinhart (7168535)
Jochen Steil (5426480)
Publication venue
Publication date: 01/01/2013
Field of study

Neural conditioning associates cues and actions with following rewards. The environments in which robots operate, however, are pervaded by a variety of disturbing stimuli and uncertain timing. In particular, variable reward delays make it difficult to reconstruct which previous actions are responsible for following rewards. Such an uncertainty is handled by biological neural networks, but represents a challenge for computational models, suggesting the lack of a satisfactory theory for robotic neural conditioning. The present study demonstrates the use of rare neural correlations in making correct associations between rewards and previous cues or actions. Rare correlations are functional in selecting sparse synapses to be eligible for later weight updates if a reward occurs. The repetition of this process singles out the associating and reward-triggering pathways, and thereby copes with distal rewards. The neural network displays macro-level classical and operant conditioning, which is demonstrated in an interactive real-life human-robot interaction. The proposed mechanism models realistic conditioning in humans and animals and implements similar behaviors in neuro-robotic platforms

Loughborough University Institutional Repository

Deep Reinforcement Learning with Modulated Hebbian plus Q Network Architecture

Author: Ben-Iwhiwhu Eseoghene
Dick Jeffery
Hu Yang
Ketz Nicholas
Kolouri Soheil
Krichmar Jeffrey L.
Ladosz Pawel
Pilly Praveen
Soltoggio Andrea
Publication venue
Publication date: 24/09/2021
Field of study

This paper presents a new neural architecture that combines a modulated Hebbian network (MOHN) with DQN, which we call modulated Hebbian plus Q network architecture (MOHQA). The hypothesis is that such a combination allows MOHQA to solve difficult partially observable Markov decision process (POMDP) problems which impair temporal difference (TD)-based RL algorithms such as DQN, as the TD error cannot be easily derived from observations. The key idea is to use a Hebbian network with bio-inspired neural traces in order to bridge temporal delays between actions and rewards when confounding observations and sparse rewards result in inaccurate TD errors. In MOHQA, DQN learns low level features and control, while the MOHN contributes to the high-level decisions by associating rewards with past states and actions. Thus the proposed architecture combines two modules with significantly different learning algorithms, a Hebbian associative network and a classical DQN pipeline, exploiting the advantages of both. Simulations on a set of POMDPs and on the MALMO environment show that the proposed algorithm improved DQN's results and even outperformed control tests with A2C, QRDQN+LSTM and REINFORCE algorithms on some POMDPs with confounding stimuli and sparse rewards

arXiv.org e-Print Archive

Loughborough University Institutional Repository

ScholarWorks@UNIST

Real-time hebbian learning from autoencoder features for control tasks

Author: Andrea Soltoggio (1248822)
Justin K. Pugh (7169441)
Kenneth O. Stanley (7167806)
Publication venue
Publication date: 01/01/2014
Field of study

Neural plasticity and in particular Hebbian learning play an important role in many research areas related to artficial life. By allowing artificial neural networks (ANNs) to adjust their weights in real time, Hebbian ANNs can adapt over their lifetime. However, even as researchers improve and extend Hebbian learning, a fundamental limitation of such systems is that they learn correlations between preexisting static features and network outputs. A Hebbian ANN could in principle achieve significantly more if it could accumulate new features over its lifetime from which to learn correlations. Interestingly, autoencoders, which have recently gained prominence in deep learning, are themselves in effect a kind of feature accumulator that extract meaningful features from their inputs. The insight in this paper is that if an autoencoder is connected to a Hebbian learning layer, then the resulting Realtime Autoencoder-Augmented Hebbian Network (RAAHN) can actually learn new features (with the autoencoder) while simultaneously learning control policies from those new features (with the Hebbian layer) in real time as an agent experiences its environment. In this paper, the RAAHN is shown in a simulated robot maze navigation experiment to enable a controller to learn the perfect navigation strategy significantly more often than several Hebbian-based variant approaches that lack the autoencoder. In the long run, this approach opens up the intriguing possibility of real-time deep learning for control

CiteSeerX

Loughborough University Institutional Repository

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Editorial: Neural plasticity for rich and uncertain robotic information streams

Author: Andrea Soltoggio (1248822)
Frank van der Velde (7167842)
Publication venue
Publication date: 01/01/2015
Field of study

Editorial: Neural plasticity for rich and uncertain robotic information stream

Loughborough University Institutional Repository

Neural plasticity for rich and uncertain robotic information streams

Author: Soltoggio A.
Van der Velde F.
Publication venue
Publication date: 01/01/2015
Field of study

Leiden University Scholary Publications

Reward-modulated Hebbian plasticity as leverage for partially embodied control in compliant robotics

Author: Burms Jeroen
Caluwaerts Ken
Dambre Joni
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2015
Field of study

In embodied computation (or morphological computation), part of the complexity of motor control is offloaded to the body dynamics. We demonstrate that a simple Hebbian-like learning rule can be used to train systems with (partial) embodiment, and can be extended outside of the scope of traditional neural networks. To this end, we apply the learning rule to optimize the connection weights of recurrent neural networks with different topologies and for various tasks. We then apply this learning rule to a simulated compliant tensegrity robot by optimizing static feedback controllers that directly exploit the dynamics of the robot body. This leads to partially embodied controllers, i.e., hybrid controllers that naturally integrate the computations that are performed by the robot body into a neural network architecture. Our results demonstrate the universal applicability of reward-modulated Hebbian learning. Furthermore, they demonstrate the robustness of systems trained with the learning rule. This study strengthens our belief that compliant robots should or can be seen as computational units, instead of dumb hardware that needs a complex controller. This link between compliant robotics and neural networks is also the main reason for our search for simple universal learning rules for both neural networks and robotics

Ghent University Academic Bibliography

Frontiers - Publisher Connector

PubMed Central

A differential Hebbian framework for biologically-plausible motor control

Author: DeSchutter Erik
Dorrell William
Verduzco-Flores Sergio
Publication venue
Publication date: 10/11/2021
Field of study

In the realm of motor control, artificial agents cannot match the performance of their biological counterparts. We thus explore a neural control architecture that is both biologically plausible, and capable of fully autonomous learning. The architecture consists of feedback controllers that learn to achieve a desired state by selecting the errors that should drive them. This selection happens through a family of differential Hebbian learning rules that, through interaction with the environment, can learn to control systems where the error responds monotonically to the control signal. We next show that in a more general case, neural reinforcement learning can be coupled with a feedback controller to reduce errors that arise non-monotonically from the control signal. The use of feedback control reduces the complexity of the reinforcement learning problem, because only a desired value must be learned, with the controller handling the details of how it is reached. This makes the function to be learned simpler, potentially allowing to learn more complex actions. We discuss how this approach could be extended to hierarchical architectures.Comment: 35 pages, 10 figures. Appendix: 9 pages, 2 figure

arXiv.org e-Print Archive

OIST Institutional Repository