Search CORE

7 research outputs found

Is a 4-bit synaptic weight resolution enough? - Constraints on enabling spike-timing dependent plasticity in neuromorphic hardware

Author: Johannes eSchemmel
Karlheinz eMeier
Markus eDiesmann
Markus eDiesmann
Markus eDiesmann
Sven eSchrader
Thomas ePfeil
Tobias C Potjans
Tobias C Potjans
Wiebke ePotjans
Wiebke ePotjans
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2012
Field of study

Large-scale neuromorphic hardware systems typically bear the trade-off between detail level and required chip resources. Especially when implementing spike-timing-dependent plasticity, reduction in resources leads to limitations as compared to floating point precision. By design, a natural modification that saves resources would be reducing synaptic weight resolution. In this study, we give an estimate for the impact of synaptic weight discretization on different levels, ranging from random walks of individual weights to computer simulations of spiking neural networks. The FACETS wafer-scale hardware system offers a 4-bit resolution of synaptic weights, which is shown to be sufficient within the scope of our network benchmark. Our findings indicate that increasing the resolution may not even be useful in light of further restrictions of customized mixed-signal synapses. In addition, variations due to production imperfections are investigated and shown to be uncritical in the context of the presented study. Our results represent a general framework for setting up and configuring hardware-constrained synapses. We suggest how weight discretization could be considered for other backends dedicated to large-scale simulations. Thus, our proposition of a good hardware verification practice may rise synergy effects between hardware developers and neuroscientists

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

Publikationsserver der RWTH Aachen University

Juelich Shared Electronic Resources

An Imperfect Dopaminergic Error Signal Can Drive Temporal-Difference Learning

Author: A Barto
A Garthe
A Hanuschkin
A Soltani
A Soltani
AA Prinz
Abigail Morrison
AG Barto
B Porr
B Seymour
B Seymour
BI Hyland
BJ Knowlton
CA Paladini
CD Fiorillo
D Baras
D Joel
DC Dennett
DJ Foster
DZ Jin
E Brazhnik
E Nordlie
E Vasilaki
EA Ludvig
EM Izhikevich
F Wörgötter
G La Camera
G Morris
G Morris
GS Berns
HC Tuckwell
HE Attalah
HM Bayer
HS Seung
IH Witten
J Brown
J O'Doherty
J Wickens
J Yacubian
JC Horvitz
JC Houk
JC Houk
JC Houk
JC Houk
JJ Hopfield
JL Contreras-Vidal
JM Tepper
JN Reynolds
JNJ Reynolds
K Doya
K Gurney
KD Sethi
KJ Friston
M Dai
M Helias
M Matsumoto
M Matsumoto
M Pessiglione
MA Farries
MA Häusser
Markus Diesmann
MD Humphries
MO Gewaltig
N Bowery
N Frémaux
N Schweighofer
ND Daw
O Arias-Carrion
P Calabresi
P Dayan
P Dayan
P Dayan
P Montague
P Redgrave
P Redgrave
PA Garris
PN Tobler
PR Montague
PR Montague
R Legenstein
R Suri
R VanRullen
RC Froemke
RE Suri
RE Suri
RJ McDonald
RJ Steele
RPN Rao
RS Sutton
RS Sutton
RV Florian
S Fusi
S Pecina
S Schrader
S Sugita
SM Reynolds
SM Reynolds
T Ljungberg
T Nakano
Tim Behrens
V Pawlak
V Pawlak
W Potjans
W Potjans
W Schultz
W Schultz
W Schultz
Wiebke Potjans
X Xie
Y Loewenstein
Y Niv
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

An open problem in the field of computational neuroscience is how to link synaptic plasticity to system-level learning. A promising framework in this context is temporal-difference (TD) learning. Experimental evidence that supports the hypothesis that the mammalian brain performs temporal-difference learning includes the resemblance of the phasic activity of the midbrain dopaminergic neurons to the TD error and the discovery that cortico-striatal synaptic plasticity is modulated by dopamine. However, as the phasic dopaminergic signal does not reproduce all the properties of the theoretical TD error, it is unclear whether it is capable of driving behavior adaptation in complex tasks. Here, we present a spiking temporal-difference learning model based on the actor-critic architecture. The model dynamically generates a dopaminergic signal with realistic firing rates and exploits this signal to modulate the plasticity of synapses as a third factor. The predictions of our proposed plasticity dynamics are in good agreement with experimental results with respect to dopamine, pre- and post-synaptic activity. An analytical mapping from the parameters of our proposed plasticity dynamics to those of the classical discrete-time TD algorithm reveals that the biological constraints of the dopaminergic signal entail a modified TD algorithm with self-adapting learning parameters and an adapting offset. We show that the neuronal network is able to learn a task with sparse positive rewards as fast as the corresponding classical discrete-time TD algorithm. However, the performance of the neuronal network is impaired with respect to the traditional algorithm on a task with both positive and negative rewards and breaks down entirely on a task with purely negative rewards. Our model demonstrates that the asymmetry of a realistic dopaminergic signal enables TD learning when learning is driven by positive rewards but not when driven by negative rewards

Public Library of Science (PLOS)

Crossref

FreiDok plus

Directory of Open Access Journals

PubMed Central

Publikationsserver der RWTH Aachen University

Juelich Shared Electronic Resources

The Francis Crick Institute