Search CORE

2,265 research outputs found

Learning by message-passing in networks of discrete synapses

Author: A. Engel
Alfredo Braunstein
D. J. Amit
D. J. Amit
D. J. Thouless
D. J. C. MacKay
D. O. Hebb
E. Amaldi
E. Barkai
J. Hertz
P. Dayan
Riccardo Zecchina
W. Krauth
Y. Kabashima
Publication venue: 'American Physical Society (APS)'
Publication date: 09/12/2005
Field of study

We show that a message-passing process allows to store in binary "material" synapses a number of random patterns which almost saturates the information theoretic bounds. We apply the learning algorithm to networks characterized by a wide range of different connection topologies and of size comparable with that of biological systems (e.g.

n\simeq10^{5}-10^{6}

). The algorithm can be turned into an on-line --fault tolerant-- learning protocol of potential interest in modeling aspects of synaptic plasticity and in building neuromorphic devices.Comment: 4 pages, 3 figures; references updated and minor corrections; accepted in PR

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

On the role of synaptic stochasticity in training low-precision neural networks

Author: Baldassi Carlo
Gerace Federica
Kappen Hilbert J.
Lucibello Carlo
Saglietti Luca
Tartaglione Enzo
Zecchina Riccardo
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2018
Field of study

Stochasticity and limited precision of synaptic weights in neural network models are key aspects of both biological and hardware modeling of learning processes. Here we show that a neural network model with stochastic binary weights naturally gives prominence to exponentially rare dense regions of solutions with a number of desirable properties such as robustness and good generalization performance, while typical solutions are isolated and hard to find. Binary solutions of the standard perceptron problem are obtained from a simple gradient descent procedure on a set of real values parametrizing a probability distribution over the binary synapses. Both analytical and numerical results are presented. An algorithmic extension aimed at training discrete deep neural networks is also investigated.Comment: 7 pages + 14 pages of supplementary materia

arXiv.org e-Print Archive

Archivio istituzionale della Ricerca - Bocconi

HAL Descartes

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Radboud Repository

Shaping the learning landscape in neural networks around wide flat minima

Author: Baldassi Carlo
Pittorino Fabrizio
Zecchina Riccardo
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2020
Field of study

Learning in Deep Neural Networks (DNN) takes place by minimizing a non-convex high-dimensional loss function, typically by a stochastic gradient descent (SGD) strategy. The learning process is observed to be able to find good minimizers without getting stuck in local critical points, and that such minimizers are often satisfactory at avoiding overfitting. How these two features can be kept under control in nonlinear devices composed of millions of tunable connections is a profound and far reaching open question. In this paper we study basic non-convex one- and two-layer neural network models which learn random patterns, and derive a number of basic geometrical and algorithmic features which suggest some answers. We first show that the error loss function presents few extremely wide flat minima (WFM) which coexist with narrower minima and critical points. We then show that the minimizers of the cross-entropy loss function overlap with the WFM of the error loss. We also show examples of learning devices for which WFM do not exist. From the algorithmic perspective we derive entropy driven greedy and message passing algorithms which focus their search on wide flat regions of minimizers. In the case of SGD and cross-entropy loss, we show that a slow reduction of the norm of the weights along the learning process also leads to WFM. We corroborate the results by a numerical study of the correlations between the volumes of the minimizers, their Hessian and their generalization performance on real data.Comment: 37 pages (16 main text), 10 figures (7 main text

arXiv.org e-Print Archive

Archivio istituzionale della Ricerca - Bocconi

Archivio istituzionale della ricerca - Politecnico di Milano

Optimal learning rules for discrete synapses

Author: Barrett Adam B
van Rossum M C W
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2008
Field of study

There is evidence that biological synapses have a limited number of discrete weight states. Memory storage with such synapses behaves quite differently from synapses with unbounded, continuous weights, as old memories are automatically overwritten by new memories. Consequently, there has been substantial discussion about how this affects learning and storage capacity. In this paper, we calculate the storage capacity of discrete, bounded synapses in terms of Shannon information. We use this to optimize the learning rules and investigate how the maximum information capacity depends on the number of synapses, the number of synaptic states, and the coding sparseness. Below a certain critical number of synapses per neuron (comparable to numbers found in biology), we find that storage is similar to unbounded, continuous synapses. Hence, discrete synapses do not necessarily have lower storage capacity

CiteSeerX

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Sussex Research Online