Search CORE

5,343 research outputs found

NAIS-Net: Stable Deep Networks from Non-Autonomous Differential Equations

Author: Ciccone Marco
Gallieri Marco
Gomez Faustino
Masci Jonathan
Osendorfer Christian
Publication venue
Publication date: 01/01/2018
Field of study

This paper introduces Non-Autonomous Input-Output Stable Network (NAIS-Net), a very deep architecture where each stacked processing block is derived from a time-invariant non-autonomous dynamical system. Non-autonomy is implemented by skip connections from the block input to each of the unrolled processing stages and allows stability to be enforced so that blocks can be unrolled adaptively to a pattern-dependent processing depth. NAIS-Net induces non-trivial, Lipschitz input-output maps, even for an infinite unroll length. We prove that the network is globally asymptotically stable so that for every initial condition there is exactly one input-dependent equilibrium assuming tanh units, and multiple stable equilibria for ReL units. An efficient implementation that enforces the stability under derived conditions for both fully-connected and convolutional layers is also presented. Experimental results show how NAIS-Net exhibits stability in practice, yielding a significant reduction in generalization gap compared to ResNets.Comment: NIPS 201

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Complexity without chaos: Plasticity within random recurrent networks generates robust timing and motor control

Author: A Banerjee
A Litwin-Kumar
C Miall
C van Vreeswijk
CA Skarda
CV Buhusi
D Bueti
D Durstewitz
D Durstewitz
D Sussillo
DA Crowe
Dean V Buonomano
DJ Watts
DV Buonomano
DV Buonomano
DV Buonomano
DV Buonomano
DV Buonomano
E Pastalkova
EM Izhikevich
G Boffetta
G Fagiolo
H Jaeger
H Jaeger
H Kantz
H Merchant
H Sompolinsky
J Coull
JF Medina
JF Medina
JJ Hopfield
JK Liu
JX Li
K Rajan
M London
M Monteforte
M Rabinovich
MA Long
MB Ahrens
MD Mauk
MM Churchland
MS Goldman
MS Matell
N Brunel
P Janssen
P Simen
RB Ivry
RB Ivry
RHR Hahnloser
RM Church
Rodrigo Laje
S Ganguli
S Song
XJ Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/10/2012
Field of study

It is widely accepted that the complex dynamics characteristic of recurrent neural circuits contributes in a fundamental manner to brain function. Progress has been slow in understanding and exploiting the computational power of recurrent dynamics for two main reasons: nonlinear recurrent networks often exhibit chaotic behavior and most known learning rules do not work in robust fashion in recurrent networks. Here we address both these problems by demonstrating how random recurrent networks (RRN) that initially exhibit chaotic dynamics can be tuned through a supervised learning rule to generate locally stable neural patterns of activity that are both complex and robust to noise. The outcome is a novel neural network regime that exhibits both transiently stable and chaotic trajectories. We further show that the recurrent learning rule dramatically increases the ability of RRNs to generate complex spatiotemporal motor patterns, and accounts for recent experimental data showing a decrease in neural variability in response to stimulus onset

arXiv.org e-Print Archive

Integer Echo State Networks: Hyperdimensional Reservoir Computing

Author: Frady Edward Paxon
Kleyko Denis
Osipov Evgeny
Publication venue
Publication date: 22/09/2018
Field of study

We propose an approximation of Echo State Networks (ESN) that can be efficiently implemented on digital hardware based on the mathematics of hyperdimensional computing. The reservoir of the proposed Integer Echo State Network (intESN) is a vector containing only n-bits integers (where n<8 is normally sufficient for a satisfactory performance). The recurrent matrix multiplication is replaced with an efficient cyclic shift operation. The intESN architecture is verified with typical tasks in reservoir computing: memorizing of a sequence of inputs; classifying time-series; learning dynamic processes. Such an architecture results in dramatic improvements in memory footprint and computational efficiency, with minimal performance loss.Comment: 10 pages, 10 figures, 1 tabl

arXiv.org e-Print Archive

Learning to Recognize Actions from Limited Training Examples Using a Recurrent Spiking Neural Model

Author: Panda Priyadarshini
Srinivasa Narayan
Publication venue
Publication date: 19/10/2017
Field of study

A fundamental challenge in machine learning today is to build a model that can learn from few examples. Here, we describe a reservoir based spiking neural model for learning to recognize actions with a limited number of labeled videos. First, we propose a novel encoding, inspired by how microsaccades influence visual perception, to extract spike information from raw video data while preserving the temporal correlation across different frames. Using this encoding, we show that the reservoir generalizes its rich dynamical activity toward signature action/movements enabling it to learn from few training examples. We evaluate our approach on the UCF-101 dataset. Our experiments demonstrate that our proposed reservoir achieves 81.3%/87% Top-1/Top-5 accuracy, respectively, on the 101-class data while requiring just 8 video examples per class for training. Our results establish a new benchmark for action recognition from limited video examples for spiking neural models while yielding competetive accuracy with respect to state-of-the-art non-spiking neural models.Comment: 13 figures (includes supplementary information

arXiv.org e-Print Archive

Directory of Open Access Journals

Frontiers - Publisher Connector

Extracting finite structure from infinite language

Author: A.A. Hopgood
Baretto
Bengio
Chomsky
Cleeremans
Cohen
Collier
Elman
Gold
Hopcroft
Hopgood
J.A. Tepper
Kohonen
Kolen
Lawrence
Marcus
Omlin
Pinker
Sharkey
T. McQueen
T.J. Allen
Voegtlin
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

This paper presents a novel connectionist memory-rule based model capable of learning the finite-state properties of an input language from a set of positive examples. The model is based upon an unsupervised recurrent self-organizing map [T. McQueen, A. Hopgood, J. Tepper, T. Allen, A recurrent self-organizing map for temporal sequence processing, in: Proceedings of Fourth International Conference in Recent Advances in Soft Computing (RASC2002), Nottingham, 2002] with laterally interconnected neurons. A derivation of functionalequivalence theory [J. Hopcroft, J. Ullman, Introduction to Automata Theory, Languages and Computation, vol. 1, Addison-Wesley, Reading, MA, 1979] is used that allows the model to exploit similarities between the future context of previously memorized sequences and the future context of the current input sequence. This bottom-up learning algorithm binds functionally related neurons together to form states. Results show that the model is able to learn the Reber grammar [A. Cleeremans, D. Schreiber, J. McClelland, Finite state automata and simple recurrent networks, Neural Computation, 1 (1989) 372–381] perfectly from a randomly generated training set and to generalize to sequences beyond the length of those found in the training set

Crossref

Nottingham Trent Institutional Repository (IRep)

De Montfort University Open Research Archive