398 research outputs found

Loss of Plasticity in Deep Continual Learning

Authors: Dohare Shibhansh; Hernandez-Garcia J. Fernando; Mahmood A. Rupam; Rahman Parash; Sutton Richard S.
Publication date: 18/08/2023

Modern deep-learning systems are specialized to problem settings in which training occurs once and then never again, as opposed to continual-learning settings in which training occurs continually. If deep-learning systems are applied in a continual learning setting, then it is well known that they may fail to remember earlier examples. More fundamental, but less well known, is that they may also lose their ability to learn on new examples, a phenomenon called loss of plasticity. We provide direct demonstrations of loss of plasticity using the MNIST and ImageNet datasets repurposed for continual learning as sequences of tasks. In ImageNet, binary classification performance dropped from 89% accuracy on an early task down to 77%, about the level of a linear network, on the 2000th task. Loss of plasticity occurred with a wide range of deep network architectures, optimizers, activation functions, batch normalization, dropout, but was substantially eased by $L^2$-regularization, particularly when combined with weight perturbation. Further, we introduce a new algorithm -- continual backpropagation -- which slightly modifies conventional backpropagation to reinitialize a small fraction of less-used units after each example and appears to maintain plasticity indefinitely.
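
The selective reinitialization step mentioned in this abstract can be illustrated with a minimal NumPy sketch for a single hidden layer. This is not the authors' implementation: the abstract does not say how "less-used" is measured, so the utility measure (a running average of absolute activation times summed absolute outgoing weight), the maturity threshold, the initialization range, and all function and parameter names are assumptions made here for illustration.

```python
import numpy as np


def update_utility(utility, hidden, w_out, decay=0.99):
    """Update a running per-unit utility estimate in place (assumed measure).

    hidden: (n_hidden,) activations for one example
    w_out:  (n_hidden, n_outputs) outgoing weights
    """
    contribution = np.abs(hidden) * np.abs(w_out).sum(axis=1)
    utility *= decay
    utility += (1.0 - decay) * contribution


def reinit_low_utility_units(w_in, w_out, utility, age, rng,
                             replacement_rate=0.01, maturity_threshold=100):
    """Reset a small fraction of mature, low-utility hidden units in place.

    w_in:  (n_inputs, n_hidden) incoming weights
    w_out: (n_hidden, n_outputs) outgoing weights
    utility: (n_hidden,) float array; age: (n_hidden,) int array
    Returns the indices of the units that were reset.
    """
    # Only units older than the (assumed) maturity threshold are eligible.
    mature = np.flatnonzero(age > maturity_threshold)
    n_replace = int(replacement_rate * mature.size)
    if n_replace == 0:
        return np.array([], dtype=int)

    # Pick the eligible units with the lowest utility.
    chosen = mature[np.argsort(utility[mature])[:n_replace]]

    # Fresh random incoming weights; zero outgoing weights so the reset
    # does not immediately change the network's output.
    n_inputs = w_in.shape[0]
    bound = 1.0 / np.sqrt(n_inputs)
    w_in[:, chosen] = rng.uniform(-bound, bound, size=(n_inputs, n_replace))
    w_out[chosen, :] = 0.0
    utility[chosen] = 0.0
    age[chosen] = 0
    return chosen
```

In a training loop, one would call `update_utility` and increment `age` after each example's ordinary gradient step, then call `reinit_low_utility_units`. Protecting recently reset units through the age counter is a design choice of this sketch, intended to give new units time to become useful before they can be reset again.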

arXiv.org e-Print Archive

Sparse Neural Network Training with In-Time Over-Parameterization

Author: Liu Shiwei
Publication venue: Eindhoven University of Technology
Publication date: 06/04/2022

Pure OAI Repository

FALF ConvNets: Fatuous auxiliary loss based filter-pruning for efficient deep CNNs

Authors: Kadi Vinay Sameer Raja; Namboodiri Vinay P.; Singh Pravendra
Publication venue: Elsevier BV
Publication date: 01/01/2020

OPUS

A Synaptic Pruning-Based Spiking Neural Network for Hand-Written Digits Classification

Authors: Alashwal Hany; Faghihi Faramarz; Moustafa Ahmed A.
Publication venue: Frontiers Media SA
Publication date: 24/02/2022

A spiking neural network model inspired by synaptic pruning is developed and trained to extract features of hand-written digits. The network is composed of three spiking neural layers and one output neuron whose firing rate is used for classification. The model detects and collects the geometric features of the images from the Modified National Institute of Standards and Technology database (MNIST). In this work, a novel learning rule is developed to train the network to detect features of different digit classes. For this purpose, randomly initialized synaptic weights between the first and second layers are updated using average firing rates of pre- and postsynaptic neurons. Then, using a neuroscience-inspired mechanism named “synaptic pruning” and its predefined threshold values, some of the synapses are deleted. Hence, sparse matrices named “information channels” are constructed so that they show highly specific patterns for each digit class as connection matrices between the first and second layers. The “information channels” are used in the test phase to assign a digit class to each test image. In addition, the role of feed-back inhibition as well as the connectivity rates of the second and third neural layers are studied. Similar to the human ability to learn from a small number of training trials, the developed spiking neural network needs a very small dataset for training, compared to the conventional deep learning methods that have shown very good performance on the MNIST dataset. This work introduces a new class of brain-inspired spiking neural networks to extract the features of complex data images.
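
The two-stage procedure this abstract describes (a rate-based weight update followed by threshold pruning) can be sketched roughly as below. The abstract does not give the learning rule itself, so the Hebbian-style product of pre- and postsynaptic rates used here is a stand-in assumption, and the function names, learning rate, and threshold value are hypothetical.

```python
import numpy as np


def rate_based_update(weights, pre_rates, post_rates, lr=0.1):
    """Return updated weights (assumed Hebbian-style rule, not the paper's).

    weights:    (n_pre, n_post) synaptic weights between two layers
    pre_rates:  (n_pre,) average firing rates of presynaptic neurons
    post_rates: (n_post,) average firing rates of postsynaptic neurons
    """
    return weights + lr * np.outer(pre_rates, post_rates)


def prune_synapses(weights, threshold):
    """Delete synapses whose weight falls below a predefined threshold.

    Returns the pruned weight matrix and the boolean mask of surviving
    synapses; the sparse result plays the role of an "information channel".
    """
    mask = weights >= threshold
    return np.where(mask, weights, 0.0), mask


# Usage: two presynaptic and two postsynaptic neurons, weights starting at zero.
w = rate_based_update(np.zeros((2, 2)),
                      pre_rates=np.array([1.0, 0.0]),
                      post_rates=np.array([1.0, 1.0]))
channel, kept = prune_synapses(w, threshold=0.05)
```

In this toy case only the synapses leaving the active presynaptic neuron survive pruning, which is the kind of class-specific sparsity pattern the abstract attributes to the information channels.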

Bond University Research Portal

PubMed Central