2,339 research outputs found
Catastrophic forgetting: still a problem for DNNs
We investigate the performance of DNNs when trained on class-incremental
visual problems consisting of initial training, followed by retraining with
added visual classes. Catastrophic forgetting (CF) behavior is measured using a
new evaluation procedure that aims at an application-oriented view of
incremental learning. In particular, it imposes that model selection must be
performed on the initial dataset alone, as well as demanding that retraining
control be performed only using the retraining dataset, as initial dataset is
usually too large to be kept. Experiments are conducted on class-incremental
problems derived from MNIST, using a variety of different DNN models, some of
them recently proposed to avoid catastrophic forgetting. When comparing our new
evaluation procedure to previous approaches for assessing CF, we find their
findings are completely negated, and that none of the tested methods can avoid
CF in all experiments. This stresses the importance of a realistic empirical
measurement procedure for catastrophic forgetting, and the need for further
research in incremental learning for DNNs.Comment: 10 pages, 11 figures, Artificial Neural Networks and Machine Learning
- ICANN 201
A note on confirmation and Matthew properties
There are numerous (Bayesian) confirmation measures in the literature. Festa provides a formal characterization of a certain class of such measures. He calls the members of this class “incremental measures”. Festa then introduces six rather interesting properties called “Matthew properties” and puts forward two theses, hereafter “T1” and “T2”, concerning which of the various extant incremental measures have which of the various Matthew properties. Festa’s discussion is potentially helpful with the problem of measure sensitivity. I argue, that, while Festa’s discussion is illuminating on the whole and worthy of careful study, T1 and T2 are strictly speaking incorrect (though on the right track) and should be rejected in favor of two similar but distinct theses
Unified Probabilistic Deep Continual Learning through Generative Replay and Open Set Recognition
We introduce a probabilistic approach to unify open set recognition with the
prevention of catastrophic forgetting in deep continual learning, based on
variational Bayesian inference. Our single model combines a joint probabilistic
encoder with a generative model and a linear classifier that get shared across
sequentially arriving tasks. In order to successfully distinguish unseen
unknown data from trained known tasks, we propose to bound the class specific
approximate posterior by fitting regions of high density on the basis of
correctly classified data points. These bounds are further used to
significantly alleviate catastrophic forgetting by avoiding samples from low
density areas in generative replay. Our approach requires neither storing of
old, nor upfront knowledge of future data, and is empirically validated on
visual and audio tasks in class incremental, as well as cross-dataset scenarios
across modalities
A Novel Progressive Multi-label Classifier for Classincremental Data
In this paper, a progressive learning algorithm for multi-label
classification to learn new labels while retaining the knowledge of previous
labels is designed. New output neurons corresponding to new labels are added
and the neural network connections and parameters are automatically
restructured as if the label has been introduced from the beginning. This work
is the first of the kind in multi-label classifier for class-incremental
learning. It is useful for real-world applications such as robotics where
streaming data are available and the number of labels is often unknown. Based
on the Extreme Learning Machine framework, a novel universal classifier with
plug and play capabilities for progressive multi-label classification is
developed. Experimental results on various benchmark synthetic and real
datasets validate the efficiency and effectiveness of our proposed algorithm.Comment: 5 pages, 3 figures, 4 table
iCaRL: Incremental Classifier and Representation Learning
A major open problem on the road to artificial intelligence is the
development of incrementally learning systems that learn about more and more
concepts over time from a stream of data. In this work, we introduce a new
training strategy, iCaRL, that allows learning in such a class-incremental way:
only the training data for a small number of classes has to be present at the
same time and new classes can be added progressively. iCaRL learns strong
classifiers and a data representation simultaneously. This distinguishes it
from earlier works that were fundamentally limited to fixed data
representations and therefore incompatible with deep learning architectures. We
show by experiments on CIFAR-100 and ImageNet ILSVRC 2012 data that iCaRL can
learn many classes incrementally over a long period of time where other
strategies quickly fail.Comment: Accepted paper at CVPR 201
- …
