12,846 research outputs found
Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence
Incremental learning (IL) has received a lot of attention recently, however,
the literature lacks a precise problem definition, proper evaluation settings,
and metrics tailored specifically for the IL problem. One of the main
objectives of this work is to fill these gaps so as to provide a common ground
for better understanding of IL. The main challenge for an IL algorithm is to
update the classifier whilst preserving existing knowledge. We observe that, in
addition to forgetting, a known issue while preserving knowledge, IL also
suffers from a problem we call intransigence, inability of a model to update
its knowledge. We introduce two metrics to quantify forgetting and
intransigence that allow us to understand, analyse, and gain better insights
into the behaviour of IL algorithms. We present RWalk, a generalization of
EWC++ (our efficient version of EWC [Kirkpatrick2016EWC]) and Path Integral
[Zenke2017Continual] with a theoretically grounded KL-divergence based
perspective. We provide a thorough analysis of various IL algorithms on MNIST
and CIFAR-100 datasets. In these experiments, RWalk obtains superior results in
terms of accuracy, and also provides a better trade-off between forgetting and
intransigence
Detection of exomoons in simulated light curves with a regularized convolutional neural network
Many moons have been detected around planets in our Solar System, but none
has been detected unambiguously around any of the confirmed extrasolar planets.
We test the feasibility of a supervised convolutional neural network to
classify photometric transit light curves of planet-host stars and identify
exomoon transits, while avoiding false positives caused by stellar variability
or instrumental noise. Convolutional neural networks are known to have
contributed to improving the accuracy of classification tasks. The network
optimization is typically performed without studying the effect of noise on the
training process. Here we design and optimize a 1D convolutional neural network
to classify photometric transit light curves. We regularize the network by the
total variation loss in order to remove unwanted variations in the data
features. Using numerical experiments, we demonstrate the benefits of our
network, which produces results comparable to or better than the standard
network solutions. Most importantly, our network clearly outperforms a
classical method used in exoplanet science to identify moon-like signals. Thus
the proposed network is a promising approach for analyzing real transit light
curves in the future
- …