36,260 research outputs found
Supervised Dictionary Learning
It is now well established that sparse signal models are well suited to
restoration tasks and can effectively be learned from audio, image, and video
data. Recent research has been aimed at learning discriminative sparse models
instead of purely reconstructive ones. This paper proposes a new step in that
direction, with a novel sparse representation for signals belonging to
different classes in terms of a shared dictionary and multiple class-decision
functions. The linear variant of the proposed model admits a simple
probabilistic interpretation, while its most general variant admits an
interpretation in terms of kernels. An optimization framework for learning all
the components of the proposed model is presented, along with experimental
results on standard handwritten digit and texture classification tasks
Weakly-Supervised Neural Text Classification
Deep neural networks are gaining increasing popularity for the classic text
classification task, due to their strong expressive power and less requirement
for feature engineering. Despite such attractiveness, neural text
classification models suffer from the lack of training data in many real-world
applications. Although many semi-supervised and weakly-supervised text
classification models exist, they cannot be easily applied to deep neural
models and meanwhile support limited supervision types. In this paper, we
propose a weakly-supervised method that addresses the lack of training data in
neural text classification. Our method consists of two modules: (1) a
pseudo-document generator that leverages seed information to generate
pseudo-labeled documents for model pre-training, and (2) a self-training module
that bootstraps on real unlabeled data for model refinement. Our method has the
flexibility to handle different types of weak supervision and can be easily
integrated into existing deep neural models for text classification. We have
performed extensive experiments on three real-world datasets from different
domains. The results demonstrate that our proposed method achieves inspiring
performance without requiring excessive training data and outperforms baseline
methods significantly.Comment: CIKM 2018 Full Pape
Applying Machine Learning to Catalogue Matching in Astrophysics
We present the results of applying automated machine learning techniques to
the problem of matching different object catalogues in astrophysics. In this
study we take two partially matched catalogues where one of the two catalogues
has a large positional uncertainty. The two catalogues we used here were taken
from the HI Parkes All Sky Survey (HIPASS), and SuperCOSMOS optical survey.
Previous work had matched 44% (1887 objects) of HIPASS to the SuperCOSMOS
catalogue.
A supervised learning algorithm was then applied to construct a model of the
matched portion of our catalogue. Validation of the model shows that we
achieved a good classification performance (99.12% correct).
Applying this model, to the unmatched portion of the catalogue found 1209 new
matches. This increases the catalogue size from 1887 matched objects to 3096.
The combination of these procedures yields a catalogue that is 72% matched.Comment: 8 Pages, 5 Figure
- …