Search CORE

36,260 research outputs found

Supervised Dictionary Learning

Author: Andrew Zisserman
Francis Bach
Guillermo Sapiro
Jean Ponce
Julien Mairal
Thème Cog
Publication venue
Publication date: 01/01/2008
Field of study

It is now well established that sparse signal models are well suited to restoration tasks and can effectively be learned from audio, image, and video data. Recent research has been aimed at learning discriminative sparse models instead of purely reconstructive ones. This paper proposes a new step in that direction, with a novel sparse representation for signals belonging to different classes in terms of a shared dictionary and multiple class-decision functions. The linear variant of the proposed model admits a simple probabilistic interpretation, while its most general variant admits an interpretation in terms of kernels. An optimization framework for learning all the components of the proposed model is presented, along with experimental results on standard handwritten digit and texture classification tasks

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

University of Minnesota Digital Conservancy

Weakly-Supervised Neural Text Classification

Author: Han Jiawei
Meng Yu
Shen Jiaming
Zhang Chao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/09/2018
Field of study

Deep neural networks are gaining increasing popularity for the classic text classification task, due to their strong expressive power and less requirement for feature engineering. Despite such attractiveness, neural text classification models suffer from the lack of training data in many real-world applications. Although many semi-supervised and weakly-supervised text classification models exist, they cannot be easily applied to deep neural models and meanwhile support limited supervision types. In this paper, we propose a weakly-supervised method that addresses the lack of training data in neural text classification. Our method consists of two modules: (1) a pseudo-document generator that leverages seed information to generate pseudo-labeled documents for model pre-training, and (2) a self-training module that bootstraps on real unlabeled data for model refinement. Our method has the flexibility to handle different types of weak supervision and can be easily integrated into existing deep neural models for text classification. We have performed extensive experiments on three real-world datasets from different domains. The results demonstrate that our proposed method achieves inspiring performance without requiring excessive training data and outperforms baseline methods significantly.Comment: CIKM 2018 Full Pape

arXiv.org e-Print Archive

Crossref

Applying Machine Learning to Catalogue Matching in Astrophysics

Author: Andreon
Bazell
Bertin
Bishop
Budavári
Cristianini
D. J. Rohde
Drinkwater
Fellegi
Guyon
Hambly
Hambly
Joachims
M. J. Drinkwater
M. R. Gallagher
M. T. Doyle
Meyer
Richards
Rohde
Schlkopf
Sutherland
T. Downs
Tagliaferri
Vapnik
Voisin
Wakamatsu
Zwaan
Publication venue: 'Wiley'
Publication date: 01/04/2005
Field of study

We present the results of applying automated machine learning techniques to the problem of matching different object catalogues in astrophysics. In this study we take two partially matched catalogues where one of the two catalogues has a large positional uncertainty. The two catalogues we used here were taken from the HI Parkes All Sky Survey (HIPASS), and SuperCOSMOS optical survey. Previous work had matched 44% (1887 objects) of HIPASS to the SuperCOSMOS catalogue. A supervised learning algorithm was then applied to construct a model of the matched portion of our catalogue. Validation of the model shows that we achieved a good classification performance (99.12% correct). Applying this model, to the unmatched portion of the catalogue found 1209 new matches. This increases the catalogue size from 1887 matched objects to 3096. The combination of these procedures yields a catalogue that is 72% matched.Comment: 8 Pages, 5 Figure

arXiv.org e-Print Archive

Crossref

CERN Document Server