Explicit Learning Curves for Transduction and Application to Clustering and Compression Algorithms
Inductive learning is based on inferring a general rule from a finite data
set and using it to label new data. In transduction one attempts to solve the
problem of using a labeled training set to label a set of unlabeled points,
which are given to the learner prior to learning. Although transduction seems
at the outset to be an easier task than induction, there have not been many
provably useful algorithms for transduction. Moreover, the precise relation
between induction and transduction has not yet been determined. The main
theoretical developments related to transduction were presented by Vapnik more
than twenty years ago. One of Vapnik's basic results is a rather tight error
bound for transductive classification based on an exact computation of the
hypergeometric tail. While tight, this bound is given implicitly via a
computational routine. Our first contribution is a somewhat looser but explicit
characterization of a slightly extended PAC-Bayesian version of Vapnik's
transductive bound. This characterization is obtained using concentration
inequalities for the tail of sums of random variables obtained by sampling
without replacement. We then derive error bounds for compression schemes such
as (transductive) support vector machines and for transduction algorithms based
on clustering. The main observation used for deriving these new error bounds
and algorithms is that the unlabeled test points, which in the transductive
setting are known in advance, can be used to construct useful data-dependent
prior distributions over the hypothesis space.
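As an illustration of the machinery behind Vapnik's bound (a hedged sketch, not from the paper; the scipy dependency and all numbers are assumptions): because the training set is sampled without replacement from the full sample, the number of errors a fixed hypothesis makes on the training points follows a hypergeometric distribution, whose tail can be computed exactly.

```python
# Hedged sketch: the exact hypergeometric tail underlying Vapnik-style
# transductive bounds. For a fixed hypothesis making R errors on the full
# sample of N = m + u points, the count of errors falling into a uniformly
# drawn training set of size m is Hypergeometric(N, R, m).
from scipy.stats import hypergeom

def train_error_tail(N, R, m, k):
    """P(at most k of the R total errors land in the m training points)."""
    return hypergeom.cdf(k, N, R, m)  # population N, R "successes", m draws

# Illustrative numbers (assumptions): N = 200 points in all, a hypothesis
# erring on R = 40 of them, a training set of size m = 100. The probability
# of observing at most 10 training errors (so the test error looks far worse)
# is small, which is what makes the bound work:
print(train_error_tail(200, 40, 100, 10))
```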
Multi-space Variational Encoder-Decoders for Semi-supervised Labeled Sequence Transduction
Labeled sequence transduction is the task of transforming one sequence into
another sequence that satisfies desiderata specified by a set of labels. In
this paper we propose multi-space variational encoder-decoders, a new model for
labeled sequence transduction with semi-supervised learning. The generative
model can use neural networks to handle both discrete and continuous latent
variables to exploit various features of data. Experiments show that our model
not only provides a powerful supervised framework but also effectively takes
advantage of the unlabeled data. On the SIGMORPHON morphological inflection
benchmark, it outperforms single-model state-of-the-art results by a large
margin for the majority of languages.
(Comment: Accepted by ACL 2017)
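For intuition, here is a minimal, hypothetical PyTorch sketch of the core idea, an encoder-decoder that combines a continuous Gaussian latent with a relaxed discrete label latent; layer sizes, the Gumbel-softmax relaxation, and all names are illustrative assumptions, not the authors' architecture.

```python
# Hedged sketch (not the paper's code): an encoder-decoder with one
# continuous Gaussian latent z and one discrete label latent y, the
# multi-space idea in miniature. All sizes are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiSpaceVEDSketch(nn.Module):
    def __init__(self, vocab=1000, hid=256, z_dim=64, n_labels=10):
        super().__init__()
        self.embed = nn.Embedding(vocab, hid)
        self.encoder = nn.GRU(hid, hid, batch_first=True)
        self.to_mu = nn.Linear(hid, z_dim)         # continuous latent z
        self.to_logvar = nn.Linear(hid, z_dim)
        self.to_label = nn.Linear(hid, n_labels)   # discrete latent y
        self.init_h = nn.Linear(z_dim + n_labels, hid)
        self.decoder = nn.GRU(hid, hid, batch_first=True)
        self.out = nn.Linear(hid, vocab)

    def forward(self, src, tgt, tau=1.0):
        h, _ = self.encoder(self.embed(src))
        h = h[:, -1]                                # last encoder state
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
        y = F.gumbel_softmax(self.to_label(h), tau=tau)       # relaxed discrete
        h0 = torch.tanh(self.init_h(torch.cat([z, y], -1))).unsqueeze(0)
        dec, _ = self.decoder(self.embed(tgt), h0)
        # KL terms for each latent space would be added to the training loss.
        return self.out(dec), mu, logvar, y
```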
Deep Tree Transductions - A Short Survey
The paper surveys recent extensions of Long Short-Term Memory (LSTM) networks
to handle tree structures, from the perspective of learning non-trivial forms
of isomorph structured transductions. It provides a discussion of modern TreeLSTM
models, showing the effect of the bias induced by the direction of tree
processing. An empirical analysis is performed on real-world benchmarks,
highlighting that no single model is adequate to effectively approach all
transduction problems.
(Comment: To appear in the Proceedings of the 2019 INNS Big Data and Deep
Learning (INNSBDDL 2019). arXiv admin note: text overlap with arXiv:1809.0909)
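For concreteness, below is a minimal sketch of one widely surveyed variant, a Child-Sum TreeLSTM cell in the style of Tai et al. (2015), which processes a tree bottom-up (one instance of the direction-of-processing bias mentioned above); dimensions and names are illustrative assumptions, not code from the survey.

```python
# Hedged sketch: a Child-Sum TreeLSTM cell. Each node combines its input
# with the summed hidden states of its children, with a separate forget
# gate per child. Leaf nodes pass empty (0, h_dim) child tensors.
import torch
import torch.nn as nn

class ChildSumTreeLSTMCell(nn.Module):
    def __init__(self, x_dim, h_dim):
        super().__init__()
        self.iou_x = nn.Linear(x_dim, 3 * h_dim)
        self.iou_h = nn.Linear(h_dim, 3 * h_dim, bias=False)
        self.f_x = nn.Linear(x_dim, h_dim)
        self.f_h = nn.Linear(h_dim, h_dim, bias=False)

    def forward(self, x, child_h, child_c):
        # child_h, child_c: (num_children, h_dim)
        h_tilde = child_h.sum(dim=0)                # sum over children
        i, o, u = torch.chunk(self.iou_x(x) + self.iou_h(h_tilde), 3, dim=-1)
        i, o, u = torch.sigmoid(i), torch.sigmoid(o), torch.tanh(u)
        f = torch.sigmoid(self.f_x(x).unsqueeze(0) + self.f_h(child_h))
        c = i * u + (f * child_c).sum(dim=0)        # per-child forget gates
        return torch.tanh(c) * o, c                 # (h, c) for this node
```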
Distribution matching for transduction
Many transductive inference algorithms assume that distributions over training and test estimates should be related, e.g., by providing a large margin of separation on both sets. We use this idea to design a transduction algorithm that can be used without modification for classification, regression, and structured estimation. At its heart, we exploit the fact that for a good learner the distributions over the outputs on training and test sets should match. This is a classical two-sample problem, which can be solved efficiently in its most general form by using distance measures in Hilbert space. It turns out that a number of existing heuristics can be viewed as special cases of our approach.
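A minimal sketch of the two-sample machinery alluded to here, the kernel distance (maximum mean discrepancy, MMD) between distributions embedded in a reproducing kernel Hilbert space; the RBF kernel choice and all names are assumptions for illustration, not the paper's implementation.

```python
# Hedged sketch: a biased estimate of squared MMD, comparing a learner's
# output distribution on training inputs against its outputs on test inputs.
# A small value means the two output distributions roughly match.
import numpy as np

def rbf_kernel(a, b, gamma=1.0):
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)  # pairwise sq. dists
    return np.exp(-gamma * d2)

def mmd2(x, y, gamma=1.0):
    """Biased estimate of squared MMD between samples x and y."""
    kxx = rbf_kernel(x, x, gamma).mean()
    kyy = rbf_kernel(y, y, gamma).mean()
    kxy = rbf_kernel(x, y, gamma).mean()
    return kxx + kyy - 2 * kxy

# Illustrative usage (random stand-ins for a learner's outputs):
train_out = np.random.randn(100, 1)
test_out = np.random.randn(80, 1)
print(mmd2(train_out, test_out))
```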