PAC-Bayesian Compression Bounds on the Prediction Error of Learning Algorithms for Classification

Author: A. Cannon
A. D. Wyner
C. Cortes
J. Rissanen
J. Shawe-Taylor
John Shawe-Taylor
M. Tipping
N. Littlestone
R. Herbrich
Ralf Herbrich
S. Floyd
T. M. Cover
Thore Graepel
W. Hoeffding
Publication venue: Springer Science and Business Media LLC

On the proliferation of support vectors in high dimensions

Author: Hsu Daniel
Muthukumar Vidya
Xu Ji
Publication date: 22/09/2020

The support vector machine (SVM) is a well-established classification method whose name refers to the particular training examples, called support vectors, that determine the maximum margin separating hyperplane. The SVM classifier is known to enjoy good generalization properties when the number of support vectors is small compared to the number of training examples. However, recent research has shown that in sufficiently high-dimensional linear classification problems, the SVM can generalize well despite a proliferation of support vectors where all training examples are support vectors. In this paper, we identify new deterministic equivalences for this phenomenon of support vector proliferation, and use them to (1) substantially broaden the conditions under which the phenomenon occurs in high-dimensional settings, and (2) prove a nearly matching converse result.

arXiv.org e-Print Archive

Active Nearest-Neighbor Learning in Metric Spaces

Author: Kontorovich Aryeh
Sabato Sivan
Urner Ruth
Publication date: 01/06/2017

We propose a pool-based non-parametric active learning algorithm for general metric spaces, called MArgin Regularized Metric Active Nearest Neighbor (MARMANN), which outputs a nearest-neighbor classifier. We give prediction error guarantees that depend on the noisy-margin properties of the input sample, and are competitive with those obtained by previously proposed passive learners. We prove that the label complexity of MARMANN is significantly lower than that of any passive learner with similar error guarantees. MARMANN is based on a generalized sample compression scheme, and a new label-efficient active model-selection procedure.

arXiv.org e-Print Archive

MPG.PuRe