Regularization and Optimal Multiclass Learning
The quintessential learning algorithm of empirical risk minimization (ERM) is
known to fail in various settings for which uniform convergence does not
characterize learning. It is therefore unsurprising that the practice of
machine learning is rife with considerably richer algorithmic techniques for
successfully controlling model capacity. Nevertheless, no such technique or
principle has broken away from the pack to characterize optimal learning in
these more general settings.
The purpose of this work is to characterize the role of regularization in
perhaps the simplest setting for which ERM fails: multiclass learning with
arbitrary label sets. Using one-inclusion graphs (OIGs), we exhibit optimal
learning algorithms that dovetail with tried-and-true algorithmic principles:
Occam's Razor as embodied by structural risk minimization (SRM), the principle
of maximum entropy, and Bayesian reasoning. Most notably, we introduce an
optimal learner which relaxes structural risk minimization on two dimensions:
it allows the regularization function to be "local" to datapoints, and uses an
unsupervised learning stage to learn this regularizer at the outset. We justify
these relaxations by showing that they are necessary: removing either dimension
fails to yield a near-optimal learner. We also extract from OIGs a
combinatorial sequence we term the Hall complexity, which is the first to
characterize a problem's transductive error rate exactly.
Lastly, we introduce a generalization of OIGs and the transductive learning
setting to the agnostic case, where we show that optimal orientations of
Hamming graphs -- judged using nodes' outdegrees minus a system of
node-dependent credits -- characterize optimal learners exactly. We demonstrate
that an agnostic version of the Hall complexity again characterizes error rates
exactly, and exhibit an optimal learner using maximum entropy programs.
Comment: 40 pages, 2 figures
Multiclass Classification Using Support Vector Machines
In this thesis, we discuss different SVM methods for multiclass classification and introduce the Divide and Conquer Support Vector Machine (DCSVM) algorithm, which relies on data sparsity in high-dimensional space and performs a smart partitioning of the whole training data set into disjoint subsets that are easily separable. A single prediction performed between two partitions eliminates one or more classes in a single partition, leaving only a reduced number of candidate classes for subsequent steps. The algorithm continues recursively, reducing the number of classes at each step until a final binary decision is made between the last two classes left in the process. In the best-case scenario, our algorithm makes a final decision between k classes in O(log2 k) decision steps; in the worst-case scenario, DCSVM makes a final decision in k - 1 steps.
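The elimination loop described above can be sketched as follows. This is a minimal illustration of the recursive class-elimination idea only, not the DCSVM implementation: a trivial nearest-centroid rule stands in for the trained binary SVM at each step, and the function name and data layout are assumptions for the sake of the example. With one class eliminated per binary decision, a prediction over k classes takes at most k - 1 steps, matching the worst case stated in the abstract (the best case arises when one partition prediction eliminates several classes at once).

```python
import numpy as np

def dcsvm_style_predict(x, centroids):
    """Predict a class by repeated binary elimination (illustrative sketch).

    centroids: dict mapping class label -> class centroid; each centroid
    comparison stands in for one trained binary SVM prediction. Each binary
    decision discards one candidate class, so at most k - 1 decisions are
    needed to pick among k classes.
    """
    candidates = list(centroids)
    while len(candidates) > 1:
        a, b = candidates[0], candidates[1]
        # One "binary SVM" decision: keep the closer class, drop the other.
        dist_a = np.linalg.norm(x - centroids[a])
        dist_b = np.linalg.norm(x - centroids[b])
        candidates.remove(a if dist_a > dist_b else b)
    return candidates[0]

# Example: three hypothetical classes in the plane.
centroids = {0: np.array([0.0, 0.0]),
             1: np.array([5.0, 5.0]),
             2: np.array([10.0, 0.0])}
print(dcsvm_style_predict(np.array([4.8, 5.1]), centroids))  # -> 1
```

In the actual algorithm, each binary predictor is an SVM trained between two data partitions, and data sparsity lets a single such prediction rule out every class concentrated in the losing partition, which is what yields the O(log2 k) best case.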