Search CORE

3,245 research outputs found

On the Consistency of Output Code Based Learning Algorithms for Multiclass Learning Problems

Author: Agarwal Shivani
Babu Balaji Srinivasan
Ramaswamy Harish G.
Williamson Robert
Publication venue: JMLR
Publication date: 14/06/2016
Field of study

Multiclass Learning with Simplex Coding

Author: Mroueh Youssef
Poggio Tomaso
Rosasco Lorenzo
Slotine Jean-Jacques
Publication venue
Publication date: 01/01/2012
Field of study

In this paper we discuss a novel framework for multiclass learning, defined by a suitable coding/decoding strategy, namely the simplex coding, that allows to generalize to multiple classes a relaxation approach commonly used in binary classification. In this framework, a relaxation error analysis can be developed avoiding constraints on the considered hypotheses class. Moreover, we show that in this setting it is possible to derive the first provably consistent regularized method with training/tuning complexity which is independent to the number of classes. Tools from convex analysis are introduced that can be used beyond the scope of this paper

arXiv.org e-Print Archive

DSpace@MIT

Archivio istituzionale della ricerca - Università di Genova

Inhibition in multiclass classification

Author: Bottou L.
Chang Y.-W.
Charles Elkan
Dempster A. P.
José M. Amigó
Kivinen J.
LeCun Y.
Lugosi G.
Platt J. C.
Platt J. C.
Ramón Huerta
Rifkin R.
Shankar Vembu
Smith B. H.
Tewari A.
Thomas Nowotny
Tsochantaridis I.
Weston J.
Publication venue: 'MIT Press - Journals'
Publication date: 01/09/2012
Field of study

The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage to other multiclass SVM approaches

Crossref

Directory of Open Access Journals

Red de Bibliotecas Virtuales de Ciencias Sociales de América Latina y El Caribe

DIALNET

Sussex Research Online

Repositorio de Objetos de Docencia e Investigación de la Universidad de Cádiz

idUS. Depósito de Investigación Universidad de Sevilla

Inhibition in multiclass classification

Author: Bottou L.
Chang Y.-W.
Charles Elkan
Dempster A. P.
José M. Amigó
Kivinen J.
LeCun Y.
Lugosi G.
Platt J. C.
Platt J. C.
Ramón Huerta
Rifkin R.
Shankar Vembu
Smith B. H.
Tewari A.
Thomas Nowotny
Tsochantaridis I.
Weston J.
Publication venue: 'MIT Press - Journals'
Publication date: 01/09/2012
Field of study

Crossref

PubMed Central

Sussex Research Online

API design for machine learning software: experiences from the scikit-learn project

Author: Blondel Mathieu
Buitinck Lars
Gramfort Alexandre
Grisel Olivier
Grobler Jaques
Holt Brian
Joly Arnaud
Layton Robert
Louppe Gilles
Mueller Andreas
Niculae Vlad
Pedregosa Fabian
Prettenhofer Peter
Vanderplas Jake
Varoquaux Gaël
Publication venue
Publication date: 01/09/2013
Field of study

Scikit-learn is an increasingly popular machine learning li- brary. Written in Python, it is designed to be simple and efficient, accessible to non-experts, and reusable in various contexts. In this paper, we present and discuss our design choices for the application programming interface (API) of the project. In particular, we describe the simple and elegant interface shared by all learning and processing units in the library and then discuss its advantages in terms of composition and reusability. The paper also comments on implementation details specific to the Python ecosystem and analyzes obstacles faced by users and developers of the library

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Federation ResearchOnline

HAL-CEA

Distributed Machine Learning via Sufficient Factor Broadcasting

Author: Ho Qirong
Kim Jin Kyu
Kumar Abhimanu
Xie Pengtao
Xing Eric
Yu Yaoliang
Zhou Yi
Publication venue
Publication date: 07/09/2015
Field of study

Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in machine learning (ML) applications ranging from computer vision to computational biology. When these models are applied to large-scale ML problems starting at millions of samples and tens of thousands of classes, their parameter matrix can grow at an unexpected rate, resulting in high parameter synchronization costs that greatly slow down distributed learning. To address this issue, we propose a Sufficient Factor Broadcasting (SFB) computation model for efficient distributed learning of a large family of matrix-parameterized models, which share the following property: the parameter update computed on each data sample is a rank-1 matrix, i.e., the outer product of two "sufficient factors" (SFs). By broadcasting the SFs among worker machines and reconstructing the update matrices locally at each worker, SFB improves communication efficiency --- communication costs are linear in the parameter matrix's dimensions, rather than quadratic --- without affecting computational correctness. We present a theoretical convergence analysis of SFB, and empirically corroborate its efficiency on four different matrix-parametrized ML models

arXiv.org e-Print Archive

CiteSeerX

Axiomatic Interpretability for Multiclass Additive Models

Author: Caruana Rich
Chajewska Urszula
Koch Paul
Lou Yin
Tan Sarah
Zhang Xuezhou
Publication venue
Publication date: 30/05/2019
Field of study

Generalized additive models (GAMs) are favored in many regression and binary classification problems because they are able to fit complex, nonlinear functions while still remaining interpretable. In the first part of this paper, we generalize a state-of-the-art GAM learning algorithm based on boosted trees to the multiclass setting, and show that this multiclass algorithm outperforms existing GAM learning algorithms and sometimes matches the performance of full complexity models such as gradient boosted trees. In the second part, we turn our attention to the interpretability of GAMs in the multiclass setting. Surprisingly, the natural interpretability of GAMs breaks down when there are more than two classes. Naive interpretation of multiclass GAMs can lead to false conclusions. Inspired by binary GAMs, we identify two axioms that any additive model must satisfy in order to not be visually misleading. We then develop a technique called Additive Post-Processing for Interpretability (API), that provably transforms a pre-trained additive model to satisfy the interpretability axioms without sacrificing accuracy. The technique works not just on models trained with our learning algorithm, but on any multiclass additive model, including multiclass linear and logistic regression. We demonstrate the effectiveness of API on a 12-class infant mortality dataset.Comment: KDD 201

arXiv.org e-Print Archive

Crossref