
    Explaining Support Vector Machines: A Color Based Nomogram.

    PROBLEM SETTING: Support vector machines (SVMs) are very popular tools for classification, regression and other problems. Thanks to the wide choice of available kernels, they can be applied to many kinds of data, and they owe their popularity to the good performance of the resulting models. However, interpreting the models is far from obvious, especially when non-linear kernels are used, so the methods are often applied as black boxes. As a consequence, SVMs are less accepted in areas where interpretability is important and where people are held responsible for the decisions made by models. OBJECTIVE: In this work, we investigate whether SVMs using linear, polynomial and RBF kernels can be explained such that interpretations for model-based decisions can be provided. We further indicate when SVMs can be explained and in which situations interpretation of SVMs is (hitherto) not possible. Here, explainability is defined as the ability to express the final decision as a sum of contributions, each depending on a single input variable or at most two. RESULTS: Our experiments on simulated and real-life data show that the explainability of an SVM depends on the chosen parameter values (degree of the polynomial kernel, width of the RBF kernel, and regularization constant). When several combinations of parameter values yield the same cross-validation performance, combinations with a lower polynomial degree or a larger kernel width have a higher chance of being explainable. CONCLUSIONS: This work summarizes SVM classifiers obtained with linear, polynomial and RBF kernels in a single plot. Linear and polynomial kernels up to the second degree are represented exactly; for other kernels, an indication of the reliability of the approximation is presented. The complete methodology is available as an R package, and two apps and a movie are provided to illustrate the possibilities offered by the method.
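
    Since the abstract defines explainability as a decomposition of the decision value into contributions of single input variables, the linear kernel is the simplest case where that decomposition holds exactly. The sketch below (plain scikit-learn in Python, not the authors' R package or nomogram; the dataset and parameter values are illustrative assumptions) verifies the property f(x) = b + Σ_j w_j x_j on a toy classifier.

```python
# Minimal sketch, assuming a toy dataset: for a linear-kernel SVM the
# decision value decomposes exactly into one additive contribution per
# input variable -- the explainability property described above.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=4, random_state=0)
svm = SVC(kernel="linear", C=1.0).fit(X, y)

w = svm.coef_.ravel()                  # one weight per input variable
x = X[0]                               # the instance to explain
contributions = w * x                  # additive per-variable contributions

# The contributions plus the intercept reproduce the decision value.
assert np.isclose(contributions.sum() + svm.intercept_[0],
                  svm.decision_function(X[:1])[0])
print({f"x{j}": round(c, 3) for j, c in enumerate(contributions)})
```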

    A novel Boolean kernels family for categorical data

    Kernel-based classifiers, such as SVMs, are considered state-of-the-art algorithms and are widely used for many classification tasks. However, such methods are hard to interpret, and for this reason they are often treated as black-box models. In this paper, we propose a new family of Boolean kernels for categorical data in which features correspond to propositional formulas over the input variables. The idea is to create human-readable features that ease the extraction of interpretation rules directly from the embedding space. Experiments on artificial and benchmark datasets show the effectiveness of the proposed family of kernels with respect to established ones, such as the RBF kernel, in terms of classification accuracy.
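
    As a hedged illustration of how a Boolean kernel makes features human-readable, the sketch below implements one plausible member of such a family, a monotone conjunctive kernel: for one-hot encoded records x, z ∈ {0,1}^n, the value C(⟨x, z⟩, d) counts the conjunctions of d positive literals satisfied by both inputs, so each implicit feature is an AND of attribute values. The kernel choice, function name, and data are illustrative assumptions, not the paper's exact definitions.

```python
# Illustrative sketch (assumed, not the paper's exact API): a monotone
# conjunctive Boolean kernel.  For binary vectors x_i, z_j, the entry
# K[i, j] = C(<x_i, z_j>, d) counts the d-literal conjunctions (ANDs of
# active input values) that are true for both records, so every implicit
# feature corresponds to a readable propositional formula.
import numpy as np
from scipy.special import binom

def conjunctive_kernel(X, Z, d=2):
    """Gram matrix of the degree-d monotone conjunctive kernel."""
    return binom(X @ Z.T, d)

# One-hot encodings of two records over three categorical attributes.
X = np.array([[1, 0, 0, 1, 0, 1],
              [0, 1, 0, 1, 0, 1]], dtype=float)
print(conjunctive_kernel(X, X))   # diagonal C(3,2)=3, off-diagonal C(2,2)=1
```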

    Graph Signal Representation with Wasserstein Barycenters

    In many applications, signals reside on the vertices of weighted graphs, so there is a need to learn low-dimensional representations of graph signals that allow for data analysis and interpretation. Existing unsupervised dimensionality reduction methods for graph signals have focused on dictionary learning. In these works, the graph is taken into account by imposing a structure or a parametrization on the dictionary, and the signals are represented as linear combinations of its atoms. However, the assumption that graph signals can be represented as linear combinations of atoms is not always appropriate. In this paper, we propose a novel representation framework based on non-linear, geometry-aware combinations of graph signals that leverages the mathematical theory of Optimal Transport. We represent graph signals as Wasserstein barycenters and demonstrate through our experiments the potential of the proposed framework for low-dimensional graph signal representation.
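
    A minimal sketch of the representation primitive, assuming entropic regularization and the iterative Bregman projection scheme of Benamou et al. (2015): a Wasserstein barycenter replaces the linear combination of atoms with a geometry-aware one. How the paper learns the atoms and weights is not reproduced here, and the toy graph and parameter values below are illustrative assumptions.

```python
# Hedged sketch: entropic Wasserstein barycenter of histograms supported
# on graph vertices, via iterative Bregman projections (Benamou et al.,
# 2015).  All names and values are illustrative, not the paper's code.
import numpy as np

def sinkhorn_barycenter(P, M, weights, reg=0.5, n_iter=500):
    """P: (d, n) histograms as columns; M: (d, d) ground cost between
    vertices; weights: (n,) barycentric weights summing to one."""
    K = np.exp(-M / reg)                     # Gibbs kernel
    V = np.ones_like(P)
    for _ in range(n_iter):
        U = P / (K @ V)                      # match each input marginal
        KtU = K.T @ U
        b = np.prod(KtU ** weights, axis=1)  # geometric mean = barycenter
        V = b[:, None] / KtU                 # match the shared barycenter
    return b / b.sum()                       # guard against numerical drift

# Toy usage: a 5-vertex path graph with squared shortest-path costs, and
# two Dirac signals at opposite ends.  Their 50/50 barycenter concentrates
# mass near the middle vertex, whereas a linear average would simply keep
# the two spikes -- the distinction motivating the framework above.
idx = np.arange(5)
M = (idx[:, None] - idx[None, :]) ** 2.0
P = np.zeros((5, 2))
P[0, 0] = P[-1, 1] = 1.0
print(sinkhorn_barycenter(P, M, weights=np.array([0.5, 0.5])))
```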