Search CORE

22,609 research outputs found

Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models

Author: Cooper Gregory F.
Naeini Mahdi Pakdaman
Publication venue
Publication date: 16/11/2015
Field of study

Learning accurate probabilistic models from data is crucial in many practical tasks in data mining. In this paper we present a new non-parametric calibration method called \textit{ensemble of near isotonic regression} (ENIR). The method can be considered as an extension of BBQ, a recently proposed calibration method, as well as the commonly used calibration method based on isotonic regression. ENIR is designed to address the key limitation of isotonic regression which is the monotonicity assumption of the predictions. Similar to BBQ, the method post-processes the output of a binary classifier to obtain calibrated probabilities. Thus it can be combined with many existing classification models. We demonstrate the performance of ENIR on synthetic and real datasets for the commonly used binary classification models. Experimental results show that the method outperforms several common binary classifier calibration methods. In particular on the real data, ENIR commonly performs statistically significantly better than the other methods, and never worse. It is able to improve the calibration power of classifiers, while retaining their discrimination power. The method is also computationally tractable for large scale datasets, as it is

O(N \log N)

time, where

N

is the number of samples

arXiv.org e-Print Archive

Crossref

Functional Multi-Layer Perceptron: a Nonlinear Tool for Functional Data Analysis

Author: Abraham
Andrews
Besse
Besse
Besse
Besse
Breiman
Brieuc Conan-Guez
Cardot
Cardot
Chen
Chen
Cristianini
Fabrice Rossi
Ferraty
Ferraty
Ferraty
Ferré
Hastie
Hastie
Hornik
Hornik
James
James
Leshno
Li
Marx
Ramsay
Rudin
Sandberg
Sandberg
Stinchcombe
White
Publication venue: 'Elsevier BV'
Publication date: 01/01/2003
Field of study

In this paper, we study a natural extension of Multi-Layer Perceptrons (MLP) to functional inputs. We show that fundamental results for classical MLP can be extended to functional MLP. We obtain universal approximation results that show the expressive power of functional MLP is comparable to that of numerical MLP. We obtain consistency results which imply that the estimation of optimal parameters for functional MLP is statistically well defined. We finally show on simulated and real world data that the proposed model performs in a very satisfactory way.Comment: http://www.sciencedirect.com/science/journal/0893608

arXiv.org e-Print Archive

Base de publications de l'université Paris-Dauphine

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

A Simple Iterative Algorithm for Parsimonious Binary Kernel Fisher Discrimination

Author: B Chien
B Efron
B Krishnapuram
B Schölkopf
CM Bishop
D Hunter
D Masip
E Andelić
G Baudat
G Rätsch
J Lu
J Yang
J Zhu
K Fukunaga
K Lange
Kitsuchart Pasupa
M Figueiredo
M Last
M Osborne
M. Figueiredo
N Hsieh
R Duda
R Dutter
R Harrison
Robert F. Harrison
S Abe
S Billings
S Keerthi
S Mika
T Hastie
V Roth
Y Park
Y Sun
Y Washizawa
Y Xu
Y Xu
Y Xu
Z Liang
Z Liang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2010
Field of study

By applying recent results in optimization theory variously known as optimization transfer or majorize/minimize algorithms, an algorithm for binary, kernel, Fisher discriminant analysis is introduced that makes use of a non-smooth penalty on the coefficients to provide a parsimonious solution. The problem is converted into a smooth optimization that can be solved iteratively with no greater overhead than iteratively re-weighted least-squares. The result is simple, easily programmed and is shown to perform, in terms of both accuracy and parsimony, as well as or better than a number of leading machine learning algorithms on two well-studied and substantial benchmarks

Southampton (e-Prints Soton)

Crossref

White Rose Research Online

The Practice of Non Parametric Estimation by Solving Inverse Problems: The Example of Transformation Models

Author: Florens Jean-Pierre
Fève Frédérique
Publication venue
Publication date
Field of study

Research Papers in Economics

kLog: A Language for Logical and Relational Learning with Kernels

Author: Altun
Ando
Antanas
Antanas
Antanas
Argyriou
Blockeel
Blockeel
Bottou
Boulicaut
Bröcheler
Ceroni
Chang
Chang
Cook
Costa
Costa
De
De Grave
De Grave
De Raedt
De Raedt
De Raedt
Dietterich
Dietterich
Evgeniou
Fabrizio Costa
Frasconi
Frasconi
Friedman
Gross
Gärtner
Gärtner
Haussler
Heckerman
Helma
Helma
Horváth
Joachims
Kazius
Kersting
Kersting
Kersting
Kimmig
Koller
Kordjamshidi
Kou
Kramer
Kurt De Grave
Lanckriet
Landwehr
Lao
Lari
London
Lowd
Luc De Raedt
Luks
Macskassy
Mahe
McCallum
McKay
Menchetti
Mitchell
Muggleton
Muggleton
Neville
Ng
Paolo Frasconi
Quinlan
Ralaivola
Richardson
Rizzolo
Rossi
Serebrenik
Shervashidze
Shi
Sorlin
Srinivasan
Srinivasan
Sun
Sutton
Taskar
Taskar
Tsochantaridis
van de Waterbeemd
Vazquez
Verbeke
Verbeke
Vishwanathan
Wachman
Wang
Wolpert
Yan
Publication venue: 'Elsevier BV'
Publication date: 28/07/2014
Field of study

We introduce kLog, a novel approach to statistical relational learning. Unlike standard approaches, kLog does not represent a probability distribution directly. It is rather a language to perform kernel-based learning on expressive logical and relational representations. kLog allows users to specify learning problems declaratively. It builds on simple but powerful concepts: learning from interpretations, entity/relationship data modeling, logic programming, and deductive databases. Access by the kernel to the rich representation is mediated by a technique we call graphicalization: the relational representation is first transformed into a graph --- in particular, a grounded entity/relationship diagram. Subsequently, a choice of graph kernel defines the feature space. kLog supports mixed numerical and symbolic data, as well as background knowledge in the form of Prolog or Datalog programs as in inductive logic programming systems. The kLog framework can be applied to tackle the same range of tasks that has made statistical relational learning so popular, including classification, regression, multitask learning, and collective classification. We also report about empirical comparisons, showing that kLog can be either more accurate, or much faster at the same level of accuracy, than Tilde and Alchemy. kLog is GPLv3 licensed and is available at http://klog.dinfo.unifi.it along with tutorials

arXiv.org e-Print Archive

Lirias

Crossref

Discovering Valuable Items from Massive Data

Author: Bubeck S.
Dani V.
Desautels T.
Garnett R.
Kale S.
Krause A.
Kulesza A.
Lawrence N. D.
Lin H.
Nemhauser G.
Rasmussen C. E.
Schölkopf B.
Settles B.
Slivkins A.
Streeter M.
Streeter M.
Tran-Thanh L.
Yue Y.
Publication venue
Publication date: 02/06/2015
Field of study

Suppose there is a large collection of items, each with an associated cost and an inherent utility that is revealed only once we commit to selecting it. Given a budget on the cumulative cost of the selected items, how can we pick a subset of maximal value? This task generalizes several important problems such as multi-arm bandits, active search and the knapsack problem. We present an algorithm, GP-Select, which utilizes prior knowledge about similarity be- tween items, expressed as a kernel function. GP-Select uses Gaussian process prediction to balance exploration (estimating the unknown value of items) and exploitation (selecting items of high value). We extend GP-Select to be able to discover sets that simultaneously have high utility and are diverse. Our preference for diversity can be specified as an arbitrary monotone submodular function that quantifies the diminishing returns obtained when selecting similar items. Furthermore, we exploit the structure of the model updates to achieve an order of magnitude (up to 40X) speedup in our experiments without resorting to approximations. We provide strong guarantees on the performance of GP-Select and apply it to three real-world case studies of industrial relevance: (1) Refreshing a repository of prices in a Global Distribution System for the travel industry, (2) Identifying diverse, binding-affine peptides in a vaccine de- sign task and (3) Maximizing clicks in a web-scale recommender system by recommending items to users

arXiv.org e-Print Archive

CiteSeerX

Repository for Publications and Research Data

Crossref

FFT-Based Fast Computation of Multivariate Kernel Estimators with Unconstrained Bandwidth Matrices

Author: Gramacki Artur
Gramacki Jarosław
Publication venue
Publication date: 07/09/2016
Field of study

The problem of fast computation of multivariate kernel density estimation (KDE) is still an open research problem. In our view, the existing solutions do not resolve this matter in a satisfactory way. One of the most elegant and efficient approach utilizes the fast Fourier transform. Unfortunately, the existing FFT-based solution suffers from a serious limitation, as it can accurately operate only with the constrained (i.e., diagonal) multivariate bandwidth matrices. In this paper we describe the problem and give a satisfactory solution. The proposed solution may be successfully used also in other research problems, for example for the fast computation of the optimal bandwidth for KDE.Comment: 10 pages, 1 figure, R source code

arXiv.org e-Print Archive

FigShare