    On surrogate loss functions and f-divergences

    The goal of binary classification is to estimate a discriminant function γ from observations of covariate vectors and corresponding binary labels. We consider an elaboration of this problem in which the covariates are not available directly but are transformed by a dimensionality-reducing quantizer Q. We present conditions on loss functions such that empirical risk minimization yields Bayes consistency when both the discriminant function and the quantizer are estimated. These conditions are stated in terms of a general correspondence between loss functions and a class of functionals known as Ali-Silvey or f-divergence functionals. Whereas this correspondence was established by Blackwell [Proc. 2nd Berkeley Symp. Probab. Statist. 1 (1951) 93–102. Univ. California Press, Berkeley] for the 0–1 loss, we extend the correspondence to the broader class of surrogate loss functions that play a key role in the general theory of Bayes consistency for binary classification. Our result makes it possible to pick out the (strict) subset of surrogate loss functions that yield Bayes consistency for joint estimation of the discriminant function and the quantizer.
    Comment: Published at http://dx.doi.org/10.1214/08-AOS595 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
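    As a rough illustration of the correspondence described above (the notation is ours, not quoted from the paper): for a convex function f with f(1) = 0, the f-divergence between measures μ and π, and the link to the optimal surrogate risk at a fixed quantizer Q, can be sketched as

        % f-divergence of mu with respect to pi, for convex f with f(1) = 0
        \[
          D_f(\mu, \pi) \;=\; \int f\!\left(\frac{d\mu}{d\pi}\right) d\pi
        \]
        % The correspondence: for a suitable surrogate loss \phi, the optimal
        % \phi-risk over discriminant functions \gamma, with the quantizer Q
        % held fixed, equals (up to an affine rescaling) a negative
        % f-divergence between the class-conditional measures \mu_Q and \pi_Q
        % that Q induces:
        \[
          \inf_{\gamma} R_\phi(\gamma, Q) \;=\; -\, D_f(\mu_Q, \pi_Q)
        \]

    Well-known special cases from this literature (stated here from memory, not from the abstract): the hinge loss induces the variational (total variation) divergence, and the exponential loss the Hellinger distance.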

    Learning Probability Measures with respect to Optimal Transport Metrics

    We study the problem of estimating, in the sense of optimal transport metrics, a measure which is assumed supported on a manifold embedded in a Hilbert space. By establishing a precise connection between optimal transport metrics, optimal quantization, and learning theory, we derive new probabilistic bounds for the performance of a classic algorithm in unsupervised learning (k-means), when used to produce a probability measure derived from the data. In the course of the analysis, we arrive at new lower bounds, as well as probabilistic upper bounds on the convergence rate of the empirical law of large numbers, which, unlike existing bounds, are applicable to a wide class of measures.
    Comment: 13 pages, 2 figures. Advances in Neural Information Processing Systems, NIPS 2012
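    The k-means-as-measure-estimation viewpoint can be made concrete with a small sketch (ours, not the paper's code): a one-dimensional toy case, with SciPy's univariate Wasserstein distance standing in for the general optimal transport metric, and the codebook size k chosen arbitrarily.

        # Quantize an empirical measure with k-means and evaluate the fit in W1.
        # Illustrative only: 1-D data, plain Lloyd iterations, arbitrary k.
        import numpy as np
        from scipy.stats import wasserstein_distance

        rng = np.random.default_rng(0)
        x = rng.normal(size=2000)   # sample from the (here, known) measure
        k = 16                      # codebook size

        # Lloyd's algorithm in one dimension.
        centers = rng.choice(x, size=k, replace=False)
        for _ in range(50):
            labels = np.argmin(np.abs(x[:, None] - centers[None, :]), axis=1)
            for j in range(k):
                if np.any(labels == j):
                    centers[j] = x[labels == j].mean()

        # The quantized measure: atoms at the centers, weighted by cluster size.
        weights = np.bincount(labels, minlength=k) / len(x)

        # 1-Wasserstein distance between the empirical measure and its quantization.
        w1 = wasserstein_distance(x, centers, v_weights=weights)
        print(f"W1(empirical, k-means quantization) = {w1:.4f}")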

    Quadratic optimal functional quantization of stochastic processes and numerical applications

    In this paper, we present an overview of recent developments in the functional quantization of stochastic processes, with an emphasis on the quadratic case. Functional quantization is a way to approximate a process, viewed as a Hilbert-valued random variable, using a nearest-neighbour projection onto a finite codebook. Special emphasis is placed on computational aspects and numerical applications, in particular the pricing of some path-dependent European options.
    Comment: 41 pages
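    A rough numerical sketch in the spirit of the abstract (ours, with every parameter invented): Brownian motion as the Hilbert-valued random variable, a codebook built by k-means on discretized paths as a practical stand-in for an optimal quadratic quantizer, and an Asian call priced as a weighted payoff average over the codebook.

        # Functional quantization sketch: quantize Brownian paths, then use the
        # codebook as a cubature formula for a path-dependent (Asian) option.
        import numpy as np

        rng = np.random.default_rng(1)
        n_paths, n_steps, T = 20000, 64, 1.0
        dt = T / n_steps

        # Simulate Brownian paths W (rows = paths, columns = time grid).
        W = np.cumsum(rng.normal(scale=np.sqrt(dt), size=(n_paths, n_steps)), axis=1)

        # Codebook of size N via Lloyd's algorithm under the discretized L2 norm
        # (the constant dt factor does not affect the nearest-neighbour cells).
        N = 32
        codebook = W[rng.choice(n_paths, size=N, replace=False)].copy()
        for _ in range(30):
            d2 = ((W**2).sum(axis=1)[:, None]
                  - 2.0 * W @ codebook.T
                  + (codebook**2).sum(axis=1)[None, :])
            cell = np.argmin(d2, axis=1)          # nearest-neighbour projection
            for j in range(N):
                if np.any(cell == j):
                    codebook[j] = W[cell == j].mean(axis=0)
        weights = np.bincount(cell, minlength=N) / n_paths

        # Asian call on S_t = S0 * exp(sigma*W_t + (r - sigma^2/2)*t), priced as
        # a weighted average of payoffs along the N codebook paths.
        S0, K, r, sigma = 100.0, 100.0, 0.05, 0.2
        t = dt * np.arange(1, n_steps + 1)
        S = S0 * np.exp(sigma * codebook + (r - 0.5 * sigma**2) * t)
        payoff = np.maximum(S.mean(axis=1) - K, 0.0)
        price = np.exp(-r * T) * (weights * payoff).sum()
        print(f"quantized Asian call price ~ {price:.3f}")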