
    On robustness properties of convex risk minimization methods for pattern recognition

    The paper brings together methods from two disciplines: machine learning theory and robust statistics. Robustness properties of machine learning methods based on convex risk minimization are investigated for the problem of pattern recognition. Assumptions are given for the existence of the influence function of the classifiers and for bounds on the influence function. Kernel logistic regression, support vector machines, least squares and the AdaBoost loss function are treated as special cases. A sensitivity analysis of the support vector machine is given.
    Keywords: AdaBoost loss function, influence function, kernel logistic regression, robustness, sensitivity curve, statistical learning, support vector machine, total variation
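    The sensitivity-curve idea can be illustrated numerically: refit the classifier with one contaminating point added and scale the resulting change in the decision function. The sketch below is not the paper's derivation, just a finite-sample analogue using scikit-learn's SVC on synthetic data; the function name `sensitivity_curve` and all data choices are illustrative assumptions.

```python
# Finite-sample sensitivity curve of an SVM: refit with one contaminating
# point z added, and scale the change in the decision function by n + 1.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Two-class toy data.
X = rng.normal(size=(200, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

def sensitivity_curve(X, y, z, z_label, eval_points):
    """Scaled change of the decision function when (z, z_label) is added;
    a finite-sample analogue of the influence function."""
    n = len(y)
    clf = SVC(kernel="rbf", C=1.0).fit(X, y)
    clf_z = SVC(kernel="rbf", C=1.0).fit(np.vstack([X, z]),
                                         np.append(y, z_label))
    return (n + 1) * (clf_z.decision_function(eval_points)
                      - clf.decision_function(eval_points))

eval_points = rng.normal(size=(50, 2))   # where the curve is evaluated
z = np.array([[4.0, -4.0]])              # an outlying contamination point
sc = sensitivity_curve(X, y, z, 0, eval_points)
print("max |sensitivity|:", np.abs(sc).max())
```

    A bounded sensitivity curve as the contamination point moves far from the data is the empirical signature of the robustness the paper establishes for convex risk minimizers.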

    The Degrees of Freedom of Partial Least Squares Regression

    The derivation of statistical properties for Partial Least Squares regression can be a challenging task. The reason is that the construction of latent components from the predictor variables also depends on the response variable. While this typically leads to good performance and interpretable models in practice, it makes the statistical analysis more involved. In this work, we study the intrinsic complexity of Partial Least Squares Regression. Our contribution is an unbiased estimate of its Degrees of Freedom. It is defined as the trace of the first derivative of the fitted values, seen as a function of the response. We establish two equivalent representations that rely on the close connection of Partial Least Squares to matrix decompositions and Krylov subspace techniques. We show that the Degrees of Freedom depend on the collinearity of the predictor variables: The lower the collinearity is, the higher the Degrees of Freedom are. In particular, they are typically higher than the naive approach that defines the Degrees of Freedom as the number of components. Further, we illustrate how the Degrees of Freedom approach can be used for the comparison of different regression methods. In the experimental section, we show that our Degrees of Freedom estimate in combination with information criteria is useful for model selection.
    Comment: to appear in the Journal of the American Statistical Association
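    The definition of Degrees of Freedom as the trace of the derivative of the fitted values with respect to the response lends itself to a direct numerical check. Below is a minimal finite-difference sketch using scikit-learn's PLSRegression on synthetic data; it is not the paper's unbiased estimator, and `dof_estimate` together with the data-generating choices are assumptions made for illustration.

```python
# Numerical check of the Degrees-of-Freedom definition for PLS:
# DoF = trace(d yhat / d y), estimated here by finite differences in y.
import numpy as np
from sklearn.cross_decomposition import PLSRegression

rng = np.random.default_rng(0)
n, p, m = 60, 10, 3                     # samples, predictors, PLS components

X = rng.normal(size=(n, p))
y = X @ rng.normal(size=p) + 0.5 * rng.normal(size=n)

def pls_fitted(X, y, m):
    return PLSRegression(n_components=m).fit(X, y).predict(X).ravel()

def dof_estimate(X, y, m, eps=1e-5):
    """Trace of the Jacobian of the fitted values w.r.t. the response."""
    base = pls_fitted(X, y, m)
    trace = 0.0
    for i in range(len(y)):
        y_eps = y.copy()
        y_eps[i] += eps                 # perturb one response coordinate
        trace += (pls_fitted(X, y_eps, m)[i] - base[i]) / eps
    return trace

print("estimated DoF:", dof_estimate(X, y, m))
print("naive DoF (number of components):", m)
```

    In line with the abstract, on weakly collinear predictors such as these the trace estimate generally comes out above the naive count of components.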

    Rates of convergence for nearest neighbor estimators with smoother regression functions

    In regression analysis one wants to estimate the regression function from data. In this paper we consider the rate of convergence of the nearest neighbor estimator in the case that the regression function is (p,C)-smooth. It is an open problem whether the optimal rate can be achieved by some nearest neighbor estimator when p lies in (1, 1.5]. We solve the problem affirmatively. This is the main result of this paper. Throughout this paper, we assume that the data are independent and identically distributed, and as an error criterion we use the expected L_2 error.
    Comment: 12 pages, 1 table
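    A Monte Carlo experiment makes the error criterion concrete: simulate data from a (p,C)-smooth regression function, fit a k-NN estimator, and average the empirical L_2 error over repetitions. The sketch below assumes scikit-learn and a rate-matched choice k ≈ n^{2p/(2p+1)}; the target function and constants are illustrative, not taken from the paper.

```python
# Monte Carlo estimate of the expected L2 error of a k-NN regression
# estimate for a (p, C)-smooth regression function, at growing sample sizes.
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(0)
p_smooth = 1.3                              # smoothness in the open range (1, 1.5]
f = lambda x: np.abs(x) ** p_smooth         # a (p, C)-smooth target function

def l2_error(n, k, reps=20):
    errs = []
    for _ in range(reps):
        X = rng.uniform(-1, 1, size=(n, 1))
        y = f(X[:, 0]) + 0.1 * rng.normal(size=n)
        knn = KNeighborsRegressor(n_neighbors=k).fit(X, y)
        X_test = rng.uniform(-1, 1, size=(2000, 1))
        errs.append(np.mean((knn.predict(X_test) - f(X_test[:, 0])) ** 2))
    return np.mean(errs)

for n in (100, 400, 1600):
    # Theoretically motivated neighborhood size: k ~ n^{2p / (2p + 1)}.
    k = max(1, int(n ** (2 * p_smooth / (2 * p_smooth + 1))))
    print(n, l2_error(n, k))
```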

    A Comparative Review of Dimension Reduction Methods in Approximate Bayesian Computation

    Approximate Bayesian computation (ABC) methods make use of comparisons between simulated and observed summary statistics to overcome the problem of computationally intractable likelihood functions. As the practical implementation of ABC requires computations based on vectors of summary statistics, rather than full data sets, a central question is how to derive low-dimensional summary statistics from the observed data with minimal loss of information. In this article we provide a comprehensive review and comparison of the performance of the principal methods of dimension reduction proposed in the ABC literature. The methods are split into three non-mutually-exclusive classes consisting of best subset selection methods, projection techniques and regularization. In addition, we introduce two new methods of dimension reduction. The first is a best subset selection method based on Akaike and Bayesian information criteria, and the second uses ridge regression as a regularization procedure. We illustrate the performance of these dimension reduction techniques through the analysis of three challenging models and data sets.
    Comment: Published at http://dx.doi.org/10.1214/12-STS406 in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
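    A ridge-regression projection of the kind mentioned above can be sketched in the semi-automatic-ABC style: fit a ridge regression of the parameter on summaries from pilot simulations, and use the fitted linear combination as a one-dimensional summary inside rejection ABC. The toy model, tolerance, and function names below are assumptions for illustration, not the article's implementation.

```python
# Projection-style dimension reduction for ABC with ridge regression:
# pilot simulations -> ridge fit -> 1-D summary -> rejection ABC.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)

def simulate(theta, n=50):
    """Toy model: n normal draws; return a redundant 5-D summary vector."""
    x = rng.normal(theta, 1.0, size=n)
    return np.array([x.mean(), np.median(x), x.std(), x.min(), x.max()])

# Pilot simulations to fit the projection from summaries to the parameter.
theta_pilot = rng.uniform(-3, 3, size=500)
S_pilot = np.array([simulate(t) for t in theta_pilot])
proj = Ridge(alpha=1.0).fit(S_pilot, theta_pilot)

# Rejection ABC on the projected (one-dimensional) summary statistic.
s_obs = proj.predict(simulate(1.5).reshape(1, -1))
theta_prop = rng.uniform(-3, 3, size=20000)
s_prop = proj.predict(np.array([simulate(t) for t in theta_prop]))
accept = np.abs(s_prop - s_obs) < 0.05
print("accepted:", accept.sum(), "posterior mean estimate:",
      theta_prop[accept].mean())
```

    The regularization matters when the summary vector is high-dimensional and collinear, as the redundant summaries here are by construction.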

    A Consistent Regularization Approach for Structured Prediction

    We propose and analyze a regularization approach for structured prediction problems. We characterize a large class of loss functions that allows structured outputs to be naturally embedded in a linear space. We exploit this fact to design learning algorithms using a surrogate loss approach and regularization techniques. We prove universal consistency and finite sample bounds characterizing the generalization properties of the proposed methods. Experimental results are provided to demonstrate the practical usefulness of the proposed approach.
    Comment: 39 pages, 2 Tables, 1 Figure
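    One common instantiation of such a surrogate-plus-decoding scheme learns kernel-ridge weights alpha_i(x) and then predicts by minimizing the weighted structured loss over candidate outputs. The sketch below is one such instance on a toy problem with binary-vector outputs and Hamming loss; the kernel, data, and names are illustrative assumptions, not the paper's exact algorithm.

```python
# Surrogate-plus-decoding sketch for structured prediction: kernel ridge
# weights alpha_i(x), then decoding by minimizing the weighted structured
# loss over a finite set of candidate outputs.
import itertools
import numpy as np

rng = np.random.default_rng(0)

def gauss_kernel(A, B, gamma=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def hamming(y, z):
    return np.mean(y != z)               # structured loss on binary vectors

# Toy data: inputs in R^2, outputs are binary vectors of length 3.
n = 100
X = rng.normal(size=(n, 2))
Y = np.stack([X[:, 0] > 0, X[:, 1] > 0, X[:, 0] + X[:, 1] > 0],
             axis=1).astype(int)

lam = 0.1
K = gauss_kernel(X, X)
W = np.linalg.solve(K + n * lam * np.eye(n), np.eye(n))  # (K + n*lam*I)^{-1}

def predict(x):
    alpha = W @ gauss_kernel(X, x[None, :]).ravel()      # weights alpha_i(x)
    candidates = [np.array(c) for c in itertools.product([0, 1], repeat=3)]
    scores = [sum(alpha[i] * hamming(c, Y[i]) for i in range(n))
              for c in candidates]
    return candidates[int(np.argmin(scores))]

print("predicted structure:", predict(np.array([1.0, -0.5])))
```

    Brute-force enumeration of candidates works only for tiny output spaces; in practice the decoding step is where problem-specific combinatorial structure has to be exploited.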