Search CORE

21,878 research outputs found

On boosting kernel regression

Author: Breiman
Bühlmann
Bühlmann
Charles C. Taylor
Chaudhuri
Di Marzio
Doksum
Fan
Freund
Friedman
Friedman
Harrison
Hastie
Härdle
Jiang
Jones
Lax
Lugosi
Marco Di Marzio
Müller
Rice
Schapire
Stuetzle
Tukey
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

In this paper we propose a simple multistep regression smoother which is constructed in an iterative manner, by learning the Nadaraya-Watson estimator with L-2 boosting. We find, in both theoretical analysis and simulation experiments, that the bias converges exponentially fast. and the variance diverges exponentially slow. The first boosting step is analysed in more detail, giving asymptotic expressions as functions of the smoothing parameter, and relationships with previous work are explored. Practical performance is illustrated by both simulated and real data

CiteSeerX

Crossref

White Rose Research Online

Marginal integration for nonparametric causal inference

Author: Bühlmann Peter
Ernest Jan
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2015
Field of study

We consider the problem of inferring the total causal effect of a single variable intervention on a (response) variable of interest. We propose a certain marginal integration regression technique for a very general class of potentially nonlinear structural equation models (SEMs) with known structure, or at least known superset of adjustment variables: we call the procedure S-mint regression. We easily derive that it achieves the convergence rate as for nonparametric regression: for example, single variable intervention effects can be estimated with convergence rate

n^{-2/5}

assuming smoothness with twice differentiable functions. Our result can also be seen as a major robustness property with respect to model misspecification which goes much beyond the notion of double robustness. Furthermore, when the structure of the SEM is not known, we can estimate (the equivalence class of) the directed acyclic graph corresponding to the SEM, and then proceed by using S-mint based on these estimates. We empirically compare the S-mint regression method with more classical approaches and argue that the former is indeed more robust, more reliable and substantially simpler.Comment: 40 pages, 14 figure

arXiv.org e-Print Archive

Crossref

Conditional Transformation Models

Author: Bühlmann
Bühlmann
Bühlmann
Bühlmann
Chen
Chen
Cheng
Cheng
Currie
Dette
Doksum
Eilers
Fenske
Friedman
Gilchrist
Gneiting
Gneiting
Gneiting
Hall
Hall
Hayfield
He
Hofner
Hothorn
Koenker
Koenker
Koenker
Kriegler
Li
Lu
Lu
Mayr
Ridgeway
Rigby
Schemper
Schild
Schmid
Schnabel
Sexton
Shen
Tutz
Tutz
van de Geer
Wu
Zeng
Zheng
Publication venue: 'Wiley'
Publication date: 28/11/2012
Field of study

The ultimate goal of regression analysis is to obtain information about the conditional distribution of a response given a set of explanatory variables. This goal is, however, seldom achieved because most established regression models only estimate the conditional mean as a function of the explanatory variables and assume that higher moments are not affected by the regressors. The underlying reason for such a restriction is the assumption of additivity of signal and noise. We propose to relax this common assumption in the framework of transformation models. The novel class of semiparametric regression models proposed herein allows transformation functions to depend on explanatory variables. These transformation functions are estimated by regularised optimisation of scoring rules for probabilistic forecasts, e.g. the continuous ranked probability score. The corresponding estimated conditional distribution functions are consistent. Conditional transformation models are potentially useful for describing possible heteroscedasticity, comparing spatially varying distributions, identifying extreme events, deriving prediction intervals and selecting variables beyond mean regression effects. An empirical investigation based on a heteroscedastic varying coefficient simulation model demonstrates that semiparametric estimation of conditional distribution functions can be more beneficial than kernel-based non-parametric approaches or parametric generalised additive models for location, scale and shape

arXiv.org e-Print Archive

Crossref

ZORA

Sparse support vector regression based on orthogonal forward selection for the generalised kernel model

Author: Chen S.
Harris C.J.
Lowe D.
Wang X.X.
Publication venue
Publication date: 01/12/2006
Field of study

Southampton (e-Prints Soton)

NARX-based nonlinear system identification using orthogonal least squares basis hunting

Author: Chen Sheng
Harris Chris J.
Wang X.X.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

An orthogonal least squares technique for basis hunting (OLS-BH) is proposed to construct sparse radial basis function (RBF) models for NARX-type nonlinear systems. Unlike most of the existing RBF or kernel modelling methods, whichplaces the RBF or kernel centers at the training input data points and use a fixed common variance for all the regressors, the proposed OLS-BH technique tunes the RBF center and diagonal covariance matrix of individual regressor by minimizing the training mean square error. An efficient optimization method isadopted for this basis hunting to select regressors in an orthogonal forward selection procedure. Experimental results obtained using this OLS-BH technique demonstrate that it offers a state-of-the-art method for constructing parsimonious RBF models with excellent generalization performance

Southampton (e-Prints Soton)

Crossref