1,487 research outputs found
The existence and asymptotic properties of a backfitting projection algorithm under weak conditions.
We derive the asymptotic distribution of a new backfitting procedure for estimating the closest additive approximation to a nonparametric regression function. The procedure employs a recent projection interpretation of popular kernel estimators provided by Mammen, Marron, Turlach and Wand and the asymptotic theory of our estimators is derived using the theory of additive projections reviewed in Bickel, Klaassen, Ritov and Wellner. Our procedure achieves the same bias and variance as the oracle estimator based on knowing the other components, and in this sense improves on the method analyzed in Opsomer and Ruppert. We provide ‘‘high level’’ conditions independent of the sampling scheme. We then verify that these conditions are satisfied in a regression and a time series autoregression under weak conditions.
A comparison of in-sample forecasting methods
In-sample forecasting is a recent continuous modification of well-known forecasting methods based on aggregated data. These aggregated methods are known as age-cohort methods in demography, economics, epidemiology and sociology and as chain ladder in non-life insurance. Data is organized in a two-way table with age and cohort as indices, but without measures of exposure. It has recently been established that such structured forecasting methods based on aggregated data can be interpreted as structured histogram estimators. Continuous in-sample forecasting transfers these classical forecasting models into a modern statistical world including smoothing methodology that is more efficient than smoothing via histograms. All in-sample forecasting estimators are collected and their performance is compared via a finite sample simulation study. All methods are extended via multiplicative bias correction. Asymptotic theory is being developed for the histogram-type method of sieves and for the multiplicatively corrected estimators. The multiplicative bias corrected estimators improve all other known in-sample forecasters in the simulation study. The density projection approach seems to have the best performance with forecasting based on survival densities being the runner-up
The Existence and Asymptotic Properties of a Backfitting Projection Algorithm Under Weak Conditions
We derive the asymptotic distribution of a new backfitting procedure for estimating the closest additive approximation to a nonparametric regression function. The procedure employs a recent projection interpretation of popular kernel estimators provided by Mammen et al. (1997), and the asymptotic theory of our estimators is derived using the theory of additive projections reviewed in Bickel et al. (1995). Our procedure achieves the same bias and variance as the oracle estimator based on knowing the other components, and in this sense improves on the method analyzed in Opsomer and Ruppert (1997). We provide 'high level' conditions independent of the sampling scheme. We then verify that these conditions are satisfied in a time series autoregression under weak conditions.Additive models, alternating projections, backfitting, kernel smoothing, local polynomials, nonparametric regression
Generalised additive dependency inflated models including aggregated covariates
Let us assume that X, Y and U are observed and that the conditional mean of U given X and Y can be expressed via an additive dependency of X, λ(X)Y and X + Y for some unspecified function . This structured regression model can be transferred to a hazard model or a density model when applied on some appropriate grid, and has important forecasting applications via structured marker dependent hazards models or structured density models including age-period-cohort relationships. The structured regression model is also important when the severity of the dependent variable has a complicated dependency on waiting times X, Y and the total waiting time X+Y . In case the conditional mean of U approximates a density, the regression model can be used to analyse the age-period-cohort model, also when exposure data are not available. In case the conditional mean of U approximates a marker dependent hazard, the regression model introduces new relevant age-period-cohort time scale interdependencies in understanding longevity. A direct use of the regression relationship introduced in this paper is the estimation of the severity of outstanding liabilities in non-life insurance companies. The technical approach taken is to use B-splines to capture the underlying one-dimensional unspecified functions. It is shown via finite sample simulation studies and an application for forecasting future asbestos related deaths in the UK that the B-spline approach works well in practice. Special consideration has been given to ensure identifiability of all models considered
Estimating Yield Curves by Kernel Smoothing Methods
We introduce a new method for the estimation of discount functions, yield curves and forward curves for coupon bonds. Our approach is nonparametric and does not assume a particular functional form for the discount function although we do show how to impose various important restrictions in the estimation. Our method is based on kernel smoothing and is defined as the minimum of some localized population moment condition. The solution to the sample problem is not explicit and our estimation procedure is iterative, rather like the backfitting method of estimating additive nonparametric models. We establish the asymptotic normality of our methods using the asymptotic representation of our estimator as an infinite series with declining coefficients. The rate of convergence is standard for one dimensional nonparametric regression.Coupon bonds; forward curve; Hilbert space; local linear; nonparametric regression; yield curve
Generalized linear time series regression
We consider a cross-section model that contains an individual component, a deterministic time trend and an unobserved latent common time series component. We show the following oracle property: the parameters of the latent time series and the parameters of the deterministic time trend can be estimated with the same asymptotic accuracy as if the parameters of the individual component were known. We consider this model in two settings: least squares fits of linear specifications of the individual component and the parameters of the deterministic time trend and, more generally, quasilikelihood estimation in a generalized linear time series model
Testing Parametric versus Semiparametric Modelling in Generalized Linear Models
We consider a generalized partially linear model E(Y|X,T) = G{X'b + m(T)} where G is a known function, b is an unknown parameter vector, and m is an unknown function.The paper introduces a test statistic which allows to decide between a parametric and a semiparametric model: (i) m is linear, i.e. m(t) = t'g for a parameter vector g, (ii) m is a smooth (nonlinear) function.Under linearity (i) it is shown that the test statistic is asymptotically normal. Moreover, for the case of binary responses, it is proved that the bootstrap works asymptotically.Simulations suggest that (in small samples) bootstrap outperforms the calculation of critical values from the normal approximation.The practical performance of the test is shown in applications to data on East--West German migration and credit scoring.linear models
Asymptotics for In-Sample Density Forecasting
This paper generalizes recent proposals of density forecasting models and it develops theory for this class of models. In density forecasting the density of observations is estimated in regions where the density is not observed. Identification of the density in such regions is guaranteed by structural assumptions on the density that allows exact extrapolation. In this paper the structural assumption is made that the density is a product of one-dimensional functions. The theory is quite general in assuming the shape of the region where the density is observed. Such models naturally arise when the time point of an observation can be written as the sum of two terms (e.g. onset and incubation period of a disease). The developed theory also allows for a multiplicative factor of seasonal effects. Seasonal effects are present in many actuarial, biostatistical, econometric and statistical studies. Smoothing estimators are proposed that are based on backfitting. Full asymptotic theory is derived for them. A practical example from the insurance business is given producing a within year budget of reported insurance claims. A small sample study supports the theoretical results
In-Sample Forecasting Applied to Reserving and Mesothelioma Mortality
This paper shows that recent published mortality projections with unobserved exposure can be understood as structured density estimation. The structured density is only observed on a sub-sample corresponding to historical calendar time. The mortality forecast is obtained by extrapolating the structured density to future calendar times using that the components of the density are identified within sample. The new method is illustrated on the important practical problem of forecasting mesothelioma for the UK population. Full asymptotic theory is provided. The theory is given in such generality that it also introduces mathematical statistical theory for the recent continuous chain ladder model. This allows a modern approach to classical reserving techniques used every day in any non-life insurance company around the globe. Applications to mortality data and non-life insurance data are provided along with relevant small sample simulation studies
PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers
The aim of this paper is to generalize the PAC-Bayesian theorems proved by
Catoni in the classification setting to more general problems of statistical
inference. We show how to control the deviations of the risk of randomized
estimators. A particular attention is paid to randomized estimators drawn in a
small neighborhood of classical estimators, whose study leads to control the
risk of the latter. These results allow to bound the risk of very general
estimation procedures, as well as to perform model selection
- …
