54 research outputs found
New -estimators in semi-parametric regression with errors in variables
In the regression model with errors in variables, we observe i.i.d.
copies of satisfying and
involving independent and unobserved random variables plus a
regression function , known up to a finite dimensional
. The common densities of the 's and of the 's are
unknown, whereas the distribution of is completely known. We aim at
estimating the parameter by using the observations
. We propose an estimation procedure based on the
least square criterion \tilde{S}_{\theta^0,g}(\theta)=\m
athbb{E}_{\theta^0,g}[((Y-f_{\theta}(X))^2w(X)] where is a weight function
to be chosen. We propose an estimator and derive an upper bound for its risk
that depends on the smoothness of the errors density and on the
smoothness properties of . Furthermore, we give sufficient
conditions that ensure that the parametric rate of convergence is achieved. We
provide practical recipes for the choice of in the case of nonlinear
regression functions which are smooth on pieces allowing to gain in the order
of the rate of convergence, up to the parametric rate in some cases. We also
consider extensions of the estimation procedure, in particular, when a choice
of depending on would be more appropriate.Comment: Published in at http://dx.doi.org/10.1214/07-AIHP107 the Annales de
l'Institut Henri Poincar\'e - Probabilit\'es et Statistiques
(http://www.imstat.org/aihp/) by the Institute of Mathematical Statistics
(http://www.imstat.org
Semi-parametric estimation of the hazard function in a model with covariate measurement error
We consider a model where the failure hazard function, conditional on a
covariate is given by ,
with . The baseline
hazard function and relative risk belong both
to parametric families. The covariate is measured through the error model
where is independent from , with known density
. We observe a -sample , , where
is the minimum between the failure time and the censoring time, and
is the censoring indicator. We aim at estimating in presence
of the unknown density . Our estimation procedure based on least squares
criterion provide two estimators. The first one minimizes an estimation of the
least squares criterion where is estimated by density deconvolution. Its
rate depends on the smoothnesses of and as a
function of ,. We derive sufficient conditions that ensure the
-consistency. The second estimator is constructed under conditions
ensuring that the least squares criterion can be directly estimated with the
parametric rate. These estimators, deeply studied through examples are in
particular -consistent and asymptotically Gaussian in the Cox model
and in the excess risk model, whatever is
Adaptive density deconvolution with dependent inputs
In the convolution model , we give a model selection
procedure to estimate the density of the unobserved variables , when the sequence is strictly stationary but
not necessarily independent. This procedure depends on wether the density of
is super smooth or ordinary smooth. The rates of convergence of
the penalized contrast estimators are the same as in the independent framework,
and are minimax over most classes of regularity on . Our results
apply to mixing sequences, but also to many other dependent sequences. When the
errors are super smooth, the condition on the dependence coefficients is the
minimal condition of that type ensuring that the sequence
is not a long-memory process
Adaptive density estimation for general ARCH models
We consider a model in which is not
independent of the noise process , but is independent of
for each . We assume that is stationary and we
propose an adaptive estimator of the density of based on the
observations . Under various dependence structures, the rates of this
nonparametric estimator coincide with the minimax rates obtained in the i.i.d.
case when and are independent, in all cases where
these minimax rates are known. The results apply to various linear and non
linear ARCH processes
Adaptive kernel estimation of the baseline function in the Cox model, with high-dimensional covariates
The aim of this article is to propose a novel kernel estimator of the
baseline function in a general high-dimensional Cox model, for which we derive
non-asymptotic rates of convergence. To construct our estimator, we first
estimate the regression parameter in the Cox model via a Lasso procedure. We
then plug this estimator into the classical kernel estimator of the baseline
function, obtained by smoothing the so-called Breslow estimator of the
cumulative baseline function. We propose and study an adaptive procedure for
selecting the bandwidth, in the spirit of Gold-enshluger and Lepski (2011). We
state non-asymptotic oracle inequalities for the final estimator, which reveal
the reduction of the rates of convergence when the dimension of the covariates
grows
Penalized contrast estimator for adaptive density deconvolution
The authors consider the problem of estimating the density of independent
and identically distributed variables , from a sample
where , , is a noise
independent of , with having known distribution. They
present a model selection procedure allowing to construct an adaptive estimator
of and to find non-asymptotic bounds for its
-risk. The estimator achieves the minimax rate of
convergence, in most cases where lowers bounds are available. A simulation
study gives an illustration of the good practical performances of the method
Model selection in logistic regression
This paper is devoted to model selection in logistic regression. We extend
the model selection principle introduced by Birg\'e and Massart (2001) to
logistic regression model. This selection is done by using penalized maximum
likelihood criteria. We propose in this context a completely data-driven
criteria based on the slope heuristics. We prove non asymptotic oracle
inequalities for selected estimators. Theoretical results are illustrated
through simulation studies
Estimation of the hazard function in a semiparametric model with covariate measurement error
International audienceWe consider a failure hazard function, conditional on a time-independent covariate , given by . The baseline hazard function and the relative risk both belong to parametric families with . The covariate has an unknown density and is measured with an error through an additive error model where is a random variable, independent from , with known density . We observe a -sample , = 1, ..., , where is the minimum between the failure time and the censoring time, and is the censoring indicator. Using least square criterion and deconvolution methods, we propose a consistent estimator of using the observations , = 1, ..., .
We give an upper bound for its risk which depends on the smoothness properties of and as a function of , and we derive sufficient conditions for the -consistency. We give detailed examples considering various type of relative risks and various types of error density . In particular, in the Cox model and in the excess risk model, the estimator of is -consistent and asymptotically Gaussian regardless of the form of
- …