research

New MM-estimators in semi-parametric regression with errors in variables

Abstract

In the regression model with errors in variables, we observe nn i.i.d. copies of (Y,Z)(Y,Z) satisfying Y=fθ0(X)+ξY=f_{\theta^0}(X)+\xi and Z=X+ϵZ=X+\epsilon involving independent and unobserved random variables X,ξ,ϵX,\xi,\epsilon plus a regression function fθ0f_{\theta^0}, known up to a finite dimensional θ0\theta^0. The common densities of the XiX_i's and of the ξi\xi_i's are unknown, whereas the distribution of ϵ\epsilon is completely known. We aim at estimating the parameter θ0\theta^0 by using the observations (Y1,Z1),...,(Yn,Zn)(Y_1,Z_1),...,(Y_n,Z_n). We propose an estimation procedure based on the least square criterion \tilde{S}_{\theta^0,g}(\theta)=\m athbb{E}_{\theta^0,g}[((Y-f_{\theta}(X))^2w(X)] where ww is a weight function to be chosen. We propose an estimator and derive an upper bound for its risk that depends on the smoothness of the errors density pϵp_{\epsilon} and on the smoothness properties of w(x)fθ(x)w(x)f_{\theta}(x). Furthermore, we give sufficient conditions that ensure that the parametric rate of convergence is achieved. We provide practical recipes for the choice of ww in the case of nonlinear regression functions which are smooth on pieces allowing to gain in the order of the rate of convergence, up to the parametric rate in some cases. We also consider extensions of the estimation procedure, in particular, when a choice of wθw_{\theta} depending on θ\theta would be more appropriate.Comment: Published in at http://dx.doi.org/10.1214/07-AIHP107 the Annales de l'Institut Henri Poincar\'e - Probabilit\'es et Statistiques (http://www.imstat.org/aihp/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Similar works