Skip to main content
Article thumbnail
Location of Repository

Variable selection for the single index model

By Efang Kong and Yingcun Xia

Abstract

We consider variable selection in the single-index model. We prove that the popular leave-m-out crossvalidation method has different behaviour in the single-index model from that in linear regression models or nonparametric regression models. A new consistent variable selection method, called separated crossvalidation, is proposed. Further analysis suggests that the method has better finite-sample performance and is computationally easier than leave-m-out crossvalidation. Separated crossvalidation, applied to the Swiss banknotes data and the ozone concentration data, leads to single-index models with selected variables that have better prediction capability than models based on all the covariates

Topics: QA276
Year: 2007
OAI identifier: oai:kar.kent.ac.uk:23951

Suggested articles

Citations

  1. (1974). A new look at statistical model identification. doi
  2. (2002). An adaptive estimation of dimension reduction space (with Discussion). doi
  3. (1993). An efficient semiparametric estimator for binary response models. doi
  4. (2003). Applied Multivariate Statistical Analysis.
  5. (1974). Cross-Validation choice and assessment of statistical prediction (with Discussion).
  6. (1986). Density Estimation for Statistics and Data Analysis. doi
  7. (2001). Direct estimation of the index coefficients in a single-index model.
  8. (1996). Direct semiparametric estimation of single-index models with discrete covariates. doi
  9. (1958). Estimation of relationships for limited dependent variables. doi
  10. (1996). Finite sample variance of local polynomials: analysis and solutions. doi
  11. (1989). Investigating smooth multiple regression by method of average derivatives.
  12. (1993). Linear model selection by cross-validation. doi
  13. (1996). Local Polynomial Modeling and its Applications.
  14. (1993). Model selection via multifold cross validation. doi
  15. (1996). Multivariate local polynomial regression for time series: uniform strong consistency and rates. doi
  16. (1994). Multivariate locally weighted least-squared regression. doi
  17. (1994). Nonparametric identification of nonlinear time series: selecting significant lags. doi
  18. (1992). On consistent nonparametric order determination and chaos (with Discussion).
  19. (2006). On efficiencies of estimations for single index models.
  20. (1994). On subset selection in nonparametric stochastic regression.
  21. (1993). Optimal smoothing in single-index models.
  22. (1989). Semiparametric estimation of index coefficients. doi
  23. (2004). Semiparametric nonlinear time series model selection.
  24. (2001). Single-index model selections. doi
  25. (2002). Subset Selection in Regression. London: doi
  26. (2005). The bootstrap and the Edgeworth correction for semiparametric averaged derivatives. doi
  27. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.