Optimal Estimation and Prediction for Dense Signals in High-Dimensional Linear Models
Estimation and prediction problems for dense signals are often framed in
terms of minimax problems over highly symmetric parameter spaces. In this
paper, we study minimax problems over l2-balls for high-dimensional linear
models with Gaussian predictors. We obtain sharp asymptotics for the minimax
risk that are applicable in any asymptotic setting where the number of
predictors diverges and prove that ridge regression is asymptotically minimax.
Adaptive asymptotic minimax ridge estimators are also identified. Orthogonal
invariance is heavily exploited throughout the paper and, beyond serving as a
technical tool, provides additional insight into the problems considered here.
Most of our results follow from an apparently novel analysis of an equivalent
non-Gaussian sequence model with orthogonally invariant errors. As with many
dense estimation and prediction problems, the minimax risk studied here has
rate d/n, where d is the number of predictors and n is the number of
observations; however, when d is roughly proportional to n the minimax risk is
influenced by the spectral distribution of the predictors and is notably
different from the linear minimax risk for the Gaussian sequence model
(Pinsker, 1980) that often appears in other dense estimation and prediction
problems.
Comment: 29 pages, 0 figures
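To make the d/n scaling described in this abstract concrete, the following is a minimal simulation sketch, not the paper's analysis: it fits ridge regression under a Gaussian design with a dense signal of fixed l2-norm, using a heuristic Bayes-style penalty choice; the function name ridge_risk and all parameter values are illustrative assumptions.

```python
# Minimal simulation sketch of ridge regression under a Gaussian design,
# illustrating that the estimation risk for a dense signal scales with d/n.
# Illustration only; the penalty choice below is a heuristic assumption.
import numpy as np

rng = np.random.default_rng(0)

def ridge_risk(n, d, sigma=1.0, signal_norm=1.0, reps=50):
    """Monte Carlo estimate of E||beta_hat - beta||^2 for ridge regression."""
    lam = d * sigma**2 / signal_norm**2              # heuristic penalty (assumption)
    risks = []
    for _ in range(reps):
        beta = rng.standard_normal(d)
        beta *= signal_norm / np.linalg.norm(beta)   # dense signal on an l2-sphere
        X = rng.standard_normal((n, d))              # Gaussian predictors
        y = X @ beta + sigma * rng.standard_normal(n)
        beta_hat = np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)
        risks.append(np.sum((beta_hat - beta) ** 2))
    return float(np.mean(risks))

# Risk stays roughly stable along a sequence with d/n fixed, consistent with a d/n rate.
for n, d in [(200, 20), (400, 40), (800, 80)]:
    print(f"n={n:4d}  d={d:3d}  risk~{ridge_risk(n, d):.3f}")
```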
A New Class of Dantzig Selectors for Censored Linear Regression Models
The Dantzig variable selector has recently emerged as a powerful tool for fitting regularized regression models. A key advantage is that, unlike existing penalized likelihood methods, it is not tied to a particular likelihood or objective function, and hence has the potential for wide applicability. To our knowledge, limited work has been done on the Dantzig selector when the outcome is subject to censoring. This paper proposes a new class of Dantzig variable selectors for linear regression models with right-censored outcomes. We first establish a finite-sample error bound for the estimator and show that the proposed selector is nearly optimal in the l2 sense. To improve model selection performance, we further propose an adaptive Dantzig variable selector and discuss its large-sample properties, namely consistency in model selection and asymptotic normality of the estimator. The practical utility of the proposed adaptive Dantzig selectors is verified via extensive simulations. We apply the proposed methods to a myeloma clinical trial and identify important predictive genes for patients' survival.
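For orientation, the sketch below implements the standard, uncensored Dantzig selector as a linear program via scipy.optimize.linprog; the censored-data selectors proposed in the paper replace the score X'(y - X*beta) with a censoring-adjusted estimating function, which is not reproduced here, and the tuning value lam in the toy demo is an assumed heuristic.

```python
# Sketch of the standard (uncensored) Dantzig selector as a linear program:
#     min ||beta||_1  subject to  ||X'(y - X beta)||_inf <= lam.
# Not the paper's censored-data method; that version swaps in a
# censoring-adjusted estimating function for X'(y - X beta).
import numpy as np
from scipy.optimize import linprog

def dantzig_selector(X, y, lam):
    n, d = X.shape
    G, g = X.T @ X, X.T @ y
    # Split beta = u - v with u, v >= 0 and minimize sum(u) + sum(v).
    c = np.ones(2 * d)
    A_ub = np.vstack([np.hstack([G, -G]),     #  X'X beta - X'y <= lam
                      np.hstack([-G, G])])    # -X'X beta + X'y <= lam
    b_ub = np.concatenate([lam + g, lam - g])
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, method="highs")  # bounds default to >= 0
    return res.x[:d] - res.x[d:]

# Toy demo with a sparse truth; lam here is an assumed heuristic tuning value.
rng = np.random.default_rng(1)
X = rng.standard_normal((100, 20))
beta = np.zeros(20)
beta[:3] = [2.0, -1.5, 1.0]
y = X @ beta + 0.5 * rng.standard_normal(100)
lam = 0.5 * np.sqrt(2 * 100 * np.log(20))
print(np.round(dantzig_selector(X, y, lam), 2))
```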
- …