30 research outputs found
Asymptotics in Minimum Distance from Independence Estimation
In this paper we introduce a family of minimum distance from independence estimators, suggested by Manski's minimum mean square from independence estimator. We establish strong consistency, asymptotic normality and consistency of resampling estimates of the distribution and variance of these estimators. For Manski's estimator we derive both strong consistency and asymptotic normality.Donsker class, empirical processes, extremum estimator, nonlinear simultaneous equations models, resampling estimators
Optimal selection of reduced rank estimators of high-dimensional matrices
We introduce a new criterion, the Rank Selection Criterion (RSC), for
selecting the optimal reduced rank estimator of the coefficient matrix in
multivariate response regression models. The corresponding RSC estimator
minimizes the Frobenius norm of the fit plus a regularization term proportional
to the number of parameters in the reduced rank model. The rank of the RSC
estimator provides a consistent estimator of the rank of the coefficient
matrix; in general, the rank of our estimator is a consistent estimate of the
effective rank, which we define to be the number of singular values of the
target matrix that are appropriately large. The consistency results are valid
not only in the classic asymptotic regime, when , the number of responses,
and , the number of predictors, stay bounded, and , the number of
observations, grows, but also when either, or both, and grow, possibly
much faster than . We establish minimax optimal bounds on the mean squared
errors of our estimators. Our finite sample performance bounds for the RSC
estimator show that it achieves the optimal balance between the approximation
error and the penalty term. Furthermore, our procedure has very low
computational complexity, linear in the number of candidate models, making it
particularly appealing for large scale problems. We contrast our estimator with
the nuclear norm penalized least squares (NNP) estimator, which has an
inherently higher computational complexity than RSC, for multivariate
regression models. We show that NNP has estimation properties similar to those
of RSC, albeit under stronger conditions. However, it is not as parsimonious as
RSC. We offer a simple correction of the NNP estimator which leads to
consistent rank estimation.Comment: Published in at http://dx.doi.org/10.1214/11-AOS876 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org) (some typos corrected
Weighted Minimum Mean-Square Distance from Independence Estimation
In this paper we introduce a family of semi-parametric estimators, suggested by Manski's minimum mean-square distance from independence estimator. We establish the strong consistency, asymptotic normality and consistency of bootstrap estimates of the sampling distribution and the asymptotic variance of these estimators.Semiparametric estimation, simultaneous equations models, empirical processes, extremum estimators
Aggregation for Gaussian regression
This paper studies statistical aggregation procedures in the regression
setting. A motivating factor is the existence of many different methods of
estimation, leading to possibly competing estimators. We consider here three
different types of aggregation: model selection (MS) aggregation, convex (C)
aggregation and linear (L) aggregation. The objective of (MS) is to select the
optimal single estimator from the list; that of (C) is to select the optimal
convex combination of the given estimators; and that of (L) is to select the
optimal linear combination of the given estimators. We are interested in
evaluating the rates of convergence of the excess risks of the estimators
obtained by these procedures. Our approach is motivated by recently published
minimax results [Nemirovski, A. (2000). Topics in non-parametric statistics.
Lectures on Probability Theory and Statistics (Saint-Flour, 1998). Lecture
Notes in Math. 1738 85--277. Springer, Berlin; Tsybakov, A. B. (2003). Optimal
rates of aggregation. Learning Theory and Kernel Machines. Lecture Notes in
Artificial Intelligence 2777 303--313. Springer, Heidelberg]. There exist
competing aggregation procedures achieving optimal convergence rates for each
of the (MS), (C) and (L) cases separately. Since these procedures are not
directly comparable with each other, we suggest an alternative solution. We
prove that all three optimal rates, as well as those for the newly introduced
(S) aggregation (subset selection), are nearly achieved via a single
``universal'' aggregation procedure. The procedure consists of mixing the
initial estimators with weights obtained by penalized least squares. Two
different penalties are considered: one of them is of the BIC type, the second
one is a data-dependent -type penalty.Comment: Published in at http://dx.doi.org/10.1214/009053606000001587 the
Annals of Statistics (http://www.imstat.org/aos/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Tests of Independence in Separable Econometric Models: Theory and Application
A common stochastic restriction in econometric models separable in the latent variables is the assumption of stochastic independence between the unobserved and observed exogenous variables. Both simple and composite tests of this assumption are derived from properties of independence empirical processes and the consistency of these tests is established. As an application, we simulate estimation of a random quasilinear utility function, where we apply our tests of independence.Cramerāvon Mises distance, Empirical independence processes, Random utility models, Semiparametric econometric models, Specification test of independence
Tests of Independence in Separable Econometric Models: Theory and Application
A common stochastic restriction in econometric models separable in the latent variables is the assumption of stochastic independence between the unobserved and observed exogenous variables. Both simple and composite tests of this assumption are derived from properties of independence empirical processes and the consistency of these tests is established. As an application, we simulate estimation of a random quasilinear utility function, where we apply our tests of independence.Cramer-von Mises distance, Empirical independence processes, Random utility models, Semiparametric econometric models, Specification test of independence
Tests of Independence in Separable Econometric Models
A common stochastic restriction in econometric models separable in the latent variables is the assumption of stochastic independence between the unobserved and observed exogenous variables. Both simple and composite tests of this assumption are derived from properties of independence empirical processes and the consistency of these tests is established.Cramer-von Mises distance, Empirical independence processes, Random utility models, Semiparametric econometric models, Specification test of independence
Joint variable and rank selection for parsimonious estimation of high-dimensional matrices
We propose dimension reduction methods for sparse, high-dimensional
multivariate response regression models. Both the number of responses and that
of the predictors may exceed the sample size. Sometimes viewed as
complementary, predictor selection and rank reduction are the most popular
strategies for obtaining lower-dimensional approximations of the parameter
matrix in such models. We show in this article that important gains in
prediction accuracy can be obtained by considering them jointly. We motivate a
new class of sparse multivariate regression models, in which the coefficient
matrix has low rank and zero rows or can be well approximated by such a matrix.
Next, we introduce estimators that are based on penalized least squares, with
novel penalties that impose simultaneous row and rank restrictions on the
coefficient matrix. We prove that these estimators indeed adapt to the unknown
matrix sparsity and have fast rates of convergence. We support our theoretical
results with an extensive simulation study and two data analyses.Comment: Published in at http://dx.doi.org/10.1214/12-AOS1039 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Tests of Independence in Separable Econometric Models: Theory and Application
A common stochastic restriction in econometric models separable in the latent variables is the assumption of stochastic independence between the unobserved and observed exogenous variables. Both simple and composite tests of this assumption are derived from properties of independence empirical processes and the consistency of these tests is established. As an application, we stimulate estimation of a random quasilinear utility function, where we apply our tests of independence
Tests of Independence in Separable Econometric Models: Theory and Application
A common stochastic restriction in econometric models separable in the latent variables is the assumption of stochastic independence between the unobserved and observed exogenous variables. Both simple and composite tests of this assumption are derived from properties of independence empirical processes and the consistency of these tests is established. As an application, we simulate estimation of a random quasilinear utility function, where we apply our tests of independence