23 research outputs found

    User-Friendly Covariance Estimation for Heavy-Tailed Distributions

    Get PDF
    We offer a survey of recent results on covariance estimation for heavy-tailed distributions. By unifying ideas scattered in the literature, we propose user-friendly methods that facilitate practical implementation. Specifically, we introduce element-wise and spectrum-wise truncation operators, as well as their MM-estimator counterparts, to robustify the sample covariance matrix. Different from the classical notion of robustness that is characterized by the breakdown property, we focus on the tail robustness which is evidenced by the connection between nonasymptotic deviation and confidence level. The key observation is that the estimators needs to adapt to the sample size, dimensionality of the data and the noise level to achieve optimal tradeoff between bias and robustness. Furthermore, to facilitate their practical use, we propose data-driven procedures that automatically calibrate the tuning parameters. We demonstrate their applications to a series of structured models in high dimensions, including the bandable and low-rank covariance matrices and sparse precision matrices. Numerical studies lend strong support to the proposed methods.Comment: 56 pages, 2 figure

    Optimal robust mean and location estimation via convex programs with respect to any pseudo-norms

    Get PDF
    We consider the problem of robust mean and location estimation w.r.t. any pseudo-norm of the form xRdxS=supvSx\in\mathbb{R}^d\to ||x||_S = \sup_{v\in S} where SS is any symmetric subset of Rd\mathbb{R}^d. We show that the deviation-optimal minimax subgaussian rate for confidence 1δ1-\delta is max(l(Σ1/2S)N,supvSΣ1/2v2log(1/δ)N) \max\left(\frac{l^*(\Sigma^{1/2}S)}{\sqrt{N}}, \sup_{v\in S}||\Sigma^{1/2}v||_2\sqrt{\frac{\log(1/\delta)}{N}}\right) where l(Σ1/2S)l^*(\Sigma^{1/2}S) is the Gaussian mean width of Σ1/2S\Sigma^{1/2}S and Σ\Sigma the covariance of the data (in the benchmark i.i.d. Gaussian case). This improves the entropic minimax lower bound from [Lugosi and Mendelson, 2019] and closes the gap characterized by Sudakov's inequality between the entropy and the Gaussian mean width for this problem. This shows that the right statistical complexity measure for the mean estimation problem is the Gaussian mean width. We also show that this rate can be achieved by a solution to a convex optimization problem in the adversarial and L2L_2 heavy-tailed setup by considering minimum of some Fenchel-Legendre transforms constructed using the Median-of-means principle. We finally show that this rate may also be achieved in situations where there is not even a first moment but a location parameter exists

    All-In-One Robust Estimator of the Gaussian Mean

    Full text link
    The goal of this paper is to show that a single robust estimator of the mean of a multivariate Gaussian distribution can enjoy five desirable properties. First, it is computationally tractable in the sense that it can be computed in a time which is at most polynomial in dimension, sample size and the logarithm of the inverse of the contamination rate. Second, it is equivariant by translations, uniform scaling and orthogonal transformations. Third, it has a high breakdown point equal to 0.50.5, and a nearly-minimax-rate-breakdown point approximately equal to 0.280.28. Fourth, it is minimax rate optimal, up to a logarithmic factor, when data consists of independent observations corrupted by adversarially chosen outliers. Fifth, it is asymptotically efficient when the rate of contamination tends to zero. The estimator is obtained by an iterative reweighting approach. Each sample point is assigned a weight that is iteratively updated by solving a convex optimization problem. We also establish a dimension-free non-asymptotic risk bound for the expected error of the proposed estimator. It is the first result of this kind in the literature and involves only the effective rank of the covariance matrix. Finally, we show that the obtained results can be extended to sub-Gaussian distributions, as well as to the cases of unknown rate of contamination or unknown covariance matrix.Comment: 41 pages, 5 figures; added sub-Gaussian case with unknown Sigma or ep
    corecore