slides

Sharp bounds on the rate of convergence of the empirical covariance matrix

Abstract

Let X1,...,XNRnX_1,..., X_N\in\R^n be independent centered random vectors with log-concave distribution and with the identity as covariance matrix. We show that with overwhelming probability at least 13exp(cn)˚1 - 3 \exp(-c\sqrt{n}\r) one has \sup_{x\in S^{n-1}} \Big|\frac{1/N}\sum_{i=1}^N (||^2 - \E||^2\r)\Big| \leq C \sqrt{\frac{n/N}}, where CC is an absolute positive constant. This result is valid in a more general framework when the linear forms ()iN,xSn1()_{i\leq N, x\in S^{n-1}} and the Euclidean norms (Xi/n)iN(|X_i|/\sqrt n)_{i\leq N} exhibit uniformly a sub-exponential decay. As a consequence, if AA denotes the random matrix with columns (Xi)(X_i), then with overwhelming probability, the extremal singular values λmin\lambda_{\rm min} and λmax\lambda_{\rm max} of AAAA^\top satisfy the inequalities 1Cn/Nλmin/Nλmax/N1+Cn/N 1 - C\sqrt{{n/N}} \le {\lambda_{\rm min}/N} \le \frac{\lambda_{\rm max}/N} \le 1 + C\sqrt{{n/N}} which is a quantitative version of Bai-Yin theorem \cite{BY} known for random matrices with i.i.d. entries

    Similar works