11 research outputs found

    Multivariate statistical analysis: a high-dimensional approach

    No full text
    In the last few decades the accumulation of large amounts of in­ formation in numerous applications. has stimtllated an increased in­ terest in multivariate analysis. Computer technologies allow one to use multi-dimensional and multi-parametric models successfully. At the same time, an interest arose in statistical analysis with a de­ ficiency of sample data. Nevertheless, it is difficult to describe the recent state of affairs in applied multivariate methods as satisfactory. Unimprovable (dominating) statistical procedures are still unknown except for a few specific cases. The simplest problem of estimat­ ing the mean vector with minimum quadratic risk is unsolved, even for normal distributions. Commonly used standard linear multivari­ ate procedures based on the inversion of sample covariance matrices can lead to unstable results or provide no solution in dependence of data. Programs included in standard statistical packages cannot process 'multi-collinear data' and there are no theoretical recommen­ dations except to ignore a part of the data. The probability of data degeneration increases with the dimension n, and for n > N, where N is the sample size, the sample covariance matrix has no inverse. Thus nearly all conventional linear methods of multivariate statis­ tics prove to be unreliable or even not applicable to high-dimensional data

    Unimprovable solution to systems of empirical linear algebraic equations

    No full text
    An optimum solution, free from degeneration, is found for a system of linear algebraic equations with empirical coefficients and right-hand sides. The quadratic risk of estimators of the unknown solution vector is minimized over a class of linear systems with given square norm of the coefficient matrix and length of the vector on the right-hand side. Empirical coefficients and the right-hand sides are assumed to be independent and normal with known variance. It is found that the optimal estimator has the form of a regularized minimum square solution with an extension multiple. A simple formula is derived showing explicitly the dependence of the minimal risk on parameters.Empirical systems of equations Empirical linear models Systems of random linear algebraic equations

    Normal model for distribution-free multivariate analysis

    No full text
    It is shown that standard quality functions of n-dimensional regularized linear multivariate procedures can be accurately estimated under the assumption that of populations normality if n is so large that it is comparable in magnitude with sample size. The correction terms are estimated from above and their upper estimates are obtained with accuracy up to absolute constants.Multivariate analysis Large dimension Normal model
    corecore