33,211 research outputs found

    Estimation of means in graphical Gaussian models with symmetries

    Full text link
    We study the problem of estimability of means in undirected graphical Gaussian models with symmetry restrictions represented by a colored graph. Following on from previous studies, we partition the variables into sets of vertices whose corresponding means are restricted to being identical. We find a necessary and sufficient condition on the partition to ensure equality between the maximum likelihood and least-squares estimators of the mean.Comment: Published in at http://dx.doi.org/10.1214/12-AOS991 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    MaxSkew and MultiSkew: Two R Packages for Detecting, Measuring and Removing Multivariate Skewness

    Full text link
    Skewness plays a relevant role in several multivariate statistical techniques. Sometimes it is used to recover data features, as in cluster analysis. In other circumstances, skewness impairs the performances of statistical methods, as in the Hotelling's one-sample test. In both cases, there is the need to check the symmetry of the underlying distribution, either by visual inspection or by formal testing. The R packages MaxSkew and MultiSkew address these issues by measuring, testing and removing skewness from multivariate data. Skewness is assessed by the third multivariate cumulant and its functions. The hypothesis of symmetry is tested either nonparametrically, with the bootstrap, or parametrically, under the normality assumption. Skewness is removed or at least alleviated by projecting the data onto appropriate linear subspaces. Usages of MaxSkew and MultiSkew are illustrated with the Iris dataset

    Semiparametrically efficient rank-based inference for shape I. optimal rank-based tests for sphericity

    Full text link
    We propose a class of rank-based procedures for testing that the shape matrix V\mathbf{V} of an elliptical distribution (with unspecified center of symmetry, scale and radial density) has some fixed value V0{\mathbf{V}}_0; this includes, for V0=Ik{\mathbf{V}}_0={\mathbf{I}}_k, the problem of testing for sphericity as an important particular case. The proposed tests are invariant under translations, monotone radial transformations, rotations and reflections with respect to the estimated center of symmetry. They are valid without any moment assumption. For adequately chosen scores, they are locally asymptotically maximin (in the Le Cam sense) at given radial densities. They are strictly distribution-free when the center of symmetry is specified, and asymptotically so when it must be estimated. The multivariate ranks used throughout are those of the distances--in the metric associated with the null value V0{\mathbf{V}}_0 of the shape matrix--between the observations and the (estimated) center of the distribution. Local powers (against elliptical alternatives) and asymptotic relative efficiencies (AREs) are derived with respect to the adjusted Mauchly test (a modified version of the Gaussian likelihood ratio procedure proposed by Muirhead and Waternaux [Biometrika 67 (1980) 31--43]) or, equivalently, with respect to (an extension of) the test for sphericity introduced by John [Biometrika 58 (1971) 169--174]. For Gaussian scores, these AREs are uniformly larger than one, irrespective of the actual radial density. Necessary and/or sufficient conditions for consistency under nonlocal, possibly nonelliptical alternatives are given. Finite sample performances are investigated via a Monte Carlo study.Comment: Published at http://dx.doi.org/10.1214/009053606000000731 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Binary Models for Marginal Independence

    Full text link
    Log-linear models are a classical tool for the analysis of contingency tables. In particular, the subclass of graphical log-linear models provides a general framework for modelling conditional independences. However, with the exception of special structures, marginal independence hypotheses cannot be accommodated by these traditional models. Focusing on binary variables, we present a model class that provides a framework for modelling marginal independences in contingency tables. The approach taken is graphical and draws on analogies to multivariate Gaussian models for marginal independence. For the graphical model representation we use bi-directed graphs, which are in the tradition of path diagrams. We show how the models can be parameterized in a simple fashion, and how maximum likelihood estimation can be performed using a version of the Iterated Conditional Fitting algorithm. Finally we consider combining these models with symmetry restrictions

    Lattices of Graphical Gaussian Models with Symmetries

    Full text link
    In order to make graphical Gaussian models a viable modelling tool when the number of variables outgrows the number of observations, model classes which place equality restrictions on concentrations or partial correlations have previously been introduced in the literature. The models can be represented by vertex and edge coloured graphs. The need for model selection methods makes it imperative to understand the structure of model classes. We identify four model classes that form complete lattices of models with respect to model inclusion, which qualifies them for an Edwards-Havr\'anek model selection procedure. Two classes turn out most suitable for a corresponding model search. We obtain an explicit search algorithm for one of them and provide a model search example for the other.Comment: 29 pages, 18 figures. Restructured Section 5, results unchanged; added references in Section 6; amended example in Section 6.

    Conditional tests for elliptical symmetry using robust estimators

    Full text link
    This paper presents a procedure for testing the hypothesis that the underlying distribution of the data is elliptical when using robust location and scatter estimators instead of the sample mean and covariance matrix. Under mild assumptions that include elliptical distributions without first moments, we derive the test statistic asymptotic behaviour under the null hypothesis and under special alternatives. Numerical experiments allow to compare the behaviour of the tests based on the sample mean and covariance matrix with that based on robust estimators, under various elliptical distributions and different alternatives. This comparison was done looking not only at the observed level and power but we rather use the size-corrected relative exact power which provides a tool to assess the test statistic skill to detect alternatives. We also provide a numerical comparison with other competing tests.Comment: In press in Communications in Statistics: Theory and Methods, 201
    • …
    corecore