9 research outputs found
A test for multivariate skew-normality based on its canonical form
Abstract A test to assess if a sample comes from a multivariate skew-normal distribution is proposed. The test statistic is obtained from the canonical form of the multivariate skew-normal distribution and its null distribution is derived. The power of the proposed test is evaluated through Monte Carlo simulations for different conveniently chosen alternatives. Finally, three numerical examples are presented for the purpose of illustration
Model-based clustering and classification using mixtures of multivariate skewed power exponential distributions
Families of mixtures of multivariate power exponential (MPE) distributions
have been previously introduced and shown to be competitive for cluster
analysis in comparison to other elliptical mixtures including mixtures of
Gaussian distributions. Herein, we propose a family of mixtures of multivariate
skewed power exponential distributions to combine the flexibility of the MPE
distribution with the ability to model skewness. These mixtures are more robust
to variations from normality and can account for skewness, varying tail weight,
and peakedness of data. A generalized expectation-maximization approach
combining minorization-maximization and optimization based on accelerated line
search algorithms on the Stiefel manifold is used for parameter estimation.
These mixtures are implemented both in the model-based clustering and
classification frameworks. Both simulated and benchmark data are used for
illustration and comparison to other mixture families
The joint role of trimming and constraints in robust estimation for mixtures of Gaussian factor analyzers.
Producción CientíficaMixtures of Gaussian factors are powerful tools for modeling an unobserved heterogeneous
population, offering – at the same time – dimension reduction and model-based clustering. The high prevalence of spurious solutions and the disturbing effects of outlying observations in maximum likelihood estimation may cause biased or misleading inferences. Restrictions for the component covariances are considered in order to avoid spurious solutions, and trimming is also adopted, to provide robustness against violations of normality assumptions of the underlying latent factors. A detailed AECM algorithm for this new approach is presented. Simulation results and an application to the AIS dataset show the aim and effectiveness of the proposed methodology.Ministerio de Economía y Competitividad and FEDER, grant MTM2014-56235-C2-1-P, and by Consejería de Educación de la Junta de Castilla y León, grant VA212U13, by grant FAR 2015 from the University of Milano-Bicocca and by grant FIR 2014 from the University of Catania
The joint role of trimming and constraints in robust estimation for mixtures of Gaussian factor analyzers.
Producción CientíficaMixtures of Gaussian factors are powerful tools for modeling an unobserved heterogeneous
population, offering – at the same time – dimension reduction and model-based clustering. The high prevalence of spurious solutions and the disturbing effects of outlying observations in maximum likelihood estimation may cause biased or misleading inferences. Restrictions for the component covariances are considered in order to avoid spurious solutions, and trimming is also adopted, to provide robustness against violations of normality assumptions of the underlying latent factors. A detailed AECM algorithm for this new approach is presented. Simulation results and an application to the AIS dataset show the aim and effectiveness of the proposed methodology.Ministerio de Economía y Competitividad and FEDER, grant MTM2014-56235-C2-1-P, and by Consejería de Educación de la Junta de Castilla y León, grant VA212U13, by grant FAR 2015 from the University of Milano-Bicocca and by grant FIR 2014 from the University of Catania
Mixture of linear experts model for censored data: A novel approach with scale-mixture of normal distributions
The classical mixture of linear experts (MoE) model is one of the widespread
statistical frameworks for modeling, classification, and clustering of data.
Built on the normality assumption of the error terms for mathematical and
computational convenience, the classical MoE model has two challenges: 1) it is
sensitive to atypical observations and outliers, and 2) it might produce
misleading inferential results for censored data. The paper is then aimed to
resolve these two challenges, simultaneously, by proposing a novel robust MoE
model for model-based clustering and discriminant censored data with the
scale-mixture of normal class of distributions for the unobserved error terms.
Based on this novel model, we develop an analytical expectation-maximization
(EM) type algorithm to obtain the maximum likelihood parameter estimates.
Simulation studies are carried out to examine the performance, effectiveness,
and robustness of the proposed methodology. Finally, real data is used to
illustrate the superiority of the new model.Comment: 21 pages
Extending mixtures of factor models using the restricted multivariate skew-normal distribution
The mixture of factor analyzers (MFA) model provides a powerful tool for analyzing high-dimensional data as it can reduce the number of free parameters through its factor-analytic representation of the component covariance matrices. This paper extends the MFA model to incorporate a restricted version of the multivariate skew-normal distribution for the latent component factors, called mixtures of skew-normal factor analyzers (MSNFA). The proposed MSNFA model allows us to relax the need of the normality assumption for the latent factors in order to accommodate skewness in the observed data. The MSNFA model thus provides an approach to model-based density estimation and clustering of high-dimensional data exhibiting asymmetric characteristics. A computationally feasible Expectation Conditional Maximization (ECM) algorithm is developed for computing the maximum likelihood estimates of model parameters. The potential of the proposed methodology is exemplified using both real and simulated data. (C) 2015 Elsevier Inc. All rights reserved