24,661 research outputs found
Clustering South African households based on their asset status using latent variable models
The Agincourt Health and Demographic Surveillance System has since 2001
conducted a biannual household asset survey in order to quantify household
socio-economic status (SES) in a rural population living in northeast South
Africa. The survey contains binary, ordinal and nominal items. In the absence
of income or expenditure data, the SES landscape in the study population is
explored and described by clustering the households into homogeneous groups
based on their asset status. A model-based approach to clustering the Agincourt
households, based on latent variable models, is proposed. In the case of
modeling binary or ordinal items, item response theory models are employed. For
nominal survey items, a factor analysis model, similar in nature to a
multinomial probit model, is used. Both model types have an underlying latent
variable structure - this similarity is exploited and the models are combined
to produce a hybrid model capable of handling mixed data types. Further, a
mixture of the hybrid models is considered to provide clustering capabilities
within the context of mixed binary, ordinal and nominal response data. The
proposed model is termed a mixture of factor analyzers for mixed data (MFA-MD).
The MFA-MD model is applied to the survey data to cluster the Agincourt
households into homogeneous groups. The model is estimated within the Bayesian
paradigm, using a Markov chain Monte Carlo algorithm. Intuitive groupings
result, providing insight to the different socio-economic strata within the
Agincourt region.Comment: Published in at http://dx.doi.org/10.1214/14-AOAS726 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Model Based Clustering for Mixed Data: clustMD
A model based clustering procedure for data of mixed type, clustMD, is
developed using a latent variable model. It is proposed that a latent variable,
following a mixture of Gaussian distributions, generates the observed data of
mixed type. The observed data may be any combination of continuous, binary,
ordinal or nominal variables. clustMD employs a parsimonious covariance
structure for the latent variables, leading to a suite of six clustering models
that vary in complexity and provide an elegant and unified approach to
clustering mixed data. An expectation maximisation (EM) algorithm is used to
estimate clustMD; in the presence of nominal data a Monte Carlo EM algorithm is
required. The clustMD model is illustrated by clustering simulated mixed type
data and prostate cancer patients, on whom mixed data have been recorded
Estimation of extended mixed models using latent classes and latent processes: the R package lcmm
The R package lcmm provides a series of functions to estimate statistical
models based on linear mixed model theory. It includes the estimation of mixed
models and latent class mixed models for Gaussian longitudinal outcomes (hlme),
curvilinear and ordinal univariate longitudinal outcomes (lcmm) and curvilinear
multivariate outcomes (multlcmm), as well as joint latent class mixed models
(Jointlcmm) for a (Gaussian or curvilinear) longitudinal outcome and a
time-to-event that can be possibly left-truncated right-censored and defined in
a competing setting. Maximum likelihood esimators are obtained using a modified
Marquardt algorithm with strict convergence criteria based on the parameters
and likelihood stability, and on the negativity of the second derivatives. The
package also provides various post-fit functions including goodness-of-fit
analyses, classification, plots, predicted trajectories, individual dynamic
prediction of the event and predictive accuracy assessment. This paper
constitutes a companion paper to the package by introducing each family of
models, the estimation technique, some implementation details and giving
examples through a dataset on cognitive aging
Together forever? Explaining exclusivity in party-firm relations
Parties and firms are the key actors of representative democracy and capitalism respectively and the dynamic of attachment between them is a central feature of any political economy. This is the first article to systematically analyse the exclusivity of party-firm relations. We consider exclusivity at a point in time and exclusivity over time. Does a firm have a relationship with only one party at a given point in time, or is it close to more than one party? Does a firm maintain a relationship with only one party over time, or does it switch between parties? Most important, how do patterns of exclusivity impact on a firmâs ability to lobby successfully? We propose a general theory, which explains patterns of party-firm relations by reference to the division of institutions and the type of party competition in a political system. A preliminary test of our theory with Polish survey data confirms our predictions, establishing a promising hypothesis for future research
- âŠ