20,432 research outputs found
On the Procrustean analogue of individual differences scaling (INDSCAL)
In this paper, individual differences scaling (INDSCAL) is revisited, considering
INDSCAL as being embedded within a hierarchy of individual difference scaling
models. We explore the members of this family, distinguishing (i) models, (ii) the
role of identification and substantive constraints, (iii) criteria for fitting models and (iv) algorithms to optimise the criteria. Model formulations may be based either on data that are in the form of proximities or on configurational matrices. In its configurational version, individual difference scaling may be formulated as a form of generalized Procrustes analysis. Algorithms are introduced for fitting the new
models. An application from sensory evaluation illustrates the performance of the
methods and their solutions
Global and local distance-based generalized linear models
This paper introduces local distance-based generalized linear models. These models extend (weighted) distance-based linear models first to the generalized linear model framework. Then, a nonparametric version of these models is proposed by means of local fitting. Distances between individuals are the only predictor information needed to fit these models. Therefore, they are applicable, among others, to mixed (qualitative and quantitative) explanatory variables or when the regressor is of functional type. An implementation is provided by the R package dbstats, which also implements other distance-based prediction methods. Supplementary material for this article is available online, which reproduces all the results of this article.Peer ReviewedPostprint (author's final draft
Multiple Correspondence Analysis & the Multilogit Bilinear Model
Multiple Correspondence Analysis (MCA) is a dimension reduction method which
plays a large role in the analysis of tables with categorical nominal variables
such as survey data. Though it is usually motivated and derived using geometric
considerations, in fact we prove that it amounts to a single proximal Newtown
step of a natural bilinear exponential family model for categorical data the
multinomial logit bilinear model. We compare and contrast the behavior of MCA
with that of the model on simulations and discuss new insights on the
properties of both exploratory multivariate methods and their cognate models.
One main conclusion is that we could recommend to approximate the multilogit
model parameters using MCA. Indeed, estimating the parameters of the model is
not a trivial task whereas MCA has the great advantage of being easily solved
by singular value decomposition and scalable to large data
A generalized framework for analyzing taxonomic, phylogenetic, and functional community structure based on presence-absence data
Community structure as summarized by presence–absence data is often evaluated via diversity measures by incorporating taxonomic, phylogenetic and functional information on the constituting species. Most commonly, various dissimilarity coefficients are used to express these aspects simultaneously such that the results are not comparable due to the lack of common conceptual basis behind index definitions. A new framework is needed which allows such comparisons, thus facilitating evaluation of the importance of the three sources of extra information in relation to conventional species-based representations. We define taxonomic, phylogenetic and functional beta diversity of species assemblages based on the generalized Jaccard dissimilarity index. This coefficient does not give equal weight to species, because traditional site dissimilarities are lowered by taking into account the taxonomic, phylogenetic or functional similarity of differential species in one site to the species in the other. These, together with the traditional, taxon- (species-) based beta diversity are decomposed into two additive fractions, one due to taxonomic, phylogenetic or functional excess and the other to replacement. In addition to numerical results, taxonomic, phylogenetic and functional community structure is visualized by 2D simplex or ternary plots. Redundancy with respect to taxon-based structure is expressed in terms of centroid distances between point clouds in these diagrams. The approach is illustrated by examples coming from vegetation surveys representing different ecological conditions. We found that beta diversity decreases in the following order: taxon-based, taxonomic (Linnaean), phylogenetic and functional. Therefore, we put forward the beta-redundancy hypothesis suggesting that this ordering may be most often the case in ecological communities, and discuss potential reasons and possible exceptions to this supposed rule. Whereas the pattern of change in diversity may be indicative of fundamental features of the particular community being studied, the effect of the choice of functional traits—a more or less subjective element of the framework—remains to be investigated
- …
