117 research outputs found
Don't Tie Yourself to an Onion: Don’t Tie Yourself to Assumptions of Normality
A structural measurement model (Adams, Wilson, & Wu, 1997) consists of an item response theory model for responses conditional on ability and a structural model that describes the distribution of ability in the population. As a rule, ability is assumed to be normally distributed in the population. However, there are situations where there is reason to assume that the distribution of ability is nonnormal. In this paper, we show that nonnormal ability distributions are easily modeled in a Bayesian framewor
Three representations of the Ising model
Statistical models that analyse (pairwise) relations between variables
encompass assumptions about the underlying mechanism that generated the
associations in the observed data. In the present paper we demonstrate that
three Ising model representations exist that, although each proposes a distinct
theoretical explanation for the observed associations, are mathematically
equivalent. This equivalence allows the researcher to interpret the results of
one model in three different ways. We illustrate the ramifications of this by
discussing concepts that are conceived as problematic in their traditional
explanation, yet when interpreted in the context of another explanation make
immediate sense.Comment: 11 pages, 1 figur
Logistic regression and Ising networks: prediction and estimation when violating lasso assumptions
The Ising model was originally developed to model magnetisation of solids in
statistical physics. As a network of binary variables with the probability of
becoming 'active' depending only on direct neighbours, the Ising model appears
appropriate for many other processes. For instance, it was recently applied in
psychology to model co-occurrences of mental disorders. It has been shown that
the connections between the variables (nodes) in the Ising network can be
estimated with a series of logistic regressions. This naturally leads to
questions of how well such a model predicts new observations and how well
parameters of the Ising model can be estimated using logistic regressions. Here
we focus on the high-dimensional setting with more parameters than observations
and consider violations of assumptions of the lasso. In particular, we
determine the consequences for both prediction and estimation when the sparsity
and restricted eigenvalue assumptions are not satisfied. We explain by using
the idea of connected copies (extreme multicollinearity) the fact that
prediction becomes better when either sparsity or multicollinearity is not
satisfied. We illustrate these results with simulations.Comment: to appear, Behaviormetrika, 201
A Rasch model and rating system for continuous responses collected in large-scale learning systems
An extension to a rating system for tracking the evolution of parameters over time using continuous variables is introduced. The proposed rating system assumes a distribution for the continuous responses, which is agnostic to the origin of the continuous scores and thus can be used for applications as varied as continuous scores obtained from language testing to scores derived from accuracy and response time from elementary arithmetic learning systems. Large-scale, high-stakes, online, anywhere anytime learning and testing inherently comes with a number of unique problems that require new psychometric solutions. These include (1) the cold start problem, (2) problem of change, and (3) the problem of personalization and adaptation. We outline how our proposed method addresses each of these problems. Three simulations are carried out to demonstrate the utility of the proposed rating system
Bayesian inference for low-rank Ising networks
Estimating the structure of Ising networks is a notoriously difficult problem. We demonstrate that using a latent variable representation of the Ising network, we can employ a full-data-information approach to uncover the network structure. Thereby, only ignoring information encoded in the prior distribution (of the latent variables). The full-data-information approach avoids having to compute the partition function and is thus computationally feasible, even for networks with many nodes. We illustrate the full-data-information approach with the estimation of dense network
Tracing systematic errors to personalize recommendations in single digit multiplication and beyond
In learning, errors are ubiquitous and inevitable. As these errors may signal otherwise latent cognitive processes, tutors - and students alike - can greatly benefit from the information they provide. In this paper, we introduce and evaluate the Systematic Error Tracing (SET) model that identifies the possible causes of systematically observed errors in domains where items are susceptible to most or all causes and errors can be explained by multiple causes. We apply the model to single-digit multiplication, a domain that is very suitable for the model, is well-studied, and allows us to analyze over 25,000 error responses from 335 learners. The model, derived from the Ising model popular in physics, makes use of a bigraph that links errors to causes. The error responses were taken from Math Garden, a computerized adaptive practice environment for arithmetic that is widely used in the Netherlands. We discuss and evaluate various model configurations with respect to the ranking of recommendations and calibration of probability estimates. The results show that the SET model outranks a majority vote baseline model when more than a single recommendation is considered. Finally, we contrast the SET model to similar approaches and discuss limitations and implications
Urnings:A new method for tracking dynamically changing parameters in paired comparison systems
We introduce a new rating system for tracking the development of parameters based on a stream of observations that can be viewed as paired comparisons. Rating systems are applied in competitive games, adaptive learning systems and platforms for product and service reviews. We model each observation as an outcome of a game of chance that depends on the parameters of interest (e.g. the outcome of a chess game depends on the abilities of the two players). Determining the probabilities of the different game outcomes is conceptualized as an urn problem, where a rating is represented by a probability (i.e. proportion of balls in the urn). This setup allows for evaluating the standard errors of the ratings and performing statistical inferences about the development of, and relations between, parameters. Theoretical properties of the system in terms of the invariant distributions of the ratings and their convergence are derived. The properties of the rating system are illustrated with simulated examples and its potential for answering research questions is illustrated using data from competitive chess, a movie review system, and an adaptive learning system for math
Tracking a multitude of abilities as they develop
Recently, the Urnings algorithm (Bolsinova et al., 2022, J. R. Stat. Soc. Ser. C Appl. Statistics, 71, 91) has been proposed that allows for tracking the development of abilities of the learners and the difficulties of the items in adaptive learning systems. It is a simple and scalable algorithm which is suited for large-scale applications in which large streams of data are coming into the system and on-the-fly updating is needed. Compared to alternatives like the Elo rating system and its extensions, the Urnings rating system allows the uncertainty of the ratings to be evaluated and accounts for adaptive item selection which, if not corrected for, may distort the ratings. In this paper we extend the Urnings algorithm to allow for both between-item and within-item multidimensionality. This allows for tracking the development of interrelated abilities both at the individual and the population level. We present formal derivations of the multidimensional Urnings algorithm, illustrate its properties in simulations, and present an application to data from an adaptive learning system for primary school mathematics called Math Garden
- …