Detecting differential item functioning in 2PL multistage assessments

Abstract

The detection of differential item functioning (DIF) is crucial for the psychometric evaluation of multistage tests. This paper discusses five approaches presented in the literature: logistic regression, SIBTEST, analytical score-based tests, bootstrap score-based tests, and permutation score-based tests. First, using a simulation study inspired by a real-life large-scale educational assessment, we compare the five approaches with respect to their type I error rate and their statistical power. Then, we present an application to an empirical data set. We find that all approaches show type I error rates close to the nominal alpha level. Furthermore, all approaches are sensitive to uniform and non-uniform DIF effects, with the score-based tests showing the highest power.
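To illustrate the first of the five approaches, the following is a minimal sketch of a logistic regression DIF test in the style of Swaminathan and Rogers: an item response is regressed on a matching score, group membership, and their interaction, where the group main effect signals uniform DIF and the interaction signals non-uniform DIF. The simulated data and variable names are illustrative only and are not taken from the paper's study design.

```python
# Hypothetical logistic regression DIF test (Swaminathan-Rogers style).
# Uniform DIF    -> group main effect
# Non-uniform DIF -> score-by-group interaction
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 2000
score = rng.normal(size=n)          # matching variable, e.g., total score
group = rng.integers(0, 2, size=n)  # reference (0) vs. focal (1) group

# Simulate one item with mild uniform DIF against the focal group
logit = 0.8 * score - 0.4 * group
resp = rng.binomial(1, 1 / (1 + np.exp(-logit)))

# Fit the DIF model: intercept, score, group, score x group
X = sm.add_constant(np.column_stack([score, group, score * group]))
fit = sm.Logit(resp, X).fit(disp=0)
print(fit.params)   # group coefficient: uniform DIF; interaction: non-uniform DIF
print(fit.pvalues)  # Wald tests for the DIF effects
```

In practice, the test is typically carried out as a likelihood ratio test comparing nested models (with and without the group terms) rather than via individual Wald statistics, but the model structure is the same.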