
    Optimal item pool design for computerized adaptive tests with polytomous items using GPCM

    Computerized adaptive testing (CAT) is a testing procedure with advantages in improving measurement precision and increasing test efficiency. An item pool with optimal characteristics is the foundation for a CAT program to achieve those desirable psychometric features. This study proposed a method to design an optimal item pool for tests with polytomous items using the generalized partial credit model (GPCM). It extended an existing method for approximating optimality so that polytomous items could be described succinctly for the purpose of pool design. Optimal item pools were generated using CAT simulations with and without the practical constraints of content balancing and item exposure control. The performance of the item pools was evaluated against an operational item pool. The results indicated that the item pools designed with stratification based on discrimination parameters performed well, making efficient use of the less discriminative items within the target accuracy levels. The implications for developing item pools are also discussed.
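
    The GPCM referred to above defines the probability of each score category from an item discrimination parameter and a set of step parameters, and adaptive item selection typically favours the item with the greatest Fisher information at the current ability estimate. The sketch below illustrates those two pieces with NumPy; the pool, parameter values, and selection rule are illustrative assumptions, not the study's actual pool-design procedure.

```python
import numpy as np

def gpcm_probs(theta, a, b):
    """Category probabilities under the generalized partial credit model.
    theta: latent trait value; a: discrimination; b: step parameters (K-1 of them)."""
    # Cumulative sums of a*(theta - b_j); the reference category 0 contributes 0.
    steps = np.concatenate(([0.0], np.cumsum(a * (theta - np.asarray(b)))))
    exps = np.exp(steps - steps.max())          # subtract max for numerical stability
    return exps / exps.sum()

def gpcm_information(theta, a, b):
    """Fisher information of one GPCM item at theta (variance of the category score)."""
    p = gpcm_probs(theta, a, b)
    k = np.arange(len(p))                       # category scores 0..K-1
    return a**2 * (np.sum(k**2 * p) - np.sum(k * p)**2)

# Hypothetical 3-step items; select the most informative item at the current estimate.
pool = [(1.2, [-0.5, 0.4, 1.1]), (0.7, [-1.0, 0.0, 0.9]), (1.6, [0.2, 0.8, 1.5])]
theta_hat = 0.3
best = max(range(len(pool)), key=lambda i: gpcm_information(theta_hat, *pool[i]))
print("selected item:", best)
```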

    Measuring the ICF components of impairment, activity limitation and participation restriction: an item analysis using classical test theory and item response theory

    The International Classification of Functioning, Disability and Health (ICF) proposes three main health outcomes, Impairment (I), Activity Limitation (A) and Participation Restriction (P), but good measures of these constructs are needed. The aim of this study was to use both Classical Test Theory (CTT) and Item Response Theory (IRT) methods to carry out an item analysis to improve measurement of these three components in patients having joint replacement surgery, mainly for osteoarthritis (OA). A geographical cohort of patients about to undergo lower limb joint replacement was invited to participate. Five hundred and twenty-four patients completed ICF items that had been previously identified as measuring only a single ICF construct in patients with osteoarthritis. There were 13 I, 26 A and 20 P items. The SF-36 was used to explore the construct validity of the resultant I, A and P measures. The CTT and IRT analyses were run separately to identify items for inclusion or exclusion in the measurement of each construct, and the results from the two analyses were compared and contrasted. Overall, the item analysis resulted in the removal of 4 I items, 9 A items and 11 P items. CTT and IRT identified the same 14 items for removal, with CTT additionally excluding 3 items and IRT a further 7 items. In a preliminary exploration of reliability and validity, the new measures appeared acceptable. New measures were developed that reflect the ICF components of Impairment, Activity Limitation and Participation Restriction for patients with advanced arthritis. The resulting Aberdeen IAP measures (Ab-IAP), comprising I (Ab-I, 9 items), A (Ab-A, 17 items) and P (Ab-P, 9 items), met the criteria of conventional psychometric (CTT) analyses and the additional criteria (information and discrimination) of IRT. The use of both methods was more informative than the use of either alone; combining CTT and IRT thus appears to be a valuable tool in the development of measures.
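
    On the CTT side, item screening of the kind described typically rests on corrected item-total correlations and on Cronbach's alpha recomputed with each candidate item removed. The following sketch computes both with NumPy; the simulated responses and any cut-offs a reader might apply are assumptions for illustration, not the selection criteria used in the study.

```python
import numpy as np

def ctt_item_stats(X):
    """CTT item screening: corrected item-total correlation and Cronbach's alpha
    with each item removed. X is a respondents-by-items array of item scores."""
    X = np.asarray(X, dtype=float)
    out = []
    for i in range(X.shape[1]):
        rest = np.delete(X, i, axis=1)
        rest_total = rest.sum(axis=1)
        r_it = np.corrcoef(X[:, i], rest_total)[0, 1]        # corrected item-total r
        k = rest.shape[1]
        alpha_wo = k / (k - 1) * (1 - rest.var(axis=0, ddof=1).sum()
                                  / rest_total.var(ddof=1))  # alpha if item i dropped
        out.append((r_it, alpha_wo))
    return out

# Illustrative simulated 5-point responses; a real analysis would use the ICF item data.
rng = np.random.default_rng(0)
responses = rng.integers(1, 6, size=(200, 10))
for i, (r, a) in enumerate(ctt_item_stats(responses)):
    print(f"item {i}: item-total r = {r:.2f}, alpha without item = {a:.2f}")
```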

    A proof of principle for using adaptive testing in Routine Outcome Monitoring: the efficiency of the Mood and Anxiety Symptoms Questionnaire - Anhedonic Depression CAT

    Background: In Routine Outcome Monitoring (ROM) there is a high demand for short assessments. Computerized Adaptive Testing (CAT) is a promising method for efficient assessment. In this article, the efficiency of a CAT version of the Mood and Anxiety Symptom Questionnaire - Anhedonic Depression scale (MASQ-AD) for use in ROM was scrutinized in a simulation study. Methods: The responses of a large sample of patients (N = 3,597) obtained through ROM were used. The psychometric evaluation showed that the items met the requirements for CAT. In the simulations, CATs with several measurement precision requirements were run on the item responses as if they had been collected adaptively. Results: CATs employing only a small number of items gave results which, both in terms of depression measurement and criterion validity, were only marginally different from the results of a full MASQ-AD assessment. Conclusions: CAT substantially improved the efficiency of the MASQ-AD questionnaire. The strengths and limitations of the application of CAT in ROM are discussed.
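
    The post-hoc simulation described in the Methods replays each respondent's stored answers as if they had been collected adaptively, stopping once a precision requirement is met. The sketch below shows that loop for a unidimensional, dichotomous 2PL item bank with an EAP ability estimate and a standard-error stopping rule; the MASQ-AD items are in fact polytomous, so the item model, the 0.32 standard-error target, and the bank itself are simplifying assumptions rather than the study's actual setup.

```python
import numpy as np

GRID = np.linspace(-4, 4, 81)

def eap(responses, items):
    """EAP ability estimate and posterior SD for dichotomous 2PL items."""
    post = np.exp(-0.5 * GRID**2)                      # standard normal prior
    for (a, b), u in zip(items, responses):
        p = 1 / (1 + np.exp(-a * (GRID - b)))
        post = post * (p if u == 1 else 1 - p)
    post = post / post.sum()
    theta = np.sum(GRID * post)
    se = np.sqrt(np.sum((GRID - theta) ** 2 * post))
    return theta, se

def post_hoc_cat(full_responses, bank, se_target=0.32):
    """Replay one respondent's full item responses as if collected adaptively."""
    used, answers = [], []
    theta, se = 0.0, np.inf
    while se > se_target and len(used) < len(bank):
        # choose the unused item with maximum Fisher information at theta
        def info(i):
            a, b = bank[i]
            p = 1 / (1 + np.exp(-a * (theta - b)))
            return a**2 * p * (1 - p)
        nxt = max((i for i in range(len(bank)) if i not in used), key=info)
        used.append(nxt)
        answers.append(full_responses[nxt])
        theta, se = eap(answers, [bank[i] for i in used])
    return theta, se, used

# Illustrative run: a 20-item bank and one simulated response pattern.
rng = np.random.default_rng(0)
bank = list(zip(rng.uniform(0.8, 2.0, 20), rng.uniform(-2, 2, 20)))
resp = rng.integers(0, 2, 20)
print(post_hoc_cat(resp, bank))
```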

    Linking tests of English for academic purposes to the CEFR: the score user’s perspective

    The Common European Framework of Reference for Languages (CEFR) is widely used in setting language proficiency requirements, including for international students seeking access to university courses taught in English. When different language examinations have been related to the CEFR, the process is claimed to help score users, such as university admissions staff, to compare and evaluate these examinations as tools for selecting qualified applicants. This study analyses the linking claims made for four internationally recognised tests of English widely used in university admissions. It uses the Council of Europe’s (2009) suggested stages of specification, standard setting, and empirical validation to frame an evaluation of the extent to which, in this context, the CEFR has fulfilled its potential to “facilitate comparisons between different systems of qualifications.” Findings show that testing agencies make little use of CEFR categories to explain test content; represent the relationships between their tests and the framework in different terms; and arrive at conflicting conclusions about the correspondences between test scores and CEFR levels. This raises questions about the capacity of the CEFR to communicate competing views of a test construct within a coherent overarching structure.

    Some recommendations for developing multidimensional computerized adaptive tests for patient-reported outcomes

    PURPOSE: Multidimensional item response theory and computerized adaptive testing (CAT) are increasingly used in mental health, quality of life (QoL), and patient-reported outcome measurement. Although multidimensional assessment techniques hold promise, they are more challenging in their application than unidimensional ones. The authors comment on minimal standards when developing multidimensional CATs. METHODS: Prompted by pioneering papers published in QLR, the authors reflect on existing guidance and discussions from different psychometric communities, including guidelines developed for unidimensional CATs in the PROMIS project. RESULTS: The commentary focuses on two key topics: (1) the design, evaluation, and calibration of multidimensional item banks and (2) how to study the efficiency and precision of a multidimensional item bank. The authors suggest that the development of a carefully designed and calibrated item bank encompasses a construction phase and a psychometric phase. With respect to efficiency and precision, item banks should be large enough to provide adequate precision over the full range of the latent constructs; CAT performance should therefore be studied as a function of the latent constructs and with reference to relevant benchmarks. Solutions are also suggested for simulation studies using real data, which often result in overly optimistic evaluations of an item bank's efficiency and precision. DISCUSSION: Multidimensional CAT applications are promising but complex statistical assessment tools that necessitate detailed theoretical frameworks and methodological scrutiny when testing their appropriateness for practical applications. The authors advise researchers to evaluate item banks with a broad set of methods, describe their choices in detail, and substantiate their approach to validation.
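
    One concrete way to study precision "as a function of the latent constructs" is to trace the bank's standard error of measurement across the trait range and compare it with a benchmark. The sketch below does this for a unidimensional 2PL bank, using SE(theta) = 1/sqrt(information) and an illustrative 0.32 threshold (roughly reliability 0.90 when the trait variance is 1); a genuinely multidimensional bank would need the multivariate information matrix, which is omitted here, and the bank itself is an assumption.

```python
import numpy as np

def se_profile(bank, grid=np.linspace(-3, 3, 61)):
    """Standard error of measurement of a 2PL item bank across the trait range:
    SE(theta) = 1 / sqrt(sum of item Fisher information at theta)."""
    a = np.array([item[0] for item in bank])
    b = np.array([item[1] for item in bank])
    se = []
    for theta in grid:
        p = 1 / (1 + np.exp(-a * (theta - b)))
        se.append(1 / np.sqrt(np.sum(a**2 * p * (1 - p))))
    return grid, np.array(se)

# Illustrative bank; flag trait regions where precision misses the SE <= 0.32 benchmark.
rng = np.random.default_rng(3)
bank = list(zip(rng.uniform(0.8, 2.0, 60), rng.uniform(-2.5, 2.5, 60)))
grid, se = se_profile(bank)
print("theta values with SE > 0.32:", grid[se > 0.32])
```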

    Risky business: factor analysis of survey data – assessing the probability of incorrect dimensionalisation

    This paper undertakes a systematic assessment of the extent to which factor analysis identifies the correct number of latent dimensions (factors) when applied to ordered categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions, such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser’s criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of both Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems.
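
    Parallel Analysis, one of the retention heuristics listed above, keeps a factor only if its observed eigenvalue exceeds the corresponding eigenvalue expected from random data of the same dimensions. The sketch below implements the Pearson-correlation version with NumPy and applies it to one simulated unidimensional set of 5-point Likert items; the simulation design, cut-off quantile, and omission of the polychoric variant are assumptions made for brevity rather than the paper's actual conditions.

```python
import numpy as np

def parallel_analysis(data, n_reps=100, quantile=0.95, seed=0):
    """Horn's parallel analysis: retain factors whose observed eigenvalues exceed
    the chosen quantile of eigenvalues from random data of the same shape.
    (Pearson correlations only; the polychoric variant is not shown here.)"""
    rng = np.random.default_rng(seed)
    n, k = data.shape
    obs = np.sort(np.linalg.eigvalsh(np.corrcoef(data, rowvar=False)))[::-1]
    rand = np.empty((n_reps, k))
    for r in range(n_reps):
        noise = rng.standard_normal((n, k))
        rand[r] = np.sort(np.linalg.eigvalsh(np.corrcoef(noise, rowvar=False)))[::-1]
    return int(np.sum(obs > np.quantile(rand, quantile, axis=0)))

# One unidimensional set of eight 5-point Likert items: a single latent factor,
# coarsened to ordered categories. A result above 1 signals over-dimensionalisation.
rng = np.random.default_rng(1)
factor = rng.standard_normal(500)
latent = 0.7 * factor[:, None] + 0.7 * rng.standard_normal((500, 8))
likert = np.digitize(latent, [-1.5, -0.5, 0.5, 1.5]) + 1
print("factors retained:", parallel_analysis(likert))
```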

    Predicting implementation from organizational readiness for change: a study protocol

    Background: There is widespread interest in measuring organizational readiness to implement evidence-based practices in clinical care. However, there are a number of challenges to validating organizational measures, including inferential bias arising from the halo effect and method bias - two threats to validity that, while well documented by organizational scholars, are often ignored in health services research. We describe a protocol to comprehensively assess the psychometric properties of a previously developed survey, the Organizational Readiness to Change Assessment. Objectives: Our objective is to conduct a comprehensive assessment of the psychometric properties of the Organizational Readiness to Change Assessment, incorporating methods specifically intended to address threats from the halo effect and method bias. Methods and Design: We will conduct three sets of analyses using longitudinal, secondary data from four partner projects, each testing interventions to improve the implementation of an evidence-based clinical practice. Partner projects field the Organizational Readiness to Change Assessment at baseline (n = 208 respondents; 53 facilities) and prospectively assess the degree to which the evidence-based practice is implemented. We will assess predictive and concurrent validity using hierarchical linear modeling and multivariate regression, respectively. For predictive validity, the outcome is the change from baseline to follow-up in the use of the evidence-based practice. We will use intra-class correlations derived from hierarchical linear models to assess inter-rater reliability. Two partner projects will also field measures of job satisfaction for convergent and discriminant validity analyses, and will field Organizational Readiness to Change Assessment measures at follow-up for concurrent validity (n = 158 respondents; 33 facilities). The convergent and discriminant validity analyses will test associations between organizational readiness and different aspects of job satisfaction: satisfaction with leadership, which should be highly correlated with readiness, versus satisfaction with salary, which should be less correlated with readiness. Content validity will be assessed using an expert panel and a modified Delphi technique. Discussion: We propose a comprehensive protocol for validating a survey instrument for assessing organizational readiness to change that specifically addresses key threats of bias related to the halo effect, method bias, and questions of construct validity that often go unexplored in research using measures of organizational constructs.
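
    The inter-rater reliability step above relies on intra-class correlations, i.e. the share of rating variance that lies between facilities rather than between respondents within a facility. The sketch below computes ICC(1) from the one-way ANOVA mean squares, which is closely related to the variance ratio obtained from a random-intercept hierarchical linear model; the facility counts, group size, and simulated ratings are illustrative assumptions only.

```python
import numpy as np

def icc1(scores, groups):
    """ICC(1) from a one-way random-effects layout: the share of rating variance
    that lies between facilities. With unequal group sizes an adjusted average
    group size is normally used; the plain mean here is an approximation."""
    scores, groups = np.asarray(scores, float), np.asarray(groups)
    labels = np.unique(groups)
    n_per = np.array([np.sum(groups == g) for g in labels])
    k = n_per.mean()
    grand = scores.mean()
    ms_between = sum(n * (scores[groups == g].mean() - grand) ** 2
                     for g, n in zip(labels, n_per)) / (len(labels) - 1)
    ms_within = sum(((scores[groups == g] - scores[groups == g].mean()) ** 2).sum()
                    for g in labels) / (scores.size - len(labels))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

# Illustrative: 53 facilities with 4 respondents each, rating readiness on a 1-5 scale.
rng = np.random.default_rng(2)
groups = np.repeat(np.arange(53), 4)
scores = rng.normal(3.5, 0.5, 53)[groups] + rng.normal(0, 0.7, groups.size)
print("ICC(1) =", round(icc1(scores, groups), 2))
```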