Standard setting: Comparison of two methods

Author: A Kramer
BH Verhoeven
DB Wayne
DC Howell
DM Kaufman
Femi Oyebode
G Hurtz
G Talente
GJ Cizek
GV Glass
J Searle
JC Impara
JJ Norcini
K Boursicot
M Cusimano
M Kane
M Sayeed Haque
MD Reckase
MJ Zieky
ML Fehrmann
National Research Council
P Armitage
PR Brandon
S Humphry-Murto
S Kilminster
Sanju George
SM Case
SM Downing
WA Angoff
Publication venue: BioMed Central
Publication date: 14/09/2006

BACKGROUND: The outcome of assessments is determined by the standard-setting method used. There is a wide range of standard-setting methods, and the two used most extensively in undergraduate medical education in the UK are the norm-reference and the criterion-reference methods. The aims of the study were to compare these two standard-setting methods for a multiple-choice question (MCQ) examination and to estimate the test-retest and inter-rater reliability of the modified Angoff method. METHODS: The norm-reference method of standard-setting (mean minus 1 SD) was applied to the 'raw' scores of 78 4th-year medical students on an MCQ examination. Two panels of raters also set the standard using the modified Angoff method for the same MCQ paper on two occasions (6 months apart). We compared the pass/fail rates derived from the norm-reference and the Angoff methods and also assessed the test-retest and inter-rater reliability of the modified Angoff method. RESULTS: The pass rate with the norm-reference method was 85% (66/78) and that with the Angoff method was 100% (78/78). The percentage agreement between the Angoff method and the norm-reference method was 78% (95% CI 69%–87%). The modified Angoff method had an inter-rater reliability of 0.81–0.82 and a test-retest reliability of 0.59–0.74. CONCLUSION: There were significant differences in the outcomes of these two standard-setting methods, as shown by the difference in the proportion of candidates that passed and failed the assessment. The modified Angoff method was found to have good inter-rater reliability and moderate test-retest reliability.
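
The two cut-score rules named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code. The function names are hypothetical, the use of the sample standard deviation is an assumption (the abstract says only "mean minus 1 SD"), and the Angoff aggregation shown, summing the mean rater estimate for each item, is one common form of the modified Angoff method rather than the exact procedure used in the study.

```python
from statistics import mean, stdev


def norm_reference_cut(scores):
    """Norm-reference cut score: cohort mean minus one standard deviation.

    Assumption: sample SD (statistics.stdev); the abstract does not say
    whether the sample or population SD was used.
    """
    return mean(scores) - stdev(scores)


def angoff_cut(ratings):
    """Angoff-style cut score on the raw-score scale.

    ratings[r][i] is rater r's estimate of the probability that a
    borderline candidate answers item i correctly. The cut score is the
    sum over items of the mean estimate across raters (an assumed, common
    aggregation).
    """
    n_items = len(ratings[0])
    return sum(mean(rater[i] for rater in ratings) for i in range(n_items))


def pass_rate(scores, cut):
    """Proportion of candidates scoring at or above the cut score."""
    return sum(s >= cut for s in scores) / len(scores)


def percent_agreement(scores, cut_a, cut_b):
    """Proportion of candidates given the same pass/fail decision by both cuts."""
    return sum((s >= cut_a) == (s >= cut_b) for s in scores) / len(scores)
```

With made-up scores such as `[4, 5, 6, 7, 8]`, `norm_reference_cut` gives a threshold just above 4.4, so the lowest scorer fails by construction. That is the defining property of a norm-referenced standard: some fixed share of the cohort tends to fall below the cut regardless of absolute performance.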

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education

Author: A Van Alphen
B Chopin
BA Fenderson
BD Wright
C Hagquist
CD Kreiter
D Andrich
DL Streiner
G Karabastos
G Rasch
GE Miller
GE Stone
General Medical Council
J Dobby
J Kehoe
J Umar
JC Impara
JM Bland
M Banerji
M Kane
R Hambleton
RD Luce
RF Burton
RK Hambelton
RM Smith
S Alagumalai
SM Case
TA Van Batenburg
V Wass
W Wang
WH Angoff
WJ Popham
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2005

BACKGROUND: As assessment has been shown to direct learning, it is critical that the examinations developed to test clinical competence in medical undergraduates are valid and reliable. The use of extended matching questions (EMQ) has been advocated to overcome some of the criticisms of using multiple-choice questions to test factual and applied knowledge. METHODS: We analysed the results from the Extended Matching Questions Examination taken by 4th-year undergraduate medical students in the academic year 2001 to 2002. Rasch analysis was used to examine three things: whether the set of questions used in the examination mapped on to a unidimensional scale; the degree of difficulty of questions within and between the various medical and surgical specialties; and the pattern of responses within individual questions, to assess the impact of the distractor options. RESULTS: Analysis of a subset of items and of the full examination demonstrated internal construct validity and the absence of bias on the majority of questions. Three main patterns of response selection were identified. CONCLUSION: Modern psychometric methods based upon the work of Rasch provide a useful approach to the calibration and analysis of EMQ undergraduate medical assessments. The approach allows for a formal test of the unidimensionality of the questions and thus the validity of the summed score. Given the metric calibration which follows fit to the model, it also allows for the establishment of item banks to facilitate continuity and equity in exam standards.
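
For readers unfamiliar with the model behind "Rasch analysis", the following minimal Python sketch shows the dichotomous Rasch model, in which the probability of a correct response depends only on the difference between a person's ability and an item's difficulty, both on the same logit scale. The function names are hypothetical, and the abstract does not state which Rasch variant or software was used, so this illustrates the general idea rather than the study's analysis.

```python
import math


def rasch_probability(ability, difficulty):
    """Dichotomous Rasch model: P(correct) = exp(a - d) / (1 + exp(a - d)).

    `ability` and `difficulty` are in logits on a common scale.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def expected_score(ability, difficulties):
    """Expected summed score over a set of items for one person.

    Under the Rasch model the summed score is a sufficient statistic for
    ability, which is why fit to the model supports using the total score.
    """
    return sum(rasch_probability(ability, d) for d in difficulties)
```

When ability equals difficulty the probability is exactly 0.5, and it rises towards 1 as ability exceeds difficulty. Because calibrated item difficulties sit on a common scale, items from different sittings can be compared, which is the basis for the item banks mentioned in the conclusion.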

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Research Repository

White Rose Research Online