3 research outputs found
Peer assessment using comparative and absolute judgement
Peer assessment exercises yield varied reliability and validity. To maximise reliability and validity, the literature recommends adopting various design principles including the use of explicit assessment criteria. Counter to this literature, we report a peer assessment exercise in which criteria were deliberately avoided yet acceptable reliability and validity were achieved. Based on this finding, we make two arguments. First, the comparative judgement approach adopted can be applied successfully in different contexts, including higher education and secondary school. Second, the success was due to this approach; an alternative technique based on absolute judgement yielded poor reliability and validity. We conclude that sound outcomes are achievable without assessment criteria, but success depends on how the peer assessment activity is designed
An investigation of the response time for maths items in a computer adaptive test
An important advantage of computer based testing over conventional paper
and pencil based testing is that the response time to items from test takers
can be accurately recorded for subsequent analysis. This study investigates
the response time for maths items in a computer adaptive test designed as a
baseline assessment for pupils aged from 11 to 18 in the UK. The results
showed that the response time for all the items in the test generally increases
with item difficulty, although significant variability exists. The item difficulty
levels and the age and ability of test takers have significant influence on item
response time
Fifty years of A-level mathematics: have standards changed?
Advanced-level (A-level) mathematics is a high-profile qualification taken by many
school leavers in England, Wales, Northern Ireland and around the world as
preparation for university study. Concern has been expressed in these countries that
standards in A-level mathematics have declined over time, and that school leavers
enter university or the workplace lacking the required mathematical knowledge and
skills. The situation in England, Wales and Northern Ireland reflects more general
international concerns about decreasing educational standards. However, evidence to
support this concern has been of limited scope, rarely subjected to peer-review, and of
questionable validity. Our study overcame the limitations of previous research into
standards over time by applying a comparative judgement technique that enabled the
direct comparison of mathematical performance across different examinations.
Furthermore, unlike previous research, all examination questions were re-typeset and candidate responses rewritten to reduce bias arising from surface cues. Using this
technique, mathematics experts judged A-level scripts from the 1960s, 1990s and the
current decade. We report that the experts believed current A-level mathematics
standards to have declined since the 1960s, although there was no evidence that they
believed standards have declined since the 1990s. We contrast our findings with those
from previous comparison studies and consider implications for future research into
standards over time