Search CORE

34 research outputs found

A Reliability-Generalization Study of Journal Peer Reviews: A Multilevel Meta-Analysis of Inter-Rater Reliability and Its Determinants

Author: A *Marusic
A *Timmer
AA *Montgomery
AC *Justice
AC Weller
AD *Oxman
AP Field
B Thompson
BD Neff
C *Hendrick
C *Hendrick
C *Plug
CD Good
D Lindsey
DV *Cicchetti
DV *Cicchetti
DV *Cicchetti
DV Cicchetti
DV Cicchetti
DW Fiske
F Godlee
GA Lienert
GJ *Whitehurst
GV Glass
H Goldstein
H-D *Daniel
H-D Daniel
HA Herzog
Hans-Dieter Daniel
HC van Houwelingen
HD White
HO *Conn
HR *Rubin
HR *Rubin
HW Marsh
HW Marsh
HW Marsh
HW Marsh
IJ Bateman
IT *Cohen
IT *Cohen
J Strayhorn
J Ziman
JC Bailar
JC Glidewell
JE Hunter
JE Hunter
JE Hunter
JJ Hox
JL Blackburn
JM *Beyer
JM Campanario
JM Campanario
JM LeBreton
JR *Morrow
JR *Scott
JR Cole
JR Landis
K Dickersin
KJ *Kemper
L *Bornmann
L Bornmann
L Howard
L Langfeldt
LK Muthén
LL *Hargens
LPE *van der Steen
Lutz Bornmann
LV Hedges
LV Hedges
M *Bhandari
M *Yadollahie
M Borenstein
M Egger
M Wood
MC LaFollette
ME Falagas
MK Cho
ML *Callaham
P *McReynolds
P Gupta
PH Munley
PM *Rothwell
R DerSimonian
R DerSimonian
R DerSimonian
R Smith
R Snodgrass
RC Little
RE Petty
RL Ebel
RL Goldman
RO *Lempert
RW *Bohannon
Rüdiger Mutz
S *Scarr
S Hemlin
S Hopewell
S Kemp
SA *Kirk
SD Gottfredson
Simon Rogers
SN Beretvas
SS Siegelman
T Eckes
T Vacha-Haase
UW Jayasinghe
V Bakanic
W *Linden
W van den Noortgate
WA *Scott
WL Baker
WR Shadish
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: This paper presents the first meta-analysis for the inter-rater reliability (IRR) of journal peer reviews. IRR is defined as the extent to which two or more independent reviews of the same scientific document agree. Methodology/Principal Findings: Altogether, 70 reliability coefficients (Cohen’s Kappa, intra-class correlation [ICC], and Pearson product-moment correlation [r]) from 48 studies were taken into account in the meta-analysis. The studies were based on a total of 19,443 manuscripts; on average, each study had a sample size of 311 manuscripts (minimum: 28, maximum: 1983). The results of the meta-analysis confirmed the findings of the narrative literature reviews published to date: The level of IRR (mean ICC/r 2 =.34, mean Cohen’s Kappa =.17) was low. To explain the study-to-study variation of the IRR coefficients, meta-regression analyses were calculated using seven covariates. Two covariates that emerged in the metaregression analyses as statistically significant to gain an approximate homogeneity of the intra-class correlations indicated that, firstly, the more manuscripts that a study is based on, the smaller the reported IRR coefficients are. Secondly, if the information of the rating system for reviewers was reported in a study, then this was associated with a smaller IRR coefficient than if the information was not conveyed. Conclusions/Significance: Studies that report a high level of IRR are to be considered less credible than those with a low level o

CiteSeerX

Public Library of Science (PLOS)

Repository for Publications and Research Data

Crossref

Directory of Open Access Journals

PubMed Central