41 research outputs found
ROC curve analyses of eyewitness identification decisions: An analysis of the recent debate
How should the accuracy of eyewitness identification decisions be measured, so that best practices for identification can be determined? This fundamental question is under intense debate. One side advocates for continued use of a traditional measure of identification accuracy, known as the diagnosticity ratio, whereas the other side argues that receiver operating characteristic (ROC) curves should be used instead because diagnosticity is confounded with response bias. Diagnosticity proponents have offered several criticisms of ROCs, which we show are either false or irrelevant to the assessment of eyewitness accuracy. We also show that, like diagnosticity, Bayesian measures of identification accuracy confound response bias with witnesses' ability to discriminate guilty from innocent suspects. ROCs are an essential tool for distinguishing memory-based processes from decisional aspects of a response; simulations of different possible identification tasks and response strategies show that they offer important constraints on theory development.
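The confound at the heart of this debate can be illustrated with a small equal-variance signal-detection sketch (the model and the parameter values here are illustrative assumptions, not taken from the studies discussed): shifting the response criterion alone changes the diagnosticity ratio even though discriminability (d′) stays fixed.

```python
from statistics import NormalDist

Phi = NormalDist().cdf          # standard normal CDF
Phi_inv = NormalDist().inv_cdf  # its inverse

def rates(d_prime, criterion):
    """Hit and false-alarm rates under an equal-variance SDT model."""
    hit = 1 - Phi(criterion - d_prime)  # guilty-suspect ID rate
    fa = 1 - Phi(criterion)             # innocent-suspect ID rate
    return hit, fa

d_prime = 1.5  # fixed discriminability (hypothetical value)
for c in (0.5, 1.0, 1.5):               # increasingly conservative criteria
    hit, fa = rates(d_prime, c)
    diagnosticity = hit / fa             # changes with the criterion alone
    recovered_d = Phi_inv(hit) - Phi_inv(fa)  # constant across criteria
    print(f"c={c}: diagnosticity={diagnosticity:.2f}, d'={recovered_d:.2f}")
```

Sweeping the criterion while holding d′ fixed traces out a single ROC; all three points above lie on the same curve, which is why ROC analysis can separate memory accuracy from response bias while the diagnosticity ratio cannot.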
Estimating the proportion of guilty suspects and posterior probability of guilt in lineups using signal-detection models
Background
The majority of eyewitness lineup studies are laboratory-based. How well the conclusions of these studies, including the relationship between confidence and accuracy, generalize to real-world police lineups is an open question. Signal detection theory (SDT) has emerged as a powerful framework for analyzing lineups that allows comparison of witnesses' memory accuracy under different types of identification procedures. Because the guilt or innocence of a real-world suspect is generally not known, however, it is further unknown precisely how the identification of a suspect should change our belief in their guilt. The probability of guilt after the suspect has been identified, the posterior probability of guilt (PPG), can only be meaningfully estimated if we know the proportion of lineups that include a guilty suspect, P(guilty). Recent work used SDT to estimate P(guilty) on a single empirical data set that shared an important property with real-world data: no information about the guilt or innocence of the suspects was provided. Here we test the ability of the SDT model to recover P(guilty) on a wide range of pre-existing empirical data comprising more than 10,000 identification decisions. We then use simulations of the SDT model to determine the conditions under which the model succeeds and, where applicable, why it fails.
Results
For both empirical and simulated studies, the model was able to accurately estimate P(guilty) when the lineups were fair (the guilty and innocent suspects did not stand out) and identifications of both suspects and fillers occurred with a range of confidence levels. Simulations showed that the model can accurately recover P(guilty) given data that match the model assumptions. The model failed to accurately estimate P(guilty) under conditions that violated its assumptions; for example, when the effective size of the lineup was reduced, either because the fillers were selected to be poor matches to the suspect or because the innocent suspect was more familiar than the guilty suspect. The model also underestimated P(guilty) when a weapon was shown.
Conclusions
Depending on lineup quality, estimation of P(guilty) and, relatedly, PPG from the SDT model can range from poor to excellent. These results highlight the need to carefully consider how the similarity relations between fillers and suspects influence identifications.
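The link between P(guilty) and PPG described in this abstract is Bayes' rule applied to a suspect identification. A minimal sketch, with identification rates and lineup composition that are hypothetical rather than drawn from the study:

```python
def ppg(p_guilty, guilty_id_rate, innocent_id_rate):
    """Posterior probability of guilt given that the suspect was identified.

    p_guilty         -- base rate: proportion of lineups containing a guilty suspect
    guilty_id_rate   -- P(suspect ID | suspect guilty)
    innocent_id_rate -- P(suspect ID | suspect innocent)
    """
    numerator = p_guilty * guilty_id_rate
    denominator = numerator + (1 - p_guilty) * innocent_id_rate
    return numerator / denominator

# Hypothetical fair six-person lineup: an innocent suspect is no more likely
# to be chosen than any filler, so mistaken IDs spread across all six members.
mistaken_pick_rate = 0.30
innocent_suspect_rate = mistaken_pick_rate / 6  # = 0.05
print(ppg(0.5, 0.70, innocent_suspect_rate))    # ≈ 0.93
```

The sketch also makes the abstract's dependence on lineup fairness concrete: if the innocent suspect stands out and attracts more than a fair share of mistaken picks, `innocent_id_rate` rises and the posterior probability of guilt falls.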
Assessing Theoretical Conclusions With Blinded Inference to Investigate a Potential Inference Crisis
Scientific advances across a range of disciplines hinge on the ability to make inferences about unobservable theoretical entities on the basis of empirical data patterns. Accurate inferences rely on both discovering valid, replicable data patterns and accurately interpreting those patterns in terms of their implications for theoretical constructs. The replication crisis in science has led to widespread efforts to improve the reliability of research findings, but comparatively little attention has been devoted to the validity of inferences based on those findings. Using an example from cognitive psychology, we demonstrate a blinded-inference paradigm for assessing the quality of theoretical inferences from data. Our results reveal substantial variability in experts' judgments on the very same data, hinting at a possible inference crisis.
Are There Two Kinds of Reasoning?
Two experiments addressed the issue of how deductive reasoning and inductive reasoning are related. According to the criterion-shift account, these two kinds of reasoning assess arguments along a common scale of strength; however, there is a stricter criterion for saying an argument is deductively correct than for saying it is merely inductively strong. The method, adapted from Rips (2001), was to give two groups of participants the same set of written arguments but with either deduction or induction instructions. Signal detection and receiver operating characteristic analyses showed that the difference between conditions could not be explained in terms of a criterion shift. Instead, the deduction condition showed greater sensitivity to argument strength than did the induction condition. Implications for two-process and one-process accounts of reasoning, and relations to memory research, are discussed.
Sources of Bias in the Goodman-Kruskal Gamma Coefficient Measure of Association: Implications for Studies of Metacognitive Processes
In many cognitive, metacognitive, and perceptual tasks, measurement of performance or prediction accuracy may be influenced by response bias. Signal detection theory provides a means of assessing discrimination accuracy independent of such bias, but its application crucially depends on distributional assumptions. The Goodman-Kruskal gamma coefficient, G, has been proposed as an alternative means of measuring accuracy that is free of distributional assumptions. This measure is widely used with tasks that assess metamemory or metacognition performance. We demonstrate that the empirically determined value of G systematically deviates from its actual value under realistic conditions. We introduce a distribution-specific variant of G, called Gc, to show why this bias arises. Our findings imply that caution is needed when using G as a measure of accuracy, and alternative measures are recommended. "Our belief is that each scientific area that has use for measures of association should, after appropriate argument and trial, settle down on those measures most useful for its needs." – Goodman and Kruskal (1954, p. 763)
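For reference, G itself is straightforward to compute from paired ratings. A sketch (the confidence-accuracy data here are invented for illustration) that also shows one way ties interact with the measure:

```python
from itertools import combinations

def goodman_kruskal_gamma(xs, ys):
    """Goodman-Kruskal G: (concordant - discordant) / (concordant + discordant).

    Pairs tied on either variable are excluded entirely, which is part of
    what makes G appear distribution-free.
    """
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        s = (x1 - x2) * (y1 - y2)
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / (concordant + discordant)

# Hypothetical confidence ratings (1-4) paired with accuracy (0 = wrong, 1 = right):
confidence = [1, 2, 2, 3, 4, 4]
accuracy = [0, 0, 1, 1, 1, 1]
print(goodman_kruskal_gamma(confidence, accuracy))  # 1.0
```

Note that G comes out at its ceiling of 1.0 here even though confidence level 2 occurred with both correct and incorrect responses: that pair is tied on confidence and is simply dropped, illustrating how the treatment of ties can make the empirical value of G diverge from the underlying accuracy of the judgments.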
Memory & Cognition
Correspondence concerning this article should be sent to either C. M. Rotello, Department of Psychology, Box 37710, University of Massachusetts, Amherst, MA 01003-7710 (e-mail: [email protected]) or E. Heit, Department of Psychology, University of Warwick, Coventry CV4 7AL, England (e-mail: [email protected]).