140 research outputs found

Empirical Analysis of Factors Affecting Confirmation Bias Levels of Software Engineers

Author: Ayse Bener
BE Teasley
D Kahneman
DJC MacKay
G Calikli
Gul Calikli
GV Glass
H Erdogmus
H Garavan
HJ Einhorn
J Borkowski
J Murray
JR Cox
JSBT Evans
JV Bradley
KI Manktelow
L Cosmides
M Knauff
MH Kutner
NV Dawson
NW Hirschi
PC Wason
PD Allison
PN Johnson-Laird
PW Cheng
R Tarling
RA Griggs
SJ Hoch
SL Jackson
TD Cook
W Stacy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2015

Confirmation bias is defined as the tendency of people to seek evidence that verifies a hypothesis rather than seeking evidence to falsify it. Due to confirmation bias, defects may be introduced in a software product during the requirements analysis, design, implementation and/or testing phases. For instance, testers may exhibit confirmatory behavior in the form of a tendency to make the code run rather than employing a strategic approach to make it fail. As a result, most of the defects that have been introduced in the earlier phases of software development may be overlooked, leading to an increase in software defect density. In this paper, we quantify confirmation bias levels in terms of a single derived metric. However, the main focus of this paper is the analysis of factors affecting confirmation bias levels of software engineers. Identification of these factors can guide project managers to circumvent negative effects of confirmation bias, as well as providing guidance for the recruitment and effective allocation of software engineers. In this empirical study, we observed low confirmation bias levels among participants with logical reasoning and hypothesis testing skills.

Crossref

Open Research Online

Enlighten

The Search for Invariance: Repeated Positive Testing Serves the Goals of Causal Learning

Author: A Coenen
A Gopnik
A Karmiloff-Smith
AM Johnston
B Inhelder
B Schwartz
B Sodian
B Weslake
C Cook
C Hitchcock
C Zimmerman
CM Walker
CRM McKenzie
D Klahr
D Kuhn
D Lewis
DD Tukey
DJ Navarro
EB Bonawitz
GD Heyman
GL Wells
HJ Einhorn
J Baron
J Friedrich
J Klayman
J Woodward
JE Tschirgi
JJ Gibson
JL Mackie
JR Saffran
K Dunbar
KS Kendler
L Schauble
M Friedman
M Oaksford
M Redhead
M Strevens
ME Gorman
MJ Mahoney
N Valanides
N Vasilyeva
NE Wetherick
P Kitcher
P Ylikoski
PC Wason
PG Devine
PN Johnson-Laird
R Vogel
R Wu
RB Skov
RD Tweney
RS Nickerson
RS Siegler
S Carey
S Croker
SA Gelman
SA Siler
SA Sloman
SC Yang
T Blanchard
T Gerstenberg
T Lombrozo
TF Icard
TJP van Schijndel
Publication venue: eScholarship, University of California
Publication date: 01/01/2020

Positive testing is characteristic of exploratory behavior, yet it seems to be at odds with the aim of information seeking. After all, repeated demonstrations of one’s current hypothesis often produce the same evidence and fail to distinguish it from potential alternatives. Research on the development of scientific reasoning and adult rule learning has both documented and attempted to explain this behavior. The current chapter reviews this prior work and introduces a novel theoretical account, the Search for Invariance (SI) hypothesis, which suggests that producing multiple positive examples serves the goals of causal learning. This hypothesis draws on the interventionist framework of causal reasoning, which suggests that causal learners are concerned with the invariance of candidate hypotheses. In a probabilistic and interdependent causal world, our primary goal is to determine whether, and in what contexts, our causal hypotheses provide accurate foundations for inference and intervention, not to disconfirm their alternatives. By recognizing the central role of invariance in causal learning, the phenomenon of positive testing may be reinterpreted as a rational information-seeking strategy.

Crossref

eScholarship - University of California

Type I error rates of multi-arm multi-stage clinical trials: strong control and impact of intermediate outcomes

Author: B Choodari-Oskooei
B Freidlin
Babak Choodari-Oskooei
CU Kunz
CW Dunnett
D Magirr
Daniel J. Bratton
DJ Bratton
DR Cohen
FMS Barthel
J Wason
JM Wason
JMS Wason
MA Proschan
Mahesh K. B. Parmar
MD Hughes
MK Parmar
MR Sydes
P Royston
Patrick P. J. Phillips
PC O’Brien
PPJ Phillips
SJ Pocock
T Jaki
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/07/2016

BACKGROUND: The multi-arm multi-stage (MAMS) design described by Royston et al. [Stat Med. 2003;22(14):2239-56 and Trials. 2011;12:81] can accelerate treatment evaluation by comparing multiple treatments with a control in a single trial and stopping recruitment to arms not showing sufficient promise during the course of the study. To increase efficiency further, interim assessments can be based on an intermediate outcome (I) that is observed earlier than the definitive outcome (D) of the study. Two measures of type I error rate are often of interest in a MAMS trial. Pairwise type I error rate (PWER) is the probability of recommending an ineffective treatment at the end of the study regardless of other experimental arms in the trial. Familywise type I error rate (FWER) is the probability of recommending at least one ineffective treatment and is often of greater interest in a study with more than one experimental arm. METHODS: We demonstrate how to calculate the PWER and FWER when the I and D outcomes in a MAMS design differ. We explore how each measure varies with respect to the underlying treatment effect on I and show how to control the type I error rate under any scenario. We conclude by applying the methods to estimate the maximum type I error rate of an ongoing MAMS study and show how the design might have looked had it controlled the FWER under any scenario. RESULTS: The PWER and FWER converge to their maximum values as the effectiveness of the experimental arms on I increases. We show that both measures can be controlled under any scenario by setting the pairwise significance level in the final stage of the study to the target level. In an example, controlling the FWER is shown to increase considerably the size of the trial although it remains substantially more efficient than evaluating each new treatment in separate trials. 
CONCLUSIONS: The proposed methods allow the PWER and FWER to be controlled in various MAMS designs, potentially increasing the uptake of the MAMS design in practice. The methods are also applicable in cases where the I and D outcomes are identical.
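
The relation between the pairwise and familywise error rates defined in this abstract can be illustrated with a rough sketch. It is not the paper's method: it assumes, purely for illustration, that the K pairwise comparisons are independent, which does not hold in a real MAMS trial where the arms share a control group. The Bonferroni bound is included because it holds under any dependence. The function names are hypothetical.

```python
def fwer_if_independent(pwer: float, arms: int) -> float:
    """FWER when each of `arms` comparisons has type I error `pwer`.

    ASSUMPTION: comparisons are independent (illustrative only; MAMS
    comparisons share a control arm and are therefore correlated).
    """
    return 1.0 - (1.0 - pwer) ** arms


def fwer_bonferroni_bound(pwer: float, arms: int) -> float:
    """Upper bound on the FWER that is valid under any dependence."""
    return min(1.0, pwer * arms)


if __name__ == "__main__":
    for k in (1, 2, 3, 5):
        print(k, fwer_if_independent(0.025, k), fwer_bonferroni_bound(0.025, k))
```

With a single experimental arm the two measures coincide; with more arms the familywise rate grows, which is why controlling it tends to require a larger trial.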

Crossref

Springer - Publisher Connector

UCL Discovery

PubMed Central

eScholarship - University of California

Decision-Making in Research Tasks with Sequential Testing

Author: A Rzhetsky
A Tatsioni
Alan Ruttenberg
Anna Dreber
AR Palmer
C Howson
C Zimmerman
D Kahneman
David G. Rand
DV Lindley
GL Wells
H Campbell
JD Nelson
JP Ioannidis
LM Slowiaczek
LR Anderson
M Henrion
PC Wason
R Hanson
R Hoffmann
RD Csada
S Bikhchanidani
SN Goodman
T Gilovich
T Pfeiffer
Thomas Pfeiffer
W Edwards
Publication venue: Public Library of Science
Publication date: 25/02/2009

Background: In a recent controversial essay, published by JPA Ioannidis in PLoS Medicine, it has been argued that in some research fields, most of the published findings are false. Based on theoretical reasoning it can be shown that small effect sizes, error-prone tests, low priors of the tested hypotheses and biases in the evaluation and publication of research findings increase the fraction of false positives. These findings raise concerns about the reliability of research. However, they are based on a very simple scenario of scientific research, where single tests are used to evaluate independent hypotheses. Methodology/Principal Findings: In this study, we present computer simulations and experimental approaches for analyzing more realistic scenarios. In these scenarios, research tasks are solved sequentially, i.e. subsequent tests can be chosen depending on previous results. We investigate simple sequential testing and scenarios where only a selected subset of results can be published and used for future rounds of test choice. Results from computer simulations indicate that for the tasks analyzed in this study, the fraction of false findings among the positive findings declines over several rounds of testing if the most informative tests are performed. Our experiments show that human subjects frequently perform the most informative tests, leading to a decline of false positives as expected from the simulations. Conclusions/Significance: For the research tasks studied here, findings tend to become more reliable over time. We also find that performance in those experimental settings where not all performed tests could be published turned out to be surprisingly inefficient. Our results may help optimize existing procedures used in the practice of scientific research and provide guidance for the development of novel forms of scholarly communication.
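
The single-test scenario mentioned in this abstract can be sketched with Bayes' rule. This is a minimal illustration, not the simulation used in the study: it assumes each test has a fixed power and false-positive rate, and that a hypothesis is simply retested after each positive result. The function names and the numeric values are hypothetical.

```python
def false_fraction(prior: float, power: float, alpha: float) -> float:
    """Fraction of positive findings that are false for a single test.

    prior: probability that a tested hypothesis is true
    power: probability of a positive result when it is true
    alpha: probability of a positive result when it is false
    """
    true_pos = prior * power
    false_pos = (1.0 - prior) * alpha
    return false_pos / (true_pos + false_pos)


def false_fraction_over_rounds(prior, power, alpha, rounds):
    """Retest after every positive result (ASSUMED scheme) and track
    the fraction of false findings among positives in each round."""
    fractions = []
    for _ in range(rounds):
        frac = false_fraction(prior, power, alpha)
        fractions.append(frac)
        prior = 1.0 - frac  # posterior after a positive becomes the new prior
    return fractions


if __name__ == "__main__":
    print(false_fraction_over_rounds(prior=0.1, power=0.8, alpha=0.05, rounds=3))
```

Lowering the prior or the power, or raising the false-positive rate, increases the first value; repeated testing drives the fraction down, in line with the qualitative trend the abstract reports.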

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

How logical reasoning mediates the relation between lexical quality and reading comprehension

Author: A Protopapas
AC Graesser
AF Hayes
C Bowyer-Crane
C Perfetti
C Reverberi
C Shikishima
CA Perfetti
Eliane Segers
G Staphorsius
GR Kuperberg
HP Osana
J Raven
JG Cromley
JSB Evans
JV Oakhill
K Cain
L Verhoeven
Ludo Verhoeven
MM Monti
PC Wason
PN Johnson-Laird
R Thurlow
S Siddiqui
VJ Haars
W Kintsch
WA Hoover
Publication venue: 'Springer Science and Business Media LLC'
Publication date

Crossref

Rationality and the experimental study of reasoning

Author: A. Tversky
B Rumain
D Hilton
D Kahneman
D Sperber
DE Dulany
FH Poletiek
G Politzer
HA Simon
HP Grice
I Begg
J Baratgin
J-B Henst
J-L Stilgenbauer
JJ Koehler
JSBT Evans
K Kotovsky
KE Stanovich
LJ Rips
MDS Braine
N Schwarz
O Ducrot
PC Wason
PN Johnson-Laird
SE Newstead
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004

A survey of the results obtained during the past three decades in some of the most widely used tasks and paradigms in the experimental study of reasoning is presented. It is shown that, at first sight, human performance suffers from serious shortcomings. However, once the problems of communication between experimenter and subject are taken into account, which helps to clarify the subject's representation of the tasks, one observes a better performance, although still far from perfect. Current theories of reasoning, of which the two most prominent are very briefly outlined, agree in identifying the load in working memory as the main source of limitation in performance. Finally, a recent view on human rationality prompted by the foregoing results is described.

Crossref

Archive Electronique - Institut Jean Nicod

Bayesian inference for the information gain model

Author: A Gelman
AW van der Vaart
B Efron
C-F Sheu
D Gamerman
D Lunn
Denny Borsboom
DJ Lunn
DR Cavagnaro
DV Lindley
E-J Wagenmakers
Eric-Jan Wagenmakers
F Liang
H Jeffreys
IJ Myung
JD Nelson
JK Kruschke
JL Hintze
K Oberauer
K Stenning
KC Klauer
M Hattori
M Oaksford
N Chater
PC Wason
R Wetzels
RE Kass
SJ Dennis
Sven Stringer
WR Gilks
Publication venue: Springer-Verlag
Publication date: 01/01/2011

One of the most popular paradigms for studying human reasoning involves the Wason card selection task. In this task, the participant is presented with four cards and a conditional rule (e.g., “If there is an A on one side of the card, there is always a 2 on the other side”). Participants are asked which cards should be turned to verify whether or not the rule holds. In this simple task, participants consistently provide answers that are incorrect according to formal logic. To account for these errors, several models have been proposed, one of the most prominent being the information gain model (Oaksford & Chater, Psychological Review, 101, 608–631, 1994). This model is based on the assumption that people independently select cards based on the expected information gain of turning a particular card. In this article, we present two estimation methods to fit the information gain model: a maximum likelihood procedure (programmed in R) and a Bayesian procedure (programmed in WinBUGS). We compare the two procedures and illustrate the flexibility of the Bayesian hierarchical procedure by applying it to data from a meta-analysis of the Wason task (Oaksford & Chater, Psychological Review, 101, 608–631, 1994). We also show that the goodness of fit of the information gain model can be assessed by inspecting the posterior predictives of the model. These Bayesian procedures make it easy to apply the information gain model to empirical data. Supplemental materials may be downloaded along with this article from www.springerlink.com.
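
The formally correct answer to the selection task described in this abstract can be sketched as follows. This shows only the formal-logic benchmark, not the information gain model or the fitting procedures from the article. The four visible faces (A, K, 2, 7) are an assumed example set, and the function name is hypothetical.

```python
def cards_to_turn(visible_faces, antecedent="A", consequent="2"):
    """Return the cards that could falsify 'if antecedent then consequent'.

    Each card has a letter on one side and a number on the other.
    A card can falsify the rule only if it might pair the antecedent
    letter with a number other than the consequent.
    """
    picks = []
    for face in visible_faces:
        if face.isalpha():
            # A visible letter matters only when it is the antecedent.
            if face == antecedent:
                picks.append(face)
        else:
            # A visible number matters only when it is NOT the consequent,
            # because its hidden letter could be the antecedent.
            if face != consequent:
                picks.append(face)
    return picks


if __name__ == "__main__":
    print(cards_to_turn(["A", "K", "2", "7"]))
```

Under these assumptions the logically required choices are the antecedent card and the card showing a number other than the consequent; turning the consequent card cannot falsify the rule.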

International Migration, Integration and Social Cohesion online publications

Of Black Swans and Tossed Coins: Is the Description-Experience Gap in Risky Choice Limited to Rare Events?

Author: A Bechara
A Kühberger
A Tversky
Angela Sirigu
AR Camilleri
AR Damasio
B De Martino
B Figner
B Marsh
C Camerer
C Ungemach
D Kahneman
Elliot A. Ludvig
EU Weber
F Strack
G Barron
G Lowenstein
J Cohen
JD Cohen
Kirman
M Bateson
M Baucells
Marcia L. Spetch
P Slovic
PC Wason
R Hau
R Hertwig
R McCloy
SA Huettel
T Gärling
T Rakow
Publication venue: Public Library of Science
Publication date: 01/01/2011

When faced with risky decisions, people tend to be risk averse for gains and risk seeking for losses (the reflection effect). Studies examining this risk-sensitive decision making, however, typically ask people directly what they would do in hypothetical choice scenarios. A recent flurry of studies has shown that when these risky decisions include rare outcomes, people make different choices for explicitly described probabilities than for experienced probabilistic outcomes. Specifically, rare outcomes are overweighted when described and underweighted when experienced. In two experiments, we examined risk-sensitive decision making when the risky option had two equally probable (50%) outcomes. For experience-based decisions, there was a reversal of the reflection effect with greater risk seeking for gains than for losses, as compared to description-based decisions. This fundamental difference in experienced and described choices cannot be explained by the weighting of rare events and suggests a separate subjective utility curve for experience.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

A short educational intervention diminishes causal illusions and specific paranormal beliefs in undergraduates

Author: A Costa
AL Alter
AR Harkness
B Keysar
B Mellers
BR Forer
C Impey
CR Snyder
D Broockman
DH Dickson
DH Phua
DL Hamilton
E Pronin
EA Wasserman
EJ Wagenmakers
Elisabet Tubau
F Blanco
G Smedslund
H Matute
H Song
Helena Matute
HR Arkes
I Barberia
Itxaso Barberia
J Klayman
J Rotton
J Tobacyk
Javier Rodríguez-Ferreiro
JC Perales
JJ Tobacyk
José César Perales
L Díaz-Vilela
LB Alloy
LG Allan
M Lindeman
MJ Buehner
P Croskerry
PC Wason
R De Raedt
R Schmaltz
RM Msetfi
RP Larrick
RS Nickerson
S Brooks
SD Hannah
SO Lilienfeld
WBJ Swann
Y Bar-Haim
Ł Gawęda
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 23/04/2018

Cognitive biases such as causal illusions have been related to paranormal and pseudoscientific beliefs and, thus, pose a real threat to the development of adequate critical thinking abilities. We aimed to reduce causal illusions in undergraduates by means of an educational intervention combining training-in-bias and training-in-rules techniques. First, participants directly experienced situations that tend to induce the Barnum effect and the confirmation bias. Thereafter, these effects were explained and examples of their influence over everyday life were provided. Compared to a control group, participants who received the intervention showed diminished causal illusions in a contingency learning task and a decrease in the precognition dimension of a paranormal belief scale. Overall, results suggest that evidence-based educational interventions like the one presented here could be used to significantly improve critical thinking skills in our students.

Crossref

Directory of Open Access Journals

Diposit Digital de la Universitat de Barcelona

Chess databases as a research vehicle in psychology : modeling large data

Author: A Gelman
A Kiesel
A Newell
AD de Groot
AE Elo
AS Luchins
BD Marx
CE Shannon
CF Chabris
D Bates
D Hofstadter
DZ Hambrick
E Keuleers
EM Reingold
F Gobet
F de Saussure
G Campitelli
G Rubinstein
GM Joseph
HA Simon
HB Richman
I Fooken
J Baker
J Pinheiro
J Radanović
JA Sloboda
JF Voss
JH Holland
JH Moxley
JM Schraagen
KA Ericsson
KJ Preacher
KO Mason
M Bilalić
M Knapp
Merim Bilalić
MH Connors
N Charness
N Vaci
ND Glenn
Nemanja Vaci
P Chassy
PC Lane
PC Wason
R Development Core Team
R Gaschler
RH Baayen
RW Howard
RW Roring
S Vollstädt-Klein
SE Maxwell
SN Wood
T Shallice
T Stafford
TJ Hastie
TS Kuhn
WG Chase
Y Fang
Y Gong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2016

The game of chess has often been used for psychological investigations, particularly in cognitive science. The clear-cut rules and well-defined environment of chess provide a model for investigations of basic cognitive processes, such as perception, memory, and problem solving, while the precise rating system for the measurement of skill has enabled investigations of individual differences and expertise-related effects. In the present study, we focus on another appealing feature of chess, namely the large archive databases associated with the game. The German national chess database presented in this study represents a fruitful ground for the investigation of multiple longitudinal research questions, since it collects the data of over 130,000 players and spans over 25 years. The German chess database collects the data of all players, including hobby players, and all tournaments played. This results in a rich and complete collection of the skill, age, and activity of the whole population of chess players in Germany. The database therefore complements the commonly used expertise approach in cognitive science by opening up new possibilities for the investigation of multiple factors that underlie expertise and skill acquisition. Since large datasets are not common in psychology, their introduction also raises the question of optimal and efficient statistical analysis. We offer the database for download and illustrate how it can be used by providing concrete examples and a step-by-step tutorial using different statistical analyses on a range of topics, including skill development over the lifetime, birth cohort effects, effects of activity and inactivity on skill, and gender differences.

Northumbria University Research Portal

Crossref

Springer - Publisher Connector

White Rose Research Online