Search CORE

87 research outputs found

Fast and flexible: Human program induction in abstract reasoning tasks

Author: Gureckis Todd M.
Johnson Aysja
Lake Brenden M.
Vong Wai Keen
Publication venue
Publication date: 01/01/2021
Field of study

The Abstraction and Reasoning Corpus (ARC) is a challenging program induction dataset that was recently proposed by Chollet (2019). Here, we report the first set of results collected from a behavioral study of humans solving a subset of tasks from ARC (40 out of 1000). Although this subset of tasks contains considerable variation, our results showed that humans were able to infer the underlying program and generate the correct test output for a novel test input example, with an average of 80% of tasks solved per participant, and with 65% of tasks being solved by more than 80% of participants. Additionally, we find interesting patterns of behavioral consistency and variability within the action sequences during the generation process, the natural language descriptions to describe the transformations for each task, and the errors people made. Our findings suggest that people can quickly and reliably determine the relevant features and properties of a task to compose a correct solution. Future modeling work could incorporate these findings, potentially by connecting the natural language descriptions we collected here to the underlying semantics of ARC.Comment: 7 pages, 7 figures, 1 tabl

arXiv.org e-Print Archive

eScholarship - University of California

Evaluating Amazon\u27s Mechanical Turk as a Tool for Experimental Behavioral Research

Author: Crump Matthew J. C.
Gureckis Todd M.
McDonnell John V.
Publication venue: CUNY Academic Works
Publication date: 13/03/2013
Field of study

Amazon Mechanical Turk (AMT) is an online crowdsourcing service where anonymous online workers complete web-based tasks for small sums of money. The service has attracted attention from experimental psychologists interested in gathering human subject data more efficiently. However, relative to traditional laboratory studies, many aspects of the testing environment are not under the experimenter\u27s control. In this paper, we attempt to empirically evaluate the fidelity of the AMT system for use in cognitive behavioral experiments. These types of experiment differ from simple surveys in that they require multiple trials, sustained attention from participants, comprehension of complex instructions, and millisecond accuracy for response recording and stimulus presentation. We replicate a diverse body of tasks from experimental psychology including the Stroop, Switching, Flanker, Simon, Posner Cuing, attentional blink, subliminal priming, and category learning tasks using participants recruited using AMT. While most of replications were qualitatively successful and validated the approach of collecting data anonymously online using a web-browser, others revealed disparity between laboratory results and online results. A number of important lessons were encountered in the process of conducting these replications that should be of value to other researchers

City University of New York

Learning Categories From an Intermittent Teacher

Author: Gureckis Todd M.
McDonnell John V.
Publication venue: 'American Psychological Association (APA)'
Publication date: 01/01/2011
Field of study

Crossref

eScholarship - University of California

Broken physics:A conjunction fallacy effect in intuitive physical reasoning

Author: Bramley Neil R
Davis Ernest
Gureckis Todd M.
Ludwin-Peery Ethan
Publication venue: 'SAGE Publications'
Publication date: 01/12/2020
Field of study

Edinburgh Research Explorer

Intuitive experimentation in the physical world

Author: Bramley Neil R.
Gerstenberg Tobias
Gureckis Todd M.
Tenenbaum Joshua B.
Publication venue: 'Elsevier BV'
Publication date: 01/09/2018
Field of study

Crossref

Edinburgh Research Explorer

Recommended from our members

Measuring category intuitiveness in unconstrained categorization tasks

Author: Akaike
Amotz Perlman
Anderson
Ashby
Ashby
Ashby
Barrett
Billman
Brown
Chapman
Chater
Colreavy
Compton
Compton
Corter
Darren J. Edwards
Demetras
Elman
Emmanuel M. Pothos
Estes
Feldman
Feldman
Fiser
Gopnik
Gosselin
Gureckis
Hahn
Hampton
Handel
Handel
Handel
Heller
Hines
John V. McDonnell
Johnson
Jones
Ken Kurtz
Kurtz
Love
Malt
Malt
Mareschal
Medin
Medin
Medin
Medin
Mervis
Milton
Milton
Minda
Morgan
Murphy
Murphy
Murphy
Nelson
Nelson
Nosofsky
Nosofsky
Peter Hines
Pitt
Pothos
Pothos
Pothos
Quinn
Rand
Reber
Regehr
Rips
Rosch
Sanborn
Schyns
Smith
Stewart
Todd M. Bailey
Vanpaemel
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

What makes a category seem natural or intuitive? In this paper, an unsupervised categorization task was employed to examine observer agreement concerning the categorization of nine different stimulus sets. The stimulus sets were designed to capture different intuitions about classification structure. The main empirical index of category intuitiveness was the frequency of the preferred classification, for different stimulus sets. With 169 participants, and a within participants design, with some stimulus sets the most frequent classification was produced over 50 times and with others not more than two or three times. The main empirical finding was that cluster tightness was more important in determining category intuitiveness, than cluster separation. The results were considered in relation to the following models of unsupervised categorization: DIVA, the rational model, the simplicity model, SUSTAIN, an Unsupervised version of the Generalized Context Model (UGCM), and a simple geometric model based on similarity. DIVA, the geometric approach, SUSTAIN, and the UGCM provided good, though not perfect, fits. Overall, the present work highlights several theoretical and practical issues regarding unsupervised categorization and reveals weaknesses in some of the corresponding formal models

City Research Online

Crossref

Online Research @ Cardiff

Cronfa at Swansea University

Navigating through abstract decision spaces: Evaluating the role of state generalization in a dynamic decision-making task

Author: A. Ross Otto
A. W. Siegel
Arthur B. Markman
B. A. Cartwright
B. J. Stankiewicz
Bradley C. Love
C. J. Warry
H. Neth
H. Rachlin
I. Erev
J. Myerson
J. O’Keefe
N. D. Daw
R. Bogacz
R. J. Herrnstein
R. J. Tunney
R. Sutton
S. D. Whitehead
Todd M. Gureckis
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref