3 research outputs found

Many Labs 2: Investigating Variation in Replicability Across Samples and Settings

Author: Adams Byron G
Adams Reginald B
Alper Sinan
Aveyard Mark
Axt Jordan R
Babalola Mayowa T
Bahník Štěpán
Batra Rishtee
Berkics Mihály
Bernstein Michael J
Berry Daniel R
Bialobrzeska Olga
Binan Evans Dami
Bocian Konrad
Brandt Mark J
Busching Robert
Cai Huajian
Cambier Fanny
Cantarero Katarzyna
Carmichael Cheryl L
Ceric Francisco
Chandler Jesse
Chang Jen-Ho
Chatard Armand
Chen Eva E
Cheong Winnee
Cicero David C
Coen Sharon
Coleman Jennifer A
Collisson Brian
Conway Morgan A
Corker Katherine S
Curran Paul G
Cushman Fiery
Dagona Zubairu K
Dalgar Ilker
Dalla Rosa Anna
Davis William E
de Bruijn Maaike
De Schutter Leander
de Vries Marieke
Devos Thierry
Diego Vega Luis
Dozo Nerisa
Doğulu Canay
Dukes Kristin Nicole
Dunham Yarrow
Durrheim Kevin
Ebersole Charles R
Edlund John E
Eller Anja
English Alexander Scott
Finck Carolyn
Frankowska Natalia
Freyre Miguel-Ángel
Friedman Mike
Galliani Elisa Maria
Gandi Joshua C
Ghoshal Tanuka
Giessner Steffen R
Gill Tripat
Gnambs Timo
González Roberto
Graham Jesse
Grahe Jon E
Grahek Ivan
Green Eva GT
Gómez Ángel
Hai Kakul
Haigh Matthew
Haines Elizabeth L
Hall Michael P
Hasselman Fred
Heffernan Marie E
Hicks Joshua A
Houdek Petr
Huntsinger Jeffrey R
Huynh Ho Phi
Ijzerman Hans
Inbar Yoel
Innes-Ker Åse H
Jiménez-Leal William
John Melissa-Sue
Joy-Gaba Jennifer A
Kamiloğlu Roza G
Kappes Heather Barry
Karabati Serdar
Karick Haruna
Keller Victor N
Kende Anna
Kervyn Nicolas
Klein Richard A
Knežević Goran
Kovacs Carrie
Krueger Lacy E
Kurapov German
Kurtz Jamie
Lakens Daniël
Lazarević Ljiljana B
Lee Nichols Austin
Levitan Carmel A
Lewis Jr. Neil A
Lins Samuel
Lipsey Nikolette P
Losee Joy E
Maassen Esther
Maitner Angela T
Malingumu Winfrida
Mallett Robyn K
Marotta Satia A
Mena-Pacheco Fernando
Međedović Janko
Milfont Taciano L
Morris Wendy L
Murphy Sean C
Myachykov Andriy
Neave Nick
Neijenhuijs Koen
Nelson Anthony J
Neto Félix
Nosek Brian A
Ocampo Aaron
Oikawa Haruka
Oikawa Masanori
Ong Elsie
Orosz Gábor
Osowiecka Malgorzata
O’Donnell Susan L
Packard Grant
Petrović Boban
Pilati Ronaldo
Pinter Brad
Podesta Lysandra
Pogge Gabrielle
Pollmann Monique MH
Pérez-Sánchez Rolando
Rutchick Abraham M
Rédei Anna Cabak
Saavedra Patricio
Saeri Alexander K
Salomon Erika
Schmidt Kathleen
Schönbrodt Felix D
Sekerdej Maciej B
Sirlopú David
Skorinko Jeanine LM
Smith Michael A
Smith-Castro Vanessa
Smolders Karin CHJ
Sobkow Agata
Sowden Walter
Spachtholz Philipp
Srivastava Manini
Steiner Troy G
Stouten Jeroen
Street Chris NH
Sundfelt Oskar K
Szeto Stephanie
Szumowska Ewa
Tang Andrew CW
Tanzer Norbert
Tear Morgan J
Theriault Jordan
Thomae Manuela
Torres David
Traczyk Jakub
Tybur Joshua M
Ujhelyi A
Ujhelyi Adrienn
van Aert Robbie CM
van Assen Marcel ALM
van der Hulst Marije
van Lange Paul AM
van ’t Veer Anna Elisabeth
Vaughn Leigh Ann
Verniers Catherine
Verschoor Mark
Vianello Michelangelo
Voermans Ingrid PJ
Vranka Marek A
Vásquez-Echeverría Alejandro
Vázquez Alexandra
Welch Cheryl
Wichman Aaron L
Williams Lisa A
Wood Michael
Woodzicka Julie A
Wronska Marta K
Young Liane
Zelenski John M
Zhijia Zeng
Publication venue: 'SAGE Publications'
Publication date: 01/01/2018

We conducted preregistered replications of 28 classic and contemporary published findings, with protocols that were peer reviewed in advance, to examine variation in effect magnitudes across samples and settings. Each protocol was administered to approximately half of 125 samples that comprised 15,305 participants from 36 countries and territories. Using the conventional criterion of statistical significance (p < .05), we found that 15 (54%) of the replications provided evidence of a statistically significant effect in the same direction as the original finding. With a strict significance criterion (p < .0001), 14 (50%) of the replications still provided such evidence, a reflection of the extremely high-powered design. Seven (25%) of the replications yielded effect sizes larger than the original ones, and 21 (75%) yielded effect sizes smaller than the original ones. The median comparable Cohen’s ds were 0.60 for the original findings and 0.15 for the replications. The effect sizes were small (< 0.20) in 16 of the replications (57%), and 9 effects (32%) were in the direction opposite the direction of the original effect. Across settings, the Q statistic indicated significant heterogeneity in 11 (39%) of the replication effects, and most of those were among the findings with the largest overall effect sizes; only 1 effect that was near zero in the aggregate showed significant heterogeneity according to this measure. Only 1 effect had a tau value greater than .20, an indication of moderate heterogeneity. Eight others had tau values near or slightly above .10, an indication of slight heterogeneity. Moderation tests indicated that very little heterogeneity was attributable to the order in which the tasks were performed or whether the tasks were administered in lab versus online.
Exploratory comparisons revealed little heterogeneity between Western, educated, industrialized, rich, and democratic (WEIRD) cultures and less WEIRD cultures (i.e., cultures with relatively high and low WEIRDness scores, respectively). Cumulatively, variability in the observed effect sizes was attributable more to the effect being studied than to the sample or setting in which it was studied.

UCR::Vicerrectoría de Investigación::Unidades de Investigación::Ciencias Sociales::Instituto de Investigaciones Psicológicas (IIP

Same data, different conclusions: Radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis

Author: Adie Prestone
Alaburda Paulius
Albers Casper
Alspaugh Sara
Alstott Jeff
Althoff Tim
Amireh Hashem
Arzi Adbi
Bahnik Stepan
Baik Jason
Balling Laura Winther
Banker Sachin
Baranger David AA
Barr Dale J
Barros-Rivera Brenda
Bauer Matt
Bernstein Abraham
Blaise Enuh
Chan CS Richard
de la Rubia Eduardo Arinno
Feldman Michael
Garrison S Mason
Goldstein Pavel
Heer Jeffrey
Jong Jonathan
Kale Alex
Kelchtermans Stijn
Liu Yang
Madan Nikhil
Mandl Benjamin
Mohamed Zainab
Murase Toshio
Naseeb Chan
Nelson Andrew A
Nolan Rory
Otner Sarah MG
Prasad Vaishali Venkatesh
Robinson David
Robinson Emily
Schaumans Catherine BC
Schweinsberg Martin
Silberzahn Raphael
Snellman Kaisa
Sommer S Amy
Staub Nicola
Strobl Carolin
Tierney Warren
Valdivia Ana
van Aert Robbie CM
van Assen Marcel ALM
van den Akker Olmo R
Viganola Domenico
Yarkoni Tal
Publication venue: 'Elsevier BV'
Publication date: 17/06/2021

In this crowdsourced initiative, independent analysts used the same dataset to test two hypotheses regarding the effects of scientists’ gender and professional status on verbosity during group meetings. Not only the analytic approach but also the operationalizations of key variables were left unconstrained and up to individual analysts. For instance, analysts could choose to operationalize status as job title, institutional ranking, citation counts, or some combination. To maximize transparency regarding the process by which analytic choices are made, the analysts used a platform we developed called DataExplained to justify both preferred and rejected analytic paths in real time. Analyses lacking sufficient detail, reproducible code, or with statistical errors were excluded, resulting in 29 analyses in the final sample. Researchers reported radically different analyses and dispersed empirical outcomes, in a number of cases obtaining significant effects in opposite directions for the same research question. A Boba multiverse analysis demonstrates that decisions about how to operationalize variables explain variability in outcomes above and beyond statistical choices (e.g., covariates). Subjective researcher decisions play a critical role in driving the reported empirical results, underscoring the need for open data, systematic robustness checks, and transparency regarding both analytic paths taken and not taken. Implications for organizations and leaders, whose decision making relies in part on scientific findings, consulting reports, and internal analyses by data scientists, are discussed.

Oxford University Research Archive