
    bridgesampling: An R Package for Estimating Normalizing Constants

    Statistical procedures such as Bayes factor model selection and Bayesian model averaging require the computation of normalizing constants (e.g., marginal likelihoods). These normalizing constants are notoriously difficult to obtain, as they usually involve high-dimensional integrals that cannot be solved analytically. Here we introduce an R package that uses bridge sampling (Meng and Wong 1996; Meng and Schilling 2002) to estimate normalizing constants in a generic and easy-to-use fashion. For models implemented in Stan, the estimation procedure is automatic. We illustrate the functionality of the package with three examples.
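    The bridgesampling package is in R; as a language-agnostic sketch of the iterative bridge sampling scheme of Meng and Wong (1996) that it implements, the following Python toy example estimates the marginal likelihood of a conjugate normal model where the answer is known in closed form. All names and the toy model are illustrative, not the package's API.

    ```python
    import numpy as np

    rng = np.random.default_rng(1)

    # Toy conjugate model: y_i ~ N(theta, 1), theta ~ N(0, 1).
    # Its marginal likelihood p(y) has a closed form, so the bridge
    # sampling estimate can be checked against the truth.
    y = rng.normal(0.5, 1.0, size=25)
    n, S = len(y), y.sum()

    # Exact log marginal likelihood for this conjugate model.
    log_ml_exact = (-n / 2 * np.log(2 * np.pi) - 0.5 * np.log(n + 1)
                    - 0.5 * (y ** 2).sum() + S ** 2 / (2 * (n + 1)))

    # Unnormalized posterior q1(theta) = likelihood * prior, on the log scale.
    def log_q1(theta):
        return (-0.5 * ((y[:, None] - theta) ** 2).sum(axis=0)
                - n / 2 * np.log(2 * np.pi)
                - 0.5 * theta ** 2 - 0.5 * np.log(2 * np.pi))

    # Posterior draws (exact here; MCMC draws in a real problem) and a
    # Gaussian proposal fitted to them, mirroring what bridgesampling does.
    N1 = N2 = 20_000
    post = rng.normal(S / (n + 1), np.sqrt(1 / (n + 1)), size=N1)
    m, sd = post.mean(), post.std()
    prop = rng.normal(m, sd, size=N2)

    def log_q2(theta):  # normalized Gaussian proposal density
        return -0.5 * ((theta - m) / sd) ** 2 - np.log(sd * np.sqrt(2 * np.pi))

    # Iterative bridge sampling update (Meng & Wong, 1996) for the
    # normalizing constant r = p(y), started from r = 1.
    l1 = np.exp(log_q1(post) - log_q2(post))   # ratios at posterior draws
    l2 = np.exp(log_q1(prop) - log_q2(prop))   # ratios at proposal draws
    s1, s2 = N1 / (N1 + N2), N2 / (N1 + N2)
    r = 1.0
    for _ in range(100):
        r = (np.mean(l2 / (s1 * l2 + s2 * r))
             / np.mean(1 / (s1 * l1 + s2 * r)))

    log_ml_bridge = np.log(r)
    ```

    With the proposal matched to the posterior, the iteration converges in a handful of steps and the estimate agrees closely with the analytic value; in a real application the posterior draws would come from Stan and the integrand would be high-dimensional.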

    Informed Bayesian Inference for the A/B Test

    Booming in business and a staple analysis in medical trials, the A/B test assesses the effect of an intervention or treatment by comparing its success rate with that of a control condition. Across many practical applications, it is desirable that (1) evidence can be obtained in favor of the null hypothesis that the treatment is ineffective; (2) evidence can be monitored as the data accumulate; (3) expert prior knowledge can be taken into account. Most existing approaches do not fulfill these desiderata. Here we describe a Bayesian A/B procedure based on Kass and Vaidyanathan (1992) that allows one to monitor the evidence for the hypotheses that the treatment has either a positive effect, a negative effect, or, crucially, no effect. Furthermore, this approach enables one to incorporate expert knowledge about the relative prior plausibility of the rival hypotheses and about the expected size of the effect, given that it is non-zero. To facilitate the wider adoption of this Bayesian procedure we developed the abtest package in R. We illustrate the package options and the associated statistical results with a fictitious business example and a real-data medical example.
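    The abtest package implements the Kass and Vaidyanathan (1992) logistic parametrization; as a much simpler stand-in that conveys the same idea — a Bayes factor comparing "treatment rate differs" against "one common rate" that can be recomputed as data accumulate — here is a conjugate Beta-Binomial sketch in Python. The function name, priors, and data are illustrative assumptions, not the package's method.

    ```python
    import math
    from scipy.special import betaln

    def log_bf10(y1, n1, y2, n2, a=1.0, b=1.0):
        """Log Bayes factor for H1 (independent success rates, each with a
        Beta(a, b) prior) versus H0 (one common rate with a Beta(a, b) prior).
        A simple conjugate stand-in for the Kass & Vaidyanathan setup."""
        log_m1 = (betaln(a + y1, b + n1 - y1) + betaln(a + y2, b + n2 - y2)
                  - 2 * betaln(a, b))
        log_m0 = betaln(a + y1 + y2, b + (n1 - y1) + (n2 - y2)) - betaln(a, b)
        return log_m1 - log_m0

    # Evidence can be monitored as the data accumulate by simply
    # recomputing the Bayes factor after each new batch of observations.
    bf_equal = math.exp(log_bf10(50, 100, 50, 100))  # equal rates: favors H0 (< 1)
    bf_diff = math.exp(log_bf10(70, 100, 40, 100))   # clear difference: favors H1 (> 1)
    ```

    Because the marginal likelihoods are available in closed form via Beta functions, this sketch also shows desideratum (1): with equal observed rates the Bayes factor falls below 1, i.e., evidence accumulates in favor of the null.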

    Informed Bayesian t-Tests

    Across the empirical sciences, few statistical procedures rival the popularity of the frequentist t-test. In contrast, the Bayesian versions of the t-test have languished in obscurity. In recent years, however, the theoretical and practical advantages of the Bayesian t-test have become increasingly apparent and various Bayesian t-tests have been proposed, both objective ones (based on general desiderata) and subjective ones (based on expert knowledge). Here, we propose a flexible t-prior for standardized effect size that allows computation of the Bayes factor by evaluating a single numerical integral. This specification contains previous objective and subjective t-test Bayes factors as special cases. Furthermore, we propose two measures for informed prior distributions that quantify the departure from the objective Bayes factor desiderata of predictive matching and information consistency. We illustrate the use of informed prior distributions based on an expert prior elicitation effort. Supplementary materials for this article are available online.
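    The "single numerical integral" structure can be sketched in Python for the one-sample case: the Bayes factor is the noncentral-t likelihood averaged over a t-prior on the standardized effect size, divided by the central-t likelihood under the null. The function name and the default prior settings below are our illustrative assumptions (location mu, scale r, and df kappa for the prior, with kappa=1 and mu=0 giving a Cauchy prior as a default special case).

    ```python
    import numpy as np
    from scipy import stats, integrate

    def bf10(t_obs, n, mu=0.0, r=1 / np.sqrt(2), kappa=1.0):
        """One-sample Bayes factor via a single numerical integral.
        Prior on the standardized effect size delta: a t-distribution with
        location mu, scale r, and df kappa; an informed analysis shifts these
        per expert elicitation, while mu=0, kappa=1 gives a Cauchy prior."""
        df = n - 1
        def integrand(delta):
            # noncentral-t likelihood of the observed t given effect size delta,
            # weighted by the prior density of delta
            return (stats.nct.pdf(t_obs, df, delta * np.sqrt(n))
                    * stats.t.pdf((delta - mu) / r, kappa) / r)
        numerator, _ = integrate.quad(integrand, -np.inf, np.inf)
        return numerator / stats.t.pdf(t_obs, df)
    ```

    For example, `bf10(0.0, 30)` falls below 1 (a t-value of zero supports the null), whereas `bf10(3.0, 30)` exceeds 1; an informed prior is obtained by moving `mu`, `r`, and `kappa` away from their default values.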

    Quantifying Support for the Null Hypothesis in Psychology: An Empirical Investigation

    In the traditional statistical framework, nonsignificant results leave researchers in a state of suspended disbelief. In this study, we examined, empirically, the treatment and evidential impact of nonsignificant results. Our specific goals were twofold: to explore how psychologists interpret and communicate nonsignificant results and to assess how much these results constitute evidence in favor of the null hypothesis. First, we examined all nonsignificant findings mentioned in the abstracts of the 2015 volumes of Psychonomic Bulletin & Review, Journal of Experimental Psychology: General, and Psychological Science (N = 137). In 72% of these cases, nonsignificant results were misinterpreted, in that the authors inferred that the effect was absent. Second, a Bayes factor reanalysis revealed that fewer than 5% of the nonsignificant findings provided strong evidence (i.e., BF01 > 10) in favor of the null hypothesis over the alternative hypothesis. We recommend that researchers expand their statistical tool kit in order to correctly interpret nonsignificant results and to be able to evaluate the evidence for and against the null hypothesis.
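    To see why a nonsignificant result rarely clears the BF01 > 10 bar, a rough back-of-the-envelope check is the well-known BIC approximation to the one-sample t-test Bayes factor (a generic stand-in, not the default-prior reanalysis the authors actually performed; the function and numbers below are ours for illustration).

    ```python
    import math

    def bf01_bic(t, n):
        """Rough BIC (unit-information prior) approximation to BF01 for a
        one-sample t-test: sqrt(n) * (1 + t^2/(n-1))^(-n/2).
        A coarse stand-in for a proper Bayes factor, used only to gauge scale."""
        return math.sqrt(n) * (1 + t ** 2 / (n - 1)) ** (-n / 2)

    # A typical nonsignificant result: t = 1.5 with n = 30 (two-sided p ≈ .14)
    bf01 = bf01_bic(1.5, 30)   # ≈ 1.8: only weak support for H0, far below 10
    ```

    Even though the result is comfortably nonsignificant, the approximate BF01 of about 1.8 is nowhere near the "strong evidence" threshold of 10 — absence of significance is not strong evidence of absence.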

    Bayesian model-averaged meta-analysis in medicine

    We outline a Bayesian model-averaged (BMA) meta-analysis for standardized mean differences in order to quantify evidence for both treatment effectiveness and across-study heterogeneity. We construct four competing models by orthogonally combining two present-absent assumptions, one for the treatment effect and one for across-study heterogeneity. To inform the choice of prior distributions for the model parameters, we used 50% of the Cochrane Database of Systematic Reviews to specify rival prior distributions for the effect size and heterogeneity parameters. The relative predictive performance of the competing models and rival prior distributions was assessed using the remaining 50% of the Cochrane Database. On average, the model that assumes the presence of a treatment effect as well as across-study heterogeneity outpredicted the other models, but not by a large margin. Within that model, predictive adequacy was relatively constant across the rival prior distributions. We propose specific empirical prior distributions, both for the field in general and for each of 46 specific medical subdisciplines. An example from oral health demonstrates how the proposed prior distributions can be used to conduct a BMA meta-analysis in the open-source software R and JASP. The preregistered analysis plan is available at https://osf.io/zs3df/
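    The "orthogonally combining two present-absent assumptions" step and the resulting model-averaged evidence can be sketched with plain arithmetic: given the four models' marginal likelihoods and prior model probabilities, one obtains posterior model probabilities and an inclusion Bayes factor for the treatment effect. The log marginal likelihoods below are made-up numbers purely for illustration; in practice they come from fitting the four meta-analytic models.

    ```python
    import numpy as np

    # Four models from crossing effect {absent, present} with
    # heterogeneity {absent, present}; log marginal likelihoods are
    # hypothetical placeholders for this sketch.
    models = {
        ("effect=absent",  "heterogeneity=absent"):  -45.2,
        ("effect=absent",  "heterogeneity=present"): -43.1,
        ("effect=present", "heterogeneity=absent"):  -44.0,
        ("effect=present", "heterogeneity=present"): -41.9,
    }
    log_ml = np.array(list(models.values()))
    prior = np.full(4, 0.25)  # uniform prior over the four models

    # Posterior model probabilities: normalize exp(log ml + log prior),
    # subtracting the max first for numerical stability.
    w = log_ml + np.log(prior)
    post = np.exp(w - w.max())
    post /= post.sum()

    # Inclusion Bayes factor for the treatment effect: posterior odds of
    # the "effect present" models over the "effect absent" models,
    # divided by the corresponding prior odds.
    has_effect = np.array([k[0] == "effect=present" for k in models])
    bf_incl = (post[has_effect].sum() / post[~has_effect].sum()) / (
        prior[has_effect].sum() / prior[~has_effect].sum())
    ```

    The same two lines of bookkeeping give an inclusion Bayes factor for heterogeneity by grouping the models on the second assumption instead; model-averaged parameter estimates then weight each model's posterior by `post`.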

    Statistics in the service of science: don’t let the tail wag the dog

    Statistical modeling is generally meant to describe patterns in data in service of the broader scientific goal of developing theories to explain those patterns. Statistical models support meaningful inferences when models are built so as to align parameters of the model with potential causal mechanisms and how they manifest in data. When statistical models are instead based on assumptions chosen by default, attempts to draw inferences can be uninformative or even paradoxical—in essence, the tail is trying to wag the dog. These issues are illustrated by van Doorn et al. (this issue) in the context of using Bayes factors to identify effects and interactions in linear mixed models. We show that the problems identified in their applications (along with other problems identified here) can be circumvented by using priors over inherently meaningful units instead of default priors on standardized scales. This case study illustrates how researchers must directly engage with a number of substantive issues in order to support meaningful inferences, of which we highlight two. The first is the problem of coordination, which requires a researcher to specify how the theoretical constructs postulated by a model are functionally related to observable variables. The second is the problem of generalization, which requires a researcher to consider how a model may represent theoretical constructs shared across similar but non-identical situations, along with the fact that model comparison metrics like Bayes factors do not directly address this form of generalization. For statistical modeling to serve the goals of science, models cannot be based on default assumptions, but should instead be based on an understanding of their coordination function and on how they represent causal mechanisms that may be expected to generalize to other related scenarios.