Search CORE

169 research outputs found

Comment: Bayesian Checking of the Second Levels of Hierarchical Models

Author: Johnson Valen E.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 06/02/2008
Field of study

Comment: Bayesian Checking of the Second Levels of Hierarchical Models [arXiv:0802.0743]Comment: Published in at http://dx.doi.org/10.1214/07-STS235D the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

A Bayesian \chi^2 test for goodness-of-fit

Author: Johnson Valen E.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2005
Field of study

This article describes an extension of classical \chi^2 goodness-of-fit tests to Bayesian model assessment. The extension, which essentially involves evaluating Pearson's goodness-of-fit statistic at a parameter value drawn from its posterior distribution, has the important property that it is asymptotically distributed as a \chi^2 random variable on K-1 degrees of freedom, independently of the dimension of the underlying parameter vector. By examining the posterior distribution of this statistic, global goodness-of-fit diagnostics are obtained. Advantages of these diagnostics include ease of interpretation, computational convenience and favorable power properties. The proposed diagnostics can be used to assess the adequacy of a broad class of Bayesian models, essentially requiring only a finite-dimensional parameter vector and conditionally independent observations.Comment: Published at http://dx.doi.org/10.1214/009053604000000616 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

UNIFORMLY MOST POWERFUL BAYESIAN TESTS

Author: Valen E. Johnson
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2013
Field of study

Uniformly most powerful tests are statistical hypothesis tests that provide the greatest power against a fixed null hypothesis among all tests of a given size. In this article, the notion of uniformly most powerful tests is extended to the Bayesian setting by defining uniformly most powerful Bayesian tests to be tests that maximize the probability that the Bayes factor, in favor of the alternative hypothesis, exceeds a specified threshold. Like their classical counterpart, uniformly most powerful Bayesian tests are most easily defined in one-parameter exponential family models, although extensions outside of this class are possible. The connection between uniformly most powerful tests and uniformly most powerful Bayesian tests can be used to provide an approximate calibration between p-values and Bayes factors. Finally, issues regarding the strong dependence of resulting Bayes factors and p-values on sample size are discussed

arXiv.org e-Print Archive

Crossref

Texas A&M Repository

PubMed Central

Revised standards for statistical evidence

Author: Valen E. Johnson
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2013
Field of study

Recent advances in Bayesian hypothesis testing have led to the development of uniformly most powerful Bayesian tests, which represent an objective, default class of Bayesian hypothesis tests that have the same rejection regions as classical significance tests. Based on the correspondence between these two classes of tests, it is possible to equate the size of classical hypothesis tests with evidence thresholds in Bayesian tests, and to equate P values with Bayes factors. An examination of these connections suggest that recent concerns over the lack of reproducibility of scientific studies can be attributed largely to the conduct of significance tests at unjustifiably high levels of significance. To correct this problem, evidence thresholds required for the declaration of a significant finding should be increased to 25–50:1, and to 100–200:1 for the declaration of a highly significant finding. In terms of classical hypothesis tests, these evidence standards mandate the conduct of tests at the 0.005 or 0.001 level of significance

Crossref

Texas A&M Repository

PubMed Central