275 research outputs found

    A Bayesian test for excess zeros in a zero-inflated power series distribution

    Get PDF
    Power series distributions form a useful subclass of one-parameter discrete exponential families suitable for modeling count data. A zero-inflated power series distribution is a mixture of a power series distribution and a degenerate distribution at zero, with a mixing probability pp for the degenerate distribution. This distribution is useful for modeling count data that may have extra zeros. One question is whether the mixture model can be reduced to the power series portion, corresponding to p=0p=0, or whether there are so many zeros in the data that zero inflation relative to the pure power series distribution must be included in the model i.e., p0p\geq0. The problem is difficult partially because p=0p=0 is a boundary point. Here, we present a Bayesian test for this problem based on recognizing that the parameter space can be expanded to allow pp to be negative. Negative values of pp are inconsistent with the interpretation of pp as a mixing probability, however, they index distributions that are physically and probabilistically meaningful. We compare our Bayesian solution to two standard frequentist testing procedures and find that using a posterior probability as a test statistic has slightly higher power on the most important ranges of the sample size nn and parameter values than the score test and likelihood ratio test in simulations. Our method also performs well on three real data sets.Comment: Published in at http://dx.doi.org/10.1214/193940307000000068 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org

    Bayesian Variational Regularisation for Dark Matter Reconstruction with Uncertainty Quantification

    Get PDF
    Despite the great wealth of cosmological knowledge accumulated since the early 20th century, the nature of dark-matter, which accounts for ~85% of the matter content of the universe, remains illusive. Unfortunately, though dark-matter is scientifically interesting, with implications for our fundamental understanding of the Universe, it cannot be directly observed. Instead, dark-matter may be inferred from e.g. the optical distortion (lensing) of distant galaxies which, at linear order, manifests as a perturbation to the apparent magnitude (convergence) and ellipticity (shearing). Ensemble observations of the shear are collected and leveraged to construct estimates of the convergence, which can directly be related to the universal dark-matter distribution. Imminent stage IV surveys are forecast to accrue an unprecedented quantity of cosmological information; a discriminative partition of which is accessible through the convergence, and is disproportionately concentrated at high angular resolutions, where the echoes of cosmological evolution under gravity are most apparent. Capitalising on advances in probability concentration theory, this thesis merges the paradigms of Bayesian inference and optimisation to develop hybrid convergence inference techniques which are scalable, statistically principled, and operate over the Euclidean plane, celestial sphere, and 3-dimensional ball. Such techniques can quantify the plausibility of inferences at one-millionth the computational overhead of competing sampling methods. These Bayesian techniques are applied to the hotly debated Abell-520 merging cluster, concluding that observational catalogues contain insufficient information to determine the existence of dark-matter self-interactions. Further, these techniques were applied to all public lensing catalogues, recovering the then largest global dark-matter mass-map. The primary methodological contributions of this thesis depend only on posterior log-concavity, paving the way towards a, potentially revolutionary, complete hybridisation with artificial intelligence techniques. These next-generation techniques are the first to operate over the full 3-dimensional ball, laying the foundations for statistically principled universal dark-matter cartography, and the cosmological insights such advances may provide

    Bayesian Simultaneous Intervals for Small Areas: An Application to Mapping Mortality Rates in U.S. Health Service Areas

    Get PDF
    It is customary when presenting a choropleth map of rates or counts to present only the estimates (mean or mode) of the parameters of interest. While this technique illustrates spatial variation, it ignores the variation inherent in the estimates. We describe an approach to present variability in choropleth maps by constructing 100(1-alpha)% simultaneous intervals. The result provides three maps (estimate with two bands). We propose two methods to construct simultaneous intervals from the optimal individual highest posterior density (HPD) intervals to ensure joint simultaneous coverage of 100(1-alpha)%. Both methods exhibit the main feature of multiplying the lower bound and dividing the upper bound of the individual HPD intervals by parameters

    Inference on Impulse Response Functions in Structural VAR Models

    Get PDF
    Skepticism toward traditional identifying assumptions based on exclusion restrictions has led to a surge in the use of structural VAR models in which structural shocks are identified by restricting the sign of the responses of selected macroeconomic aggregates to these shocks. Researchers commonly report the vector of pointwise posterior medians of the impulse responses as a measure of central tendency of the estimated response functions, along with pointwise 68 percent posterior error bands. It can be shown that this approach cannot be used to characterize the central tendency of the structural impulse response functions. We propose an alternative method of summarizing the evidence from sign-identified VAR models designed to enhance their practical usefulness. Our objective is to characterize the most likely admissible model(s) within the set of structural VAR models that satisfy the sign restrictions. We show how the set of most likely structural response functions can be computed from the posterior mode of the joint distribution of admissible models both in the fully identified and in the partially identified case, and we propose a highest-posterior density credible set that characterizes the joint uncertainty about this set. Our approach can also be used to resolve the long-standing problem of ho

    KiDS-1000 Cosmology:Multi-probe weak gravitational lensing and spectroscopic galaxy clustering constraints

    Get PDF
    We present a joint cosmological analysis of weak gravitational lensing observations from the Kilo-Degree Survey (KiDS-1000), with redshift-space galaxy clustering observations from the Baryon Oscillation Spectroscopic Survey (BOSS), and galaxy-galaxy lensing observations from the overlap between KiDS-1000, BOSS and the spectroscopic 2-degree Field Lensing Survey (2dFLenS). This combination of large-scale structure probes breaks the degeneracies between cosmological parameters for individual observables, resulting in a constraint on the structure growth parameter S8=σ8Ωm/0.3=0.7660.014+0.020S_8=\sigma_8 \sqrt{\Omega_{\rm m}/0.3} = 0.766^{+0.020}_{-0.014}, that has the same overall precision as that reported by the full-sky cosmic microwave background observations from Planck. The recovered S8S_8 amplitude is low, however, by 8.3±2.68.3 \pm 2.6 % relative to Planck. This result builds from a series of KiDS-1000 analyses where we validate our methodology with variable depth mock galaxy surveys, our lensing calibration with image simulations and null-tests, and our optical-to-near-infrared redshift calibration with multi-band mock catalogues and a spectroscopic-photometric clustering analysis. The systematic uncertainties identified by these analyses are folded through as nuisance parameters in our cosmological analysis. Inspecting the offset between the marginalised posterior distributions, we find that the S8S_8-difference with Planck is driven by a tension in the matter fluctuation amplitude parameter, σ8\sigma_8. We quantify the level of agreement between the CMB and our large-scale structure constraints using a series of different metrics, finding differences with a significance ranging between  ⁣3σ\sim\! 3\,\sigma, when considering the offset in S8S_{8}, and  ⁣2σ\sim\! 2\,\sigma, when considering the full multi-dimensional parameter space.Comment: 25 pages, 11 figures, 5 tables, A&A accepted, including a new appendix on Intrinsic Alignments. The KiDS-1000 data products are available for download at http://kids.strw.leidenuniv.nl/DR4/lensing.php. This data release includes open source software, the shear-photo-z catalogue, the cosmic shear and 3x2pt data vectors and covariances, and posteriors in the form of Multinest chain

    J. K. Ghosh's contribution to statistics: A brief outline

    Get PDF
    Professor Jayanta Kumar Ghosh has contributed massively to various areas of Statistics over the last five decades. Here, we survey some of his most important contributions. In roughly chronological order, we discuss his major results in the areas of sequential analysis, foundations, asymptotics, and Bayesian inference. It is seen that he progressed from thinking about data points, to thinking about data summarization, to the limiting cases of data summarization in as they relate to parameter estimation, and then to more general aspects of modeling including prior and model selection.Comment: Published in at http://dx.doi.org/10.1214/074921708000000011 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org
    corecore