2,033,375 research outputs found

    Minimal inference from incomplete 2x2-tables

    Full text link
    Estimates based on 2x2 tables of frequencies are widely used in statistical applications. However, in many cases these tables are incomplete in the sense that the data required to compute the frequencies for a subset of the cells defining the table are unavailable. Minimal inference addresses those situations where this incompleteness leads to target parameters for these tables that are interval, rather than point, identifiable. In particular, we develop the concept of corroboration as a measure of the statistical evidence in the observed data that is not based on likelihoods. The corroboration function identifies the parameter values that are the hardest to refute, i.e., those values which, under repeated sampling, remain interval identified. This enables us to develop a general approach to inference from incomplete 2x2 tables when the additional assumptions required to support a likelihood-based approach cannot be sustained based on the data available. This minimal inference approach then provides a foundation for further analysis that aims at making sharper inference supported by plausible external beliefs

    Using Mathematica to build Non-parametric Statistical Tables

    Get PDF
    In this paper, I present computational procedures to obtian statistical tables. The tables of the asymptotic distribution and the exact distribution of Kolmogorov-Smirnov statistic D_n for one population, the table of the distribution of the runs R, the table of the distribution of Wilcoxon signed-rank statistic W^+ and the table of the distribution of Mann-Whitney statistic U_x using Mathematica, Version 3.9 under Window98. I think that it is an interesting cuestion because many statistical packages give the asymptotic significance level in the statistical tests and with these porcedures one can easily calculate the exact significance levels and the left-tail and right-tail probabilities with non-parametric distributions. I have used mathematica to make these calculations because one can use symbolic language to solve recursion relations. It's very easy to generate the format of the tables, and it's possible to obtain any table of the mentioned non-parametric distributions with any precision, not only with the standard parameters more used in Statistics, and without transcription mistakes. Furthermore, using similar procedures, we can generate tables for the following distribution functions: Binomial, Poisson, Hypergeometric, Normal, x^2 Chi-Square, T-Student, F-Snedecor, Geometric, Gamma and Beta.

    The Complexity of Three-Way Statistical Tables

    Full text link
    Multi-way tables with specified marginals arise in a variety of applications in statistics and operations research. We provide a comprehensive complexity classification of three fundamental computational problems on tables: existence, counting and entry-security. One major outcome of our work is that each of the following problems is intractable already for "slim" 3-tables, with constant and smallest possible number 3 of rows: (1) deciding existence of 3-tables with given consistent 2-marginals; (2) counting all 3-tables with given 2-marginals; (3) finding whether an integer value is attained in entry (i,j,k) by at least one of the 3-tables satisfying given (feasible) 2-marginals. This implies that a characterization of feasible marginals for such slim tables, sought by much recent research, is unlikely to exist. Another important consequence of our study is a systematic efficient way of embedding the set of 3-tables satisfying any given 1-marginals and entry upper bounds in a set of slim 3-tables satisfying suitable 2-marginals with no entry bounds. This provides a valuable tool for studying multi-index transportation problems and multi-index transportation polytopes

    The Geometry of Statistical Models for Two-Way Contingency Tables with Fixed Odds Ratios

    Get PDF
    We study the geometric structure of the statistical models for two-by-two contingency tables. One or two odds ratios are fixed and the corresponding models are shown to be a portion of a ruled quadratic surface or a segment. Some pointers to the general case of two-way contingency tables are also given and an application to case-control studies is presented.Comment: References were not displaying properly in the previous versio

    Understanding Database Reconstruction Attacks on Public Data

    Get PDF
    In 2020 the U.S. Census Bureau will conduct the Constitutionally mandated decennial Census of Population and Housing. Because a census involves collecting large amounts of private data under the promise of confidentiality, traditionally statistics are published only at high levels of aggregation. Published statistical tables are vulnerable to DRAs (database reconstruction attacks), in which the underlying microdata is recovered merely by finding a set of microdata that is consistent with the published statistical tabulations. A DRA can be performed by using the tables to create a set of mathematical constraints and then solving the resulting set of simultaneous equations. This article shows how such an attack can be addressed by adding noise to the published tabulations, so that the reconstruction no longer results in the original data

    Strangeness and Statistical QCD

    Get PDF
    We discuss properties of statistical QCD relevant in Fermi phase space model analysis of strange hadron production experimental data. We argue that the analysis results interpreted using established statistical QCD properties are demonstrating formation of the color deconfined state of matter in relativistic heavy ion collisions at highest CERN-SPS energies and at BNL-RHIC, comprising deconfined matter composed of nearly massless quarks and gluons, in statistical equilibrium.Comment: 22 pages, including 7 figures, 3 tables, to appear in Nuclear Physics Proceedings Supplement: STATISTICAL QCD, held at Bielefeld August 200

    About the Statistical Properties of Cosmological Billiards

    Full text link
    We summarize some recent progress in the understanding of the statistical properties of cosmological billiards.Comment: 10 pages, 5 figures, 2 tables, Proceedings of The second Galileo-XuGuangqi Meeting, 11-16/07/2010, Ventimiglia, Ital

    Descriptive Analysis of Matrix-Valued Time-Series

    Get PDF
    In this article we present a technique of data analysis applied to three-dimensional tables as, for instance, matrix-valued time-series. The main goal of the method is to describe the evolution of the statistical units with respect to time in a space summarizing the set of matrices. Moreover, our technique points out similar statistical units provided by a classification of their trajectories.matrix-valued time-series; data analysis; statistical units
    corecore