136 research outputs found

    k-ANONYMITY: A MODEL FOR PROTECTING PRIVACY

    Full text link

    Differentially Private Model Selection with Penalized and Constrained Likelihood

    Full text link
    In statistical disclosure control, the goal of data analysis is twofold: The released information must provide accurate and useful statistics about the underlying population of interest, while minimizing the potential for an individual record to be identified. In recent years, the notion of differential privacy has received much attention in theoretical computer science, machine learning, and statistics. It provides a rigorous and strong notion of protection for individuals' sensitive information. A fundamental question is how to incorporate differential privacy into traditional statistical inference procedures. In this paper we study model selection in multivariate linear regression under the constraint of differential privacy. We show that model selection procedures based on penalized least squares or likelihood can be made differentially private by a combination of regularization and randomization, and propose two algorithms to do so. We show that our private procedures are consistent under essentially the same conditions as the corresponding non-private procedures. We also find that under differential privacy, the procedure becomes more sensitive to the tuning parameters. We illustrate and evaluate our method using simulation studies and two real data examples

    Exclusive Strategy for Generalization Algorithms in Micro-data Disclosure

    Full text link
    Abstract. When generalization algorithms are known to the public, an adver-sary can obtain a more precise estimation of the secret table than what can be deduced from the disclosed generalization result. Therefore, whether a general-ization algorithm can satisfy a privacy property should be judged based on such an estimation. In this paper, we show that the computation of the estimation is inherently a recursive process that exhibits a high complexity when generaliza-tion algorithms take a straightforward inclusive strategy. To facilitate the design of more efficient generalization algorithms, we suggest an alternative exclusive strategy, which adopts a seemingly drastic approach to eliminate the need for recursion. Surprisingly, the data utility of the two strategies are actually not com-parable and the exclusive strategy can provide better data utility in certain cases.

    Can a supernova be located by its neutrinos?

    Get PDF
    A future core-collapse supernova in our Galaxy will be detected by several neutrino detectors around the world. The neutrinos escape from the supernova core over several seconds from the time of collapse, unlike the electromagnetic radiation, emitted from the envelope, which is delayed by a time of order hours. In addition, the electromagnetic radiation can be obscured by dust in the intervening interstellar space. The question therefore arises whether a supernova can be located by its neutrinos alone. The early warning of a supernova and its location might allow greatly improved astronomical observations. The theme of the present work is a careful and realistic assessment of this question, taking into account the statistical significance of the various neutrino signals. Not surprisingly, neutrino-electron forward scattering leads to a good determination of the supernova direction, even in the presence of the large and nearly isotropic background from other reactions. Even with the most pessimistic background assumptions, SuperKamiokande (SK) and the Sudbury Neutrino Observatory (SNO) can restrict the supernova direction to be within circles of radius 55^\circ and 2020^\circ, respectively. Other reactions with more events but weaker angular dependence are much less useful for locating the supernova. Finally, there is the oft-discussed possibility of triangulation, i.e., determination of the supernova direction based on an arrival time delay between different detectors. Given the expected statistics we show that, contrary to previous estimates, this technique does not allow a good determination of the supernova direction.Comment: 11 pages including 2 figures. Revised version corrects typos, adds some brief comment

    Privacy in Microdata Release: Challenges, Techniques, and Approaches

    Get PDF
    Releasing and disseminating useful microdata while ensuring that no personal or sensitive information is improperly exposed is a complex problem, heavily investigated by the scientific community in the past couple of decades. Various microdata protection approaches have then been proposed, achieving different privacy requirements through appropriate protection techniques. This chapter discusses the privacy risks that can arise in microdata release and illustrates some well-known privacy-preserving techniques and approaches

    Sample-Dependent Estimation in Survey Sampling

    No full text

    Towards a survey measurement system

    No full text
    corecore