Search CORE

9,157 research outputs found

On the Differential Privacy of Bayesian Inference

Author: Dimitrakakis Christos
Rubinstein Benjamin
Zhang Zuhe
Publication venue
Publication date: 22/12/2015
Field of study

We study how to communicate findings of Bayesian inference to third parties, while preserving the strong guarantee of differential privacy. Our main contributions are four different algorithms for private Bayesian inference on proba-bilistic graphical models. These include two mechanisms for adding noise to the Bayesian updates, either directly to the posterior parameters, or to their Fourier transform so as to preserve update consistency. We also utilise a recently introduced posterior sampling mechanism, for which we prove bounds for the specific but general case of discrete Bayesian networks; and we introduce a maximum-a-posteriori private mechanism. Our analysis includes utility and privacy bounds, with a novel focus on the influence of graph structure on privacy. Worked examples and experiments with Bayesian na{\"i}ve Bayes and Bayesian linear regression illustrate the application of our mechanisms.Comment: AAAI 2016, Feb 2016, Phoenix, Arizona, United State

arXiv.org e-Print Archive

HAL - Lille 3

INRIA a CCSD electronic archive server

HAL Descartes

Chalmers Research

Chalmers Publication Library

Hal-Diderot

Association for the Advancement of Artificial Intelligence: AAAI Publications

A Bayesian Model of Sample Selection with a Discrete Outcome Variable

Author: Maksym Obrizan
Publication venue
Publication date
Field of study

Relatively few published studies apply Heckman’s (1979) sample selection model to the case of a discrete endogenous variable and those are limited to a single outcome equation. However, there are potentially many applications for this model in health, labor and financial economics. To fill in this theoretical gap, I extend the Bayesian multivariate probit setup of Chib and Greenberg (1998) into a model of non-ignorable selection that can handle multiple selection and discrete-continuous outcome equations. The first extension of the multivariate probit model in Chib and Greenberg (1998) allows some of the outcomes to be missing. In addition, I use Cholesky factorization of the variance matrix to avoid the Metropolis-Hastings algorithm in the Gibbs sampler. Finally, using artificial data I show that the model is capable of retrieving the parameters used in the data-generating process and also that the resulting Markov Chain passes all standard convergence tests.Markov Chain Monte Carlo; sample selection; multivariate probit

Research Papers in Economics

Probabilistic Graphical Models on Multi-Core CPUs using Java 8

Author: Borchani Hanen
Martinez Ana M.
Masegosa Andres R.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

In this paper, we discuss software design issues related to the development of parallel computational intelligence algorithms on multi-core CPUs, using the new Java 8 functional programming features. In particular, we focus on probabilistic graphical models (PGMs) and present the parallelisation of a collection of algorithms that deal with inference and learning of PGMs from data. Namely, maximum likelihood estimation, importance sampling, and greedy search for solving combinatorial optimisation problems. Through these concrete examples, we tackle the problem of defining efficient data structures for PGMs and parallel processing of same-size batches of data sets using Java 8 features. We also provide straightforward techniques to code parallel algorithms that seamlessly exploit multi-core processors. The experimental analysis, carried out using our open source AMIDST (Analysis of MassIve Data STreams) Java toolbox, shows the merits of the proposed solutions.Comment: Pre-print version of the paper presented in the special issue on Computational Intelligence Software at IEEE Computational Intelligence Magazine journa

arXiv.org e-Print Archive

VBN

Decentralized learning with budgeted network load using Gaussian copulas and classifier ensembles

Author: AP Dawid
C Genest
DH Wolpert
ED Sontag
F Pedregosa
GB Giannakis
I Zezula
J Kittler
J Kittler
L Breiman
L Xu
LK Hansen
M Wozniak
OP Faugeras
S Deerwester
TK Ho
V Tresp
Y Freund
Y Koren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/03/2019
Field of study

We examine a network of learners which address the same classification task but must learn from different data sets. The learners cannot share data but instead share their models. Models are shared only one time so as to preserve the network load. We introduce DELCO (standing for Decentralized Ensemble Learning with COpulas), a new approach allowing to aggregate the predictions of the classifiers trained by each learner. The proposed method aggregates the base classifiers using a probabilistic model relying on Gaussian copulas. Experiments on logistic regressor ensembles demonstrate competing accuracy and increased robustness in case of dependent classifiers. A companion python implementation can be downloaded at https://github.com/john-klein/DELC

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

UCL Discovery

Hal-Diderot