152 research outputs found
Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets
Data sharing has become of primary importance in many domains such as
big-data analytics, economics and medical research, but remains difficult to
achieve when the data are sensitive. In fact, sharing personal information
requires individuals' unconditional consent or is often simply forbidden for
privacy and security reasons. In this paper, we propose Drynx, a decentralized
system for privacy-conscious statistical analysis on distributed datasets.
Drynx relies on a set of computing nodes to enable the computation of
statistics such as standard deviation or extrema, and the training and
evaluation of machine-learning models on sensitive and distributed data. To
ensure data confidentiality and the privacy of the data providers, Drynx
combines interactive protocols, homomorphic encryption, zero-knowledge proofs
of correctness, and differential privacy. It enables an efficient and
decentralized verification of the input data and of all the system's
computations thus provides auditability in a strong adversarial model in which
no entity has to be individually trusted. Drynx is highly modular, dynamic and
parallelizable. Our evaluation shows that it enables the training of a logistic
regression model on a dataset (12 features and 600,000 records) distributed
among 12 data providers in less than 2 seconds. The computations are
distributed among 6 computing nodes, and Drynx enables the verification of the
query execution's correctness in less than 22 seconds.Comment: Accepted for publication at IEEE Transactions on Information
Forensics and Securit
A Predictive Model for User Motivation and Utility Implications of Privacy-Protection Mechanisms in Location Check-Ins
Location check-ins contain both geographical and semantic information about the visited venues. Semantic information is usually represented by means of tags (e.g., “restaurant”). Such data can reveal some personal information about users beyond what they actually expect to disclose, hence their privacy is threatened. To mitigate such threats, several privacy protection techniques based on location generalization have been proposed. Although the privacy implications of such techniques have been extensively studied, the utility implications are mostly unknown. In this paper, we propose a predictive model for quantifying the effect of a privacy-preserving technique (i.e., generalization) on the perceived utility of check-ins. We first study the users’ motivations behind their location check-ins, based on a study targeted at Foursquare users (N = 77). We propose a machine-learning method for determining the motivation behind each check-in, and we design a motivation-based predictive model for the utility implications of generalization. Based on the survey data, our results show that the model accurately predicts the fine-grained motivation behind a check-in in 43% of the cases and in 63% of the cases for the coarse-grained motivation. It also predicts, with a mean error of 0.52 (on a scale from 1 to 5), the loss of utility caused by semantic and geographical generalization. This model makes it possible to design of utility-aware, privacy-enhancing mechanisms in location-based online social networks. It also enables service providers to implement location-sharing mechanisms that preserve both the utility and privacy for their users
Evaluating the usability of a visual feature modeling notation
International audienceFeature modeling is a popular Software Product Line Engineering (SPLE) technique used to describe variability in a product family. A usable feature modeling tool environment should enable SPLE practitioners to produce good quality models, in particular, models that effectively communicate modeled information. FAMILIAR is a text-based environment for manipulating and composing Feature Models (FMs). In this paper we present extensions we made to FAMILIAR to enhance its usability. The extensions include a visualization of FMs, or more precisely , a feature diagram rendering mechanism that supports the use of a combination of text and graphics to describe FMs, their configurations, and the results of FM analyses. We also present the results of a preliminary evaluation of the environment's usability. The evaluation involves comparing the use of the extended environment with the previous text-based console-driven version. The preliminary experiment provides some evidence that use of the new environment results in increased cognitive effectiveness of novice users and improved quality of new FMs
An updated review of case–control studies of lung cancer and indoor radon-Is indoor radon the risk factor for lung cancer?
- …
