26 research outputs found

Efficient and Private Scoring of Decision Trees, Support Vector Machines and Logistic Regression Models based on Pre-Computation

Author: Anderson C. A. Nascimento
Caleb Horst
Martine De Cock
Rafael Dowsley
Raj Katti
Stacey C. Newman
Wing-Sea Poon
Publication venue: International Association for Cryptologic Research (IACR)
Publication date: 05/03/2017

Many data-driven personalized services require that private data of users is scored against a trained machine learning model. In this paper we propose a novel protocol for privacy-preserving classification of decision trees, a popular machine learning model in these scenarios. Our solutions are composed of building blocks, namely a secure comparison protocol, a protocol for obliviously selecting inputs, and a protocol for evaluating polynomials. By combining some of the building blocks for our decision tree classification protocol, we also improve previously proposed solutions for classification of support vector machines and logistic regression models. Our protocols are information-theoretically secure and, unlike previously proposed solutions, do not require modular exponentiations. We show that our protocols for privacy-preserving classification lead to more efficient results from the point of view of computational and communication complexities. We present accuracy and runtime results for 7 classification benchmark datasets from the UCI repository.

Cryptology ePrint Archive

Privacy-preserving scoring of tree ensembles : a novel framework for AI in healthcare

Author: De Cock Martine
Dowsley Rafael
Fritchman Kyle
Hughes Tyler
Nascimento Anderson
Saminathan Keerthanaa
Teredesai Ankur
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2018

Machine Learning (ML) techniques now impact a wide variety of domains. Highly regulated industries such as healthcare and finance have stringent compliance and data governance policies around data sharing. Advances in secure multiparty computation (SMC) for privacy-preserving machine learning (PPML) can help transform these regulated industries by allowing ML computations over encrypted data with personally identifiable information (PII). Yet very little of SMC-based PPML has been put into practice so far. In this paper we present the very first framework for privacy-preserving classification of tree ensembles with application in healthcare. We first describe the underlying cryptographic protocols that enable a healthcare organization to send encrypted data securely to an ML scoring service and obtain encrypted class labels without the scoring service actually seeing that input in the clear. We then describe the deployment challenges we solved to integrate these protocols in a cloud-based scalable risk-prediction platform with multiple ML models for healthcare AI. Included are system internals and evaluations of our deployment for supporting physicians to drive better clinical outcomes in an accurate, scalable, and provably secure manner. To the best of our knowledge, this is the first such applied framework with SMC-based privacy-preserving machine learning for healthcare.

Crossref

Ghent University Academic Bibliography

Round and Communication Balanced Protocols for Oblivious Evaluation of Finite State Machines

Author: Dowsley Rafael
Horst Caleb
Nascimento Anderson C. A.
Publication date: 20/03/2021

We propose protocols for obliviously evaluating finite-state machines, i.e., the evaluation is shared between the provider of the finite-state machine and the provider of the input string in such a manner that neither party learns the other's input, and the states being visited are hidden from both. For alphabet size $|\Sigma|$, number of states $|Q|$, and input length $n$, previous solutions have either required a number of rounds linear in $n$ or communication $\Omega(n|\Sigma||Q|\log|Q|)$. Our solutions require 2 rounds with communication $O(n(|\Sigma|+|Q|\log|Q|))$. We present two different solutions to this problem, a two-party one and a setting with an untrusted but non-colluding helper.

arXiv.org e-Print Archive

Cryptology ePrint Archive

Introductory Chapter: Data Privacy Preservation on the Internet of Things

Author: Dasgupta Subhasis
Sen Jaydip
Publication venue: IntechOpen
Publication date: 27/09/2023

IntechOpen

Privacy-preserving logistic regression with secret sharing

Author: Ghavamipour Ali Reza
Jiang Xiaoqian
Turkmen Fatih
Publication date: 02/04/2022

Background: Logistic regression (LR) is a widely used classification method for modeling binary outcomes in many medical data classification tasks. Researchers that collect and combine datasets from various data custodians and jurisdictions can greatly benefit from the increased statistical power to support their analysis goals. However, combining data from different sources creates serious privacy concerns that need to be addressed. Methods: In this paper, we propose two privacy-preserving protocols for performing logistic regression with the Newton–Raphson method in the estimation of parameters. Our proposals are based on secure Multi-Party Computation (MPC) and tailored to the honest majority and dishonest majority security settings. Results: The proposed protocols are evaluated against both synthetic and real-world datasets in terms of efficiency and accuracy, and a comparison is made with the ordinary logistic regression. The experimental results demonstrate that the proposed protocols are highly efficient and accurate. Conclusions: Our work introduces two iterative algorithms to enable the distributed training of a logistic regression model in a privacy-preserving manner. The implementation results show that our algorithms can handle large datasets from multiple sources.

ARTS repository - University of Groningen