The importance of feature preprocessing for differentially private
  linear optimization

Menon, Aditya Krishna; Sun, Ziteng; Suresh, Ananda Theertha

The importance of feature preprocessing for differentially private linear optimization

Authors: Aditya Krishna Menon
Ziteng Sun
Ananda Theertha Suresh
Publication date: 19 July 2023
Publisher

Abstract

Training machine learning models with differential privacy (DP) has received increasing interest in recent years. One of the most popular algorithms for training differentially private models is differentially private stochastic gradient descent (DPSGD) and its variants, where at each step gradients are clipped and combined with some noise. Given the increasing usage of DPSGD, we ask the question: is DPSGD alone sufficient to find a good minimizer for every dataset under privacy constraints? As a first step towards answering this question, we show that even for the simple case of linear classification, unlike non-private optimization, (private) feature preprocessing is vital for differentially private optimization. In detail, we first show theoretically that there exists an example where without feature preprocessing, DPSGD incurs a privacy error proportional to the maximum norm of features over all samples. We then propose an algorithm called DPSGD-F, which combines DPSGD with feature preprocessing and prove that for classification tasks, it incurs a privacy error proportional to the diameter of the features

\max_{x, x' \in D} \|x - x'\|_2

. We then demonstrate the practicality of our algorithm on image classification benchmarks

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2307.11106

Last time updated on 28/07/2023