Privacy concerns associated with Large Language Models (LLMs) have grown with the rise of models such as ChatGPT. Existing work explores Differential
Privacy (DP) techniques to mitigate these privacy risks, at the cost of degraded generalization. Our paper reveals that the
flatness of DP-trained models' loss landscape plays an essential role in the
trade-off between their privacy and generalization. We further propose a
holistic framework to enforce appropriate weight flatness, which substantially
improves model generalization with competitive privacy preservation. It
innovates at three coarse-to-fine levels: perturbation-aware
min-max optimization on model weights within a layer, flatness-guided sparse
prefix-tuning on weights across layers, and weight knowledge distillation
between DP and non-DP weight copies (the first level is sketched below). Comprehensive experiments in both
black-box and white-box scenarios are conducted to demonstrate the
effectiveness of our proposal in enhancing generalization and maintaining DP
characteristics. For instance, on the text classification dataset QNLI, DP-Flat
achieves performance comparable to non-private full fine-tuning while providing a DP
guarantee under privacy budget ϵ=3, and even better performance given
higher privacy budgets. Code is provided in the supplementary material.

Comment: Accepted to the ICLR 2024 SeT LLM Workshop
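
To make the first, within-layer level concrete, below is a minimal sketch of perturbation-aware min-max optimization in PyTorch. The abstract does not specify the exact formulation, so this assumes a sharpness-aware (SAM-style) objective, min over weights w of max over perturbations δ with ‖δ‖₂ ≤ ρ of the loss L(w + δ); the function name `perturbation_aware_step` and the radius `rho` are illustrative placeholders, not details from the paper.

```python
# Sketch of a SAM-style perturbation-aware min-max step (an assumption;
# the paper's exact within-layer formulation is not given in the abstract).
import torch

def perturbation_aware_step(model, loss_fn, batch, optimizer, rho=0.05):
    """One min-max update: max over a weight perturbation, min over weights."""
    inputs, targets = batch

    # Inner max: ascend the loss to find a worst-case weight perturbation
    # inside an l2 ball of radius rho.
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    grad_norm = torch.sqrt(sum((p.grad ** 2).sum()
                               for p in model.parameters()
                               if p.grad is not None))
    eps = {}
    with torch.no_grad():
        for p in model.parameters():
            if p.grad is None:
                continue
            e = rho * p.grad / (grad_norm + 1e-12)  # ascent step scaled to the ball
            p.add_(e)
            eps[p] = e
    optimizer.zero_grad()

    # Outer min: the gradient taken at the perturbed weights drives the update.
    loss_fn(model(inputs), targets).backward()
    with torch.no_grad():
        for p, e in eps.items():
            p.sub_(e)  # restore the original weights before stepping
    optimizer.step()  # in DP training this would be a DP-SGD step
                      # (per-example clipping plus calibrated noise)
    optimizer.zero_grad()
```

In a DP setting, the final `optimizer.step()` would be replaced by a privatized update so the flatness-seeking perturbation improves generalization without altering the privacy accounting; this is a sketch under those assumptions, not the paper's implementation.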