SIGMA: Secure GPT Inference with Function Secret Sharing

Ananta Mukherjee; Ashish Panwar; Divya Gupta; Kanav Gupta; Neha Jawalkar; Nishanth Chandran; Rahul Sharma

Publication date: 22 August 2023
Publisher: International Association for Cryptologic Research (IACR)

Abstract

Secure 2-party computation (2PC) enables secure inference that offers protection for both proprietary machine learning (ML) models and sensitive inputs to them. However, the existing secure inference solutions suffer from high latency and communication overheads, particularly for transformers. Function secret sharing (FSS) is a recent paradigm for obtaining efficient 2PC protocols with a preprocessing phase. We provide SIGMA, the first end-to-end system for secure transformer inference based on FSS. By constructing new FSS-based protocols for complex machine learning functionalities, such as Softmax and GeLU, and also accelerating their computation on GPUs, SIGMA improves the latency of secure inference of transformers by 11-19× over the state-of-the-art that uses preprocessing and GPUs. We present the first secure inference of generative pre-trained transformer (GPT) models. In particular, SIGMA executes GPT-Neo with 1.3 billion parameters in 7.4s and HuggingFace's GPT2 in 1.6s.

Available Versions

Cryptology ePrint Archive

oai:eprint.iacr.org:2023/1269

Last time updated on 25/08/2023