Search CORE

16 research outputs found

Horizontal Federated Learning and Secure Distributed Training for Recommendation System with Intel SGX

Author: Hu Albert
Hui Siyuan
Song Edmund
Zhang Yuqiu
Publication venue
Publication date: 11/07/2022
Field of study

With the advent of big data era and the development of artificial intelligence and other technologies, data security and privacy protection have become more important. Recommendation systems have many applications in our society, but the model construction of recommendation systems is often inseparable from users' data. Especially for deep learning-based recommendation systems, due to the complexity of the model and the characteristics of deep learning itself, its training process not only requires long training time and abundant computational resources but also needs to use a large amount of user data, which poses a considerable challenge in terms of data security and privacy protection. How to train a distributed recommendation system while ensuring data security has become an urgent problem to be solved. In this paper, we implement two schemes, Horizontal Federated Learning and Secure Distributed Training, based on Intel SGX(Software Guard Extensions), an implementation of a trusted execution environment, and TensorFlow framework, to achieve secure, distributed recommendation system-based learning schemes in different scenarios. We experiment on the classical Deep Learning Recommendation Model (DLRM), which is a neural network-based machine learning model designed for personalization and recommendation, and the results show that our implementation introduces approximately no loss in model performance. The training speed is within acceptable limits.Comment: 5 pages, 8 figure

arXiv.org e-Print Archive

PDoT: Private DNS-over-TLS with TEE Support

Author: Nakatsuka Yoshimichi
Paverd Andrew
Tsudik Gene
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/09/2019
Field of study

Security and privacy of the Internet Domain Name System (DNS) have been longstanding concerns. Recently, there is a trend to protect DNS traffic using Transport Layer Security (TLS). However, at least two major issues remain: (1) how do clients authenticate DNS-over-TLS endpoints in a scalable and extensible manner; and (2) how can clients trust endpoints to behave as expected? In this paper, we propose a novel Private DNS-over-TLS (PDoT ) architecture. PDoT includes a DNS Recursive Resolver (RecRes) that operates within a Trusted Execution Environment (TEE). Using Remote Attestation, DNS clients can authenticate, and receive strong assurance of trustworthiness of PDoT RecRes. We provide an open-source proof-of-concept implementation of PDoT and use it to experimentally demonstrate that its latency and throughput match that of the popular Unbound DNS-over-TLS resolver.Comment: To appear: ACSAC 201

arXiv.org e-Print Archive

Crossref

A Generative Framework for Low-Cost Result Validation of Outsourced Machine Learning Tasks

Author: Aguilera Miguel A. Guirao
Kumar Abhinav
Misra Satyajayant
Tourani Reza
Publication venue
Publication date: 30/10/2023
Field of study

The growing popularity of Machine Learning (ML) has led to its deployment in various sensitive domains, which has resulted in significant research focused on ML security and privacy. However, in some applications, such as autonomous driving, integrity verification of the outsourced ML workload is more critical--a facet that has not received much attention. Existing solutions, such as multi-party computation and proof-based systems, impose significant computation overhead, which makes them unfit for real-time applications. We propose Fides, a novel framework for real-time validation of outsourced ML workloads. Fides features a novel and efficient distillation technique--Greedy Distillation Transfer Learning--that dynamically distills and fine-tunes a space and compute-efficient verification model for verifying the corresponding service model while running inside a trusted execution environment. Fides features a client-side attack detection model that uses statistical analysis and divergence measurements to identify, with a high likelihood, if the service model is under attack. Fides also offers a re-classification functionality that predicts the original class whenever an attack is identified. We devised a generative adversarial network framework for training the attack detection and re-classification models. The evaluation shows that Fides achieves an accuracy of up to 98% for attack detection and 94% for re-classification.Comment: 16 pages, 11 figure

arXiv.org e-Print Archive

Federated Learning Enables Big Data for Rare Cancer Boundary Detection

Author: Abayazeed Aly
Abbassy Ahmed
Abello Ana
Adabi Saba
Agzarian Marc
Ahn Sung Soo
Ak Murat
Alafandi Ahmed
Alexander Gregory S
Alexiou Sotiris
Allen Bryan
Apgar Charles
Badve Chaitra
Baek Stephen
Baid Ujjwal
Bakas Spyridon
Bapuraj J. Rajiv
Bareja Rohan
Barnholtz-Sloan Jill S.
Beets-Tan Regina G. H.
Belouali Anas
Bencheqroun Camelia
Bendszus Martin
Benson Sean
Bernal Jose
Bhardwaj Sargam
Bhuvaneshwar Krithika
Bialecki Brian
Bilello Michel
Bink Andrea
Booth Thomas C.
Boss Michael A.
Brugnara Gianluca
Buatti John M.
Calabrese Evan
Capellades Jaume
Cha Soonmee
Chambless Lola B.
Chang Jong Hee
Chang Ken
Chelliah Alysha
Chen Cheng
Chen Jonathan
Choi Joseph
Choi Yoon Seong
Chong Chee
Chotai Silky
Cimino Lisa
Cloughesy Timothy F.
Colen Rivka R.
Currie Stuart
Cutler Danielle
Dako Farouk
Davatzikos Christos
Dicker Adam P.
Dixon Luke V. M.
Dorcas Adeleye
Dostál Marek
Dou Qi
Dragos Carmen
Dubbink Hendrikus J.
Edwards Brandon
Ellingson Benjamin M.
Escobar William
Ezhov Ivan
Falcão Alexandre Xavier
Farinhas Joaquim
Fatania Kavi
Flanders Adam E.
Foley Patrick
Fortin David
Franco-Maldonado Heydy
French Pim J.
Frood Russell
Fu Eric
Gahrmann Renske
Gamal Shady
Garrett John
Ghodasara Satyam
Gimpel James
Glocker Ben
Gruzdev Alexey
Guevara Pamela
Gusev Yuriy
Gómez Jhon
Haas Rourke
Hagiwara Akifumi
Haliassos Ilias
Hamghalam Mohammad
Hau Ann-Christin
Haunschmidt Andreas
Heng Pheng Ann
Herrera-Trujillo Alejandro
Hill Michael
Holcomb James
Hu Ricky
Huang Raymond Y.
Incekara Fatih
Ingalhalikar Madhura
Ismael Heba
Jadhav Manali
Jain Rajan
Jeraj Robert
Jiang Meirui
Jones Craig K.
Kalogeropoulou Christina
Kamnitsas Konstantinos
Kapsas Georgios
Kardamakis Dimitrios M.
Karkada Deepthi
Keunen Olivier
Keřkovský Miloš
Kim Ho Sung
Kim Yusung
Klein Stefan
Kofler Florian
Kolodziej Kenneth
Kopřivová Tereza
Kotrotsou Aikaterini
Kozubek Michal
Kumar Neeraj
Kurc Tahsin
LaMontagne Pamela
Landman Bennett
Larson Matthew
Lee Joonsang
Lee Matthew
Lee Seung-Koo
Lepage Martin
Li Hongwei
Liem Spencer
Loayza Francis
Lombardo Joseph
Lucio Diego R.
Lui Yvonne W.
Luo Bing
Lux Filip
López Eduardo
Madhavan Subha
Mahajan Abhishek
Maier-Hein Klaus
Maldjian Joseph A.
Mandel Jacob
Mani Kartik M.
Marcus Daniel
Marella Sailaja
Martin Jason
Martins Samuel B.
Matula Petr
McKinley Richard
Meckel Stephan
Meier Raphael
Mekhaimar Mahmoud
Mendoza Cristobal
Menotti David
Menze Bjoern
Metz Marie
Michálek Jan
Mistry Akshitkumar
Mitchell J. Ross
Modat Marc
Mohan Suyash
Moraes Fabio Y.
Moritani Toshio
Morón Fanny
Moustakas Konstantinos
Murcia Derrick
Muzi Mark
Necker Georg
Niclou Simone P.
Odafe-Oyibotha Olubunmi
Ogbole Godwin
Ormond David Ryan
Osobu Babatunde
Oughourlian Talia
Oyekunle Dotun
Palmer Joshua D.
Panagiotopoulos Vasileios
Pandey Umang
Park Ji Eun
Pati Sarthak
Payne David
Pei Linmin
Pelaez Enrique
Peoples Jacob J.
Pichler Josef
Pinho Marco C.
Poisson Laila
Pouymayou Bertrand
Prabhudesai Snehal
Prasanna Prateek
Preetha Chandrakanth J.
Price Cynthia
Puig Josep
Qayati Mohamed
Quevedo Sebastian
Quintero Carmen Balaña
Radojewski Piotr
Ramadass Karthik
Rao Arvind
Raymond Catalina
Reddy Divya
Reina G. Anthony
Reyes Mauricio
Rudie Jeffrey
Ríos Elvis
Sahm Felix
Saini Jitender
Sair Haris I.
Sako Chiharu
Saltz Joel
Sayah Anousheh
Schmidt Kendall
Schouten Joost W
Shah Prashant
Sharma Sonam
Shaykh Hassan F.
Sheller Micah
Shrestha Sampurna
Shu\u27aibu Mustapha
Shuaib Haris
Shukla Gaurav
Simpson Amber L.
Sloan Andrew E.
Slotboom Johannes
Smits Marion
So Tiffany Y.
Soneye Mayowa
Sprenger Flávia
Srinivasan Ashok
Teixeira Bernardo C. A.
Teuwen Jonas
Thompson John
Thompson Reid C.
Tiwari Pallavi
To Minh-Son
Torche Esteban
Tran Anh
Trenkler Johannes
Trujillo Maria
Tseng Tzu-Chi
Tsiganos Panagiotis
Turk Sevcan
Vadmal Vachan
Vallières Martin
van den Bent Martin J.
van der Voort Sebastian R.
Veettil Deepak Kattil
Velastin Sergio A.
Venkataraman Archana
Vera Franco
Verma Ruchika
Villanueva-Meyer Javier
Vincent Arnaud J. P. E.
Vogelbaum Michael A.
Vollmuth Philipp
Vybíhal Václav
Wagner Benjamin C.
Waite Kristin
Wang Chencai
Wang Nicholas
Wang Shih-Han
Weiss Tobias
Weller Michael
Wen Ning
Wick Wolfgang
Wiest Roland
Wiestler Benedikt
Wijnenga Maarten M. J.
Williams Matthew
Xu Kaiwen
Yadav Ipsa
Yogananda Chandan Ganesh Bangalore
Yoshiaki Ota
Yuan Yading
Yun Jihye
Zacharaki Evangelia I
Zampakis Peter
Zenk Maximilian
Publication venue: Jefferson Digital Commons
Publication date: 05/12/2022
Field of study

Although machine learning (ML) has shown promise across disciplines, out-of-sample generalizability is concerning. This is currently addressed by sharing multi-site data, but such centralization is challenging/infeasible to scale due to various limitations. Federated ML (FL) provides an alternative paradigm for accurate and generalizable ML, by only sharing numerical model updates. Here we present the largest FL study to-date, involving data from 71 sites across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, reporting the largest such dataset in the literature (n = 6, 314). We demonstrate a 33% delineation improvement for the surgically targetable tumor, and 23% for the complete tumor extent, over a publicly trained model. We anticipate our study to: 1) enable more healthcare studies informed by large diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further analyses for glioblastoma by releasing our consensus model, and 3) demonstrate the FL effectiveness at such scale and task-complexity as a paradigm shift for multi-site collaborations, alleviating the need for data-sharing

Jefferson Digital Commons