
    Adversarial Machine Learning in the Wild

    Deep neural networks are making their way into our everyday lives at an increasing rate. While the adoption of these models has brought clear benefits, it has also opened the door to new vulnerabilities in real-world systems. More specifically, within the scope of this work we are interested in one class of vulnerabilities: adversarial attacks. Given the importance and sensitivity of the tasks these models are responsible for, it is crucial to study such vulnerabilities in real-world systems. In this work, we look at examples of deep neural network-based real-world systems, vulnerabilities of such systems, and approaches for making such systems more robust.

    First, we study an example of leveraging a deep neural network in a business-critical real-world system. We discuss how deep neural networks improve the quality of smart voice assistants; more specifically, we introduce how collaborative filtering models can automatically detect and resolve the errors of a voice assistant, and we discuss the success of this approach in improving the quality of a real-world voice assistant.

    Second, we demonstrate a proof of concept for an adversarial attack against content-based recommendation systems, which are commonly used in real-world settings. We discuss how malicious actors can add unnoticeable perturbations to the content they upload to a website to achieve their preferred outcomes, and we show how adversarial training can render such attacks useless.

    Third, we discuss another example of how adversarial attacks can be leveraged to manipulate a real-world system. We study how adversarial attacks can successfully manipulate YouTube's copyright detection model and the financial implications of this vulnerability. In particular, we show how adversarial examples created for a copyright detection model that we implemented transfer to another black-box model.

    Finally, we study the problem of transfer learning in an adversarially robust setting. We discuss how robust models contain robust feature extractors and how we can leverage them to train new classifiers that preserve the robustness of the original model. We then study the case of fine-tuning in the target domain while preserving robustness, and we show that our proposed solutions successfully preserve robustness in the target domain.
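
    As a concrete illustration of the attack surface described above, the unnoticeable perturbations follow the standard gradient-based recipe for adversarial examples. Below is a minimal sketch, assuming a differentiable model that maps an uploaded content item to a recommendation score; the function and parameter names (fgsm_perturb, target_score, epsilon) are illustrative assumptions, not the implementation used in this work.

        import torch

        def fgsm_perturb(model, content, target_score, epsilon=0.01):
            # One-step gradient-sign perturbation: nudge the uploaded content so that
            # the model's score moves toward the attacker's preferred target, while
            # keeping the change bounded by epsilon and hence hard to notice.
            content = content.clone().detach().requires_grad_(True)
            loss = torch.nn.functional.mse_loss(model(content), target_score)
            loss.backward()
            perturbed = content - epsilon * content.grad.sign()
            return perturbed.clamp(0.0, 1.0).detach()  # keep inputs in a valid range

    Adversarial training, mentioned above as a defense, would fold such perturbed examples back into the training set so the model learns to score them correctly.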

    POTs: Protective Optimization Technologies

    Algorithmic fairness aims to address the economic, moral, social, and political impact that digital systems have on populations through solutions that can be applied by service providers. Fairness frameworks do so, in part, by mapping these problems to a narrow definition and assuming the service providers can be trusted to deploy countermeasures. Not surprisingly, these decisions limit fairness frameworks' ability to capture a variety of harms caused by systems. We characterize fairness limitations using concepts from requirements engineering and from social sciences. We show that the focus on algorithms' inputs and outputs misses harms that arise from systems interacting with the world; that the focus on bias and discrimination omits broader harms on populations and their environments; and that relying on service providers excludes scenarios where they are not cooperative or intentionally adversarial. We propose Protective Optimization Technologies (POTs). POTs provide means for affected parties to address the negative impacts of systems in the environment, expanding avenues for political contestation. POTs intervene from outside the system, do not require service providers to cooperate, and can serve to correct, shift, or expose harms that systems impose on populations and their environments. We illustrate the potential and limitations of POTs in two case studies: countering road congestion caused by traffic-beating applications, and recalibrating credit scoring for loan applicants.

    Comment: Appears in Conference on Fairness, Accountability, and Transparency (FAT* 2020). Bogdan Kulynych and Rebekah Overdorf contributed equally to this work. Version v1/v2 by Seda Gürses, Rebekah Overdorf, and Ero Balsa was presented at HotPETS 2018 and at PiMLAI 201

    Adversarially robust transfer learning

    Transfer learning, in which a network is trained on one task and re-purposed on another, is often used to produce neural network classifiers when data is scarce or full-scale training is too costly. When the goal is to produce a model that is not only accurate but also adversarially robust, data scarcity and computational limitations become even more cumbersome. We consider robust transfer learning, in which we transfer not only performance but also robustness from a source model to a target domain. We start by observing that robust networks contain robust feature extractors. By training classifiers on top of these feature extractors, we produce new models that inherit the robustness of their parent networks. We then consider the case of "fine tuning" a network by re-training end-to-end in the target domain. When using lifelong learning strategies, this process preserves the robustness of the source network while achieving high accuracy. By using such strategies, it is possible to produce accurate and robust models with little data, and without the cost of adversarial training. Additionally, we can improve the generalization of adversarially trained models while maintaining their robustness.
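
    As a rough sketch of the first strategy (training a new classifier on top of a frozen robust feature extractor), the following illustrates the idea; the helper name, feature dimension, and optimizer settings are assumptions made for the example, not details taken from the paper.

        import torch
        import torch.nn as nn

        def build_transfer_classifier(robust_backbone, feature_dim, num_classes):
            # Freeze the adversarially trained feature extractor so its robust
            # features are preserved; only the new linear head is trained on
            # the target-domain data.
            for param in robust_backbone.parameters():
                param.requires_grad = False
            head = nn.Linear(feature_dim, num_classes)
            model = nn.Sequential(robust_backbone, head)
            optimizer = torch.optim.SGD(head.parameters(), lr=0.01, momentum=0.9)
            return model, optimizer

    Fine-tuning end-to-end, as discussed above, would instead leave the backbone trainable and rely on lifelong-learning-style regularization to keep its features close to those of the robust source model.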