
    Game theoretic mixed experts for combinational adversarial machine learning

    Recent advances in adversarial machine learning have shown that defenses considered robust are actually susceptible to adversarial attacks specifically tailored to target their weaknesses. These defenses include Barrage of Random Transforms (BaRT), Friendly Adversarial Training (FAT), Trash is Treasure (TiT), and ensemble models made up of Vision Transformers (ViTs), Big Transfer models, and Spiking Neural Networks (SNNs). A natural question arises: how can one best leverage a combination of adversarial defenses to thwart such attacks? In this paper, we provide a game-theoretic framework for ensemble adversarial attacks and defenses that answers this question. In addition to our framework, we produce the first adversarial defense transferability study, further motivating the need for combinational defenses built from a diverse set of defense architectures. Our framework, Game theoretic Mixed Experts (GaME), is designed to find the Mixed-Nash strategy for a defender facing an attacker employing compositional adversarial attacks. We show that this framework creates an ensemble of defenses with greater robustness than multiple state-of-the-art single-model defenses, as well as combinational defenses with uniform probability distributions. Overall, our framework and analyses advance the field of adversarial machine learning by yielding new insights into compositional attack and defense formulations.
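    For context, the Mixed-Nash strategy that GaME searches for can, in a finite zero-sum game, be computed with a standard linear program: the defender maximizes the worst-case expected payoff over all attack columns. Below is a minimal sketch of that computation; the payoff matrix, the defense row labels, and the attack columns are illustrative placeholders, not values from the paper.

```python
# Minimal sketch: defender's mixed-Nash strategy for a zero-sum
# defense-vs-attack game via linear programming (illustrative data only).
import numpy as np
from scipy.optimize import linprog

# payoff[i, j] = defender's robust accuracy when playing defense i
# against compositional attack j (made-up numbers for illustration).
payoff = np.array([
    [0.60, 0.20, 0.45],   # e.g. a BaRT-style defense
    [0.35, 0.55, 0.40],   # e.g. a FAT-style defense
    [0.50, 0.30, 0.65],   # e.g. a ViT/SNN ensemble
])
n_def, n_atk = payoff.shape

# Variables x = (p_1, ..., p_n, v): maximize v such that every attack
# column yields expected payoff >= v and the mixture p sums to 1.
c = np.zeros(n_def + 1)
c[-1] = -1.0                                          # linprog minimizes, so minimize -v
A_ub = np.hstack([-payoff.T, np.ones((n_atk, 1))])    # v - (payoff^T p)_j <= 0
b_ub = np.zeros(n_atk)
A_eq = np.hstack([np.ones((1, n_def)), np.zeros((1, 1))])
b_eq = np.array([1.0])
bounds = [(0.0, 1.0)] * n_def + [(None, None)]        # v is unbounded

res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
mixed_strategy, game_value = res.x[:n_def], res.x[-1]
print("defender mixture:", mixed_strategy.round(3))
print("worst-case robust accuracy:", round(game_value, 3))
```

    The defender then samples a defense from this mixture at inference time, so no single tailored attack can exploit a fixed choice of defense.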

    Besting the black-box: Barrier zones for adversarial example defense

    Adversarial machine learning defenses have primarily focused on mitigating static, white-box attacks. However, it remains an open question whether such defenses are robust under an adaptive black-box adversary. In this paper, we focus specifically on the black-box threat model and make the following contributions. First, we develop an enhanced adaptive black-box attack that is experimentally shown to be ≥ 30% more effective than the original adaptive black-box attack proposed by Papernot et al. Second, we test 10 recent defenses using our new attack and propose our own black-box defense based on barrier zones. We show that our barrier-zone defense offers significant improvements in security over state-of-the-art defenses, including greater than 85% robust accuracy against black-box boundary attacks, transfer attacks, and our new adaptive black-box attack on the datasets we study. For completeness, we verify our claims through extensive experimentation with 10 other defenses, using three adversarial models (14 different black-box attacks) on two datasets (CIFAR-10 and Fashion-MNIST).
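    For background, the original adaptive black-box attack of Papernot et al. that this work improves on trains a synthetic substitute model by repeatedly querying the target for labels, then transfers adversarial examples crafted on the substitute. The sketch below shows that general loop under stated assumptions: the substitute architecture, the query_target() oracle, and the single-step FGSM crafting are illustrative stand-ins, not the paper's enhanced attack.

```python
# Minimal sketch of a substitute-model (Papernot-style) black-box attack.
# Assumes images in [0, 1]; query_target() is a stand-in for the black-box
# oracle that returns the target model's predicted labels.
import torch
import torch.nn.functional as F

def fgsm(model, x, y, eps):
    """One-step FGSM on the substitute; the perturbation then transfers."""
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    return (x + eps * x.grad.sign()).clamp(0.0, 1.0).detach()

def substitute_transfer_attack(substitute, query_target, seed_x, eps, rounds, lam=0.1):
    x = seed_x
    opt = torch.optim.Adam(substitute.parameters())
    for _ in range(rounds):
        # 1. Label the current synthetic set with the black-box oracle.
        y = query_target(x)
        # 2. Train the substitute to mimic the target's decision boundary.
        for _ in range(10):
            opt.zero_grad()
            F.cross_entropy(substitute(x), y).backward()
            opt.step()
        # 3. Augment the synthetic set by perturbing toward the substitute's
        #    decision boundary (Jacobian-style augmentation, doubling the set).
        x = torch.cat([x, fgsm(substitute, x, y, lam)])
    # 4. Craft adversarial examples on the substitute and transfer them
    #    to the black-box target.
    return fgsm(substitute, x, query_target(x), eps)
```

    An adaptive variant of this loop is retrained against each deployed defense, which is why a defense that only survives static white-box evaluation can still fail here.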

    Back in Black: A comparative evaluation of recent state-of-the-art black-box attacks

    No full text
    The field of adversarial machine learning has experienced near-exponential growth in the number of papers produced since 2018. This massive output has yet to be properly processed and categorized. In this paper, we seek to help alleviate this problem by systematizing the recent advances in adversarial machine learning black-box attacks since 2019. Our survey summarizes and categorizes 20 recent black-box attacks. We also present a new analysis for understanding the attack success rate with respect to the adversarial model used in each paper. Overall, our paper surveys a wide body of literature to highlight recent attack developments and organizes them into four categories: score-based attacks, decision-based attacks, transfer attacks, and non-traditional attacks. Further, we provide a new mathematical framework showing exactly how attack results can be fairly compared.