A Survey on Transferability of Adversarial Examples across Deep Neural Networks

Cao, Xiaochun; de Jorge, Pau; Gu, Jindong; Hu, Anjun; Jia, Xiaojun; Khakzar, Ashkan; Li, Zhijiang; Liu, Xinwei; Ma, Avery; Torr, Philip; Xun, Yuan; Yu, Wenqian

Publication date: 26 October 2023

Abstract

The emergence of Deep Neural Networks (DNNs) has revolutionized various domains, enabling the resolution of complex tasks spanning image recognition, natural language processing, and scientific problem-solving. However, this progress has also exposed a concerning vulnerability: adversarial examples. These crafted inputs, whose perturbations are imperceptible to humans, can manipulate machine learning models into making erroneous predictions, raising concerns for safety-critical applications. An intriguing property of this phenomenon is the transferability of adversarial examples, where perturbations crafted for one model can deceive another, often with a different architecture. This property enables "black-box" attacks, which circumvent the need for detailed knowledge of the target model. This survey explores the landscape of adversarial transferability. We categorize existing methodologies for enhancing adversarial transferability and discuss the fundamental principles guiding each approach. While the predominant body of research concentrates on image classification, we also extend our discussion to other vision tasks and beyond. We discuss challenges and future prospects, highlighting the importance of fortifying DNNs against adversarial vulnerabilities in an evolving landscape.

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2310.17626

Last time updated on 16/01/2024