How the Softmax Output is Misleading for Evaluating the Strength of Adversarial Examples

De Neve, Wesley; Ozbulak, Utku; Van Messem, Arnout

slides


Authors: Wesley De Neve, Utku Ozbulak, Arnout Van Messem
Publication date: 1 January 2018

Abstract

Even before deep learning architectures became the de facto models for complex computer vision tasks, the softmax function was, given its elegant properties, already used to analyze the predictions of feedforward neural networks. Nowadays, the output of the softmax function is also commonly used to assess the strength of adversarial examples: malicious data points designed to make machine learning models fail during the testing phase. However, in this paper, we show that it is possible to generate adversarial examples that take advantage of some properties of the softmax function, leading to undesired outcomes when interpreting the strength of the adversarial examples at hand. Specifically, we argue that the output of the softmax function is a poor indicator of the strength of an adversarial example and that this indicator can be easily tricked by existing methods for adversarial example generation.
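The abstract does not state which softmax properties are exploited. As a minimal sketch of one property that could plausibly be relevant (an assumption, not the authors' method), the snippet below shows how softmax saturates: two hypothetical logit vectors whose winning margins differ by a factor of ten yield practically the same top probability, so the softmax value alone cannot tell them apart. The function `softmax` and the vectors `weak` and `strong` are illustrative names and values, not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)  # subtract the max so exp() cannot overflow
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for a 3-class model (illustrative values only).
weak = [20.0, 0.0, 0.0]     # winning class leads by a margin of 20
strong = [200.0, 0.0, 0.0]  # winning class leads by a margin of 200

p_weak = softmax(weak)
p_strong = softmax(strong)

# Logit margin: gap between the top logit and the runner-up.
margin_weak = weak[0] - max(weak[1:])
margin_strong = strong[0] - max(strong[1:])

print(f"top softmax probability: {p_weak[0]:.6f} vs {p_strong[0]:.6f}")
print(f"logit margin:            {margin_weak:.1f} vs {margin_strong:.1f}")

# Softmax is also invariant to adding a constant to every logit.
shifted = softmax([z + 5.0 for z in weak])
```

In this sketch the logit margin still separates the two cases while the softmax output does not, which is one way to see why a near-1.0 confidence says little about how far an input has been pushed past the decision boundary.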

Available Versions

Open Repository and Bibliography - Liège

oai:orbi.ulg.ac.be:2268/257831

Last time updated on 14/10/2021

Open Repository and Bibliography - Liège

oai:orbi.ulg.ac.be:2268/265534

Last time updated on 23/12/2021

Ghent University Academic Bibliography

oai:archive.ugent.be:8586221

Last time updated on 17/03/2019