Explaining Vulnerabilities of Deep Learning to Adversarial Malware Binaries

Abstract

Recent work has shown that deep-learning algorithms for malware detection are also susceptible to adversarial examples, i.e., carefully-crafted perturbations to input malware that lead to misleading classifications. Although this has called their suitability for this task into question, it is not yet clear why such algorithms are so easily fooled in this particular application domain as well. In this work, we take a first step toward tackling this issue by leveraging explainable machine-learning algorithms developed to interpret the black-box decisions of deep neural networks. In particular, we use an explainable technique known as feature attribution to identify the most influential input features contributing to each decision, and adapt it to provide meaningful explanations of the classification of malware binaries. We find that a recently-proposed convolutional neural network does not learn any meaningful characteristics for malware detection from the data and text sections of executable files, but rather tends to discriminate between benign and malware samples based on characteristics found in the file header. Based on this finding, we propose a novel attack algorithm that generates adversarial malware binaries by changing only a few tens of bytes in the file header. Compared to other state-of-the-art attack algorithms, our attack does not require injecting any padding bytes at the end of the file, and it is much more efficient, as it requires manipulating far fewer bytes.
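To make the feature-attribution step concrete, the sketch below shows one simple way such an analysis can be set up. It is not the paper's method: the ToyByteCNN model and the gradient-times-input attribution are assumptions standing in for the byte-level convolutional network and the attribution technique studied in the paper, and all names and sizes are illustrative.

    # Minimal sketch, assuming a toy byte-level CNN and gradient-x-input
    # attribution; not the exact model or attribution method from the paper.
    import torch
    import torch.nn as nn

    class ToyByteCNN(nn.Module):
        def __init__(self, vocab=257, emb=8):
            super().__init__()
            self.embed = nn.Embedding(vocab, emb)          # one embedding per byte value
            self.conv = nn.Conv1d(emb, 16, kernel_size=8)  # 1D convolution over the byte sequence
            self.fc = nn.Linear(16, 1)                     # single "maliciousness" score

        def forward(self, emb_x):                          # takes embeddings, so we can differentiate w.r.t. them
            h = torch.relu(self.conv(emb_x.transpose(1, 2)))
            h = torch.max(h, dim=2).values                 # global max pooling over byte positions
            return self.fc(h).squeeze(1)

    model = ToyByteCNN()
    x = torch.randint(0, 256, (1, 4096))                   # hypothetical 4 KB file as a raw byte sequence

    emb_x = model.embed(x).detach().requires_grad_(True)   # embedded bytes as a leaf tensor
    model(emb_x).sum().backward()                          # gradient of the score w.r.t. each byte embedding

    # Per-byte attribution: gradient x input, summed over embedding dimensions.
    attribution = (emb_x.grad * emb_x).sum(dim=2).squeeze(0)
    top_offsets = attribution.abs().topk(10).indices       # byte offsets with the largest influence
    print(top_offsets)

In this kind of analysis, offsets that consistently receive high attribution across samples indicate which regions of the binary (e.g., the file header) the network actually relies on for its decisions.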