Mimicking anti-viruses with machine learning and entropy profiles

Alshahwan; Barandiaran; Bhattacharya; Bischl; Breiman; Chio; Cohen; Frank; Héctor D. Menéndez; José Luis Llorente; Oktavianto; Quinlan; Ripley; Santos; Saxe; Sebastián; Sikorski; Venables

Publication date: 1 January 2019
Publisher: MDPI AG

Abstract

The quality of anti-virus software depends on simple patterns extracted from binary files. Although these patterns have proven effective at detecting the specifics of software, they are extremely sensitive to concealment strategies such as polymorphism or metamorphism. These limitations also make anti-virus software predictable, which creates a security weakness: any black hat with enough information about an anti-virus engine's behaviour can build their own copy of it without any access to the original implementation or database. In this work, we show that this is indeed possible by combining entropy patterns with classification algorithms. Our results, obtained on 57 different anti-virus engines and on Windows disk-resident malware, show that we can mimic their behaviour with an accuracy close to 98% in the best case and 75% in the worst.
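
To make the notion of an entropy pattern more concrete, the following is a minimal sketch of one common way to derive an entropy profile from a binary: the Shannon entropy of consecutive byte windows. The abstract does not say how the profiles are actually computed, so the function names, the non-overlapping windows, and the 256-byte window size are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def shannon_entropy(chunk: bytes) -> float:
    """Shannon entropy of a byte string, in bits per byte (between 0 and 8)."""
    if not chunk:
        return 0.0
    total = len(chunk)
    counts = Counter(chunk)  # frequency of each byte value in the chunk
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def entropy_profile(data: bytes, window: int = 256) -> list[float]:
    """Entropy of each consecutive, non-overlapping window of `data`.

    The window size is a hypothetical choice; the last window may be shorter.
    """
    return [
        shannon_entropy(data[i:i + window])
        for i in range(0, len(data), window)
    ]


if __name__ == "__main__":
    # Synthetic example: a constant region followed by a region that
    # contains every byte value exactly once.
    sample = b"\x00" * 256 + bytes(range(256))
    print(entropy_profile(sample))
```

A profile of this kind, resampled to a fixed length, could then serve as the feature vector for a classifier trained on the labels that a given anti-virus engine assigns, which is the general idea the abstract describes. Packed or encrypted regions tend to show entropy close to 8 bits per byte, while padding and plain text score much lower.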