Enhancing Malware Detection Through Machine Learning Using XAI with SHAP Framework

Abstract

Malware represents a significant cyber threat that can potentially disrupt any activities within an organization. There is a need to devise effective proactive methods for malware detection, thereby minimizing the associated risks. However, this task is challenging due to the ever-growing volume of malware data and the continuously evolving techniques employed by malicious actors. In this context, machine learning models offer a promising approach to identify key malware features and facilitate accurate detection. Machine learning has proven to be effective in detecting malware and has recently gained widespread attention from both the academic and research sectors. Despite their effectiveness, current research on machine learning (ML) models for malware detection often lacks necessary explanations for the selection of key features. This opacity of ML models can complicate the understanding of the outputs, errors, and decision-making processes. To address this challenge, this research uses Explainable AI (XAI), particularly the SHAP framework, to enhance transparency and interpretability. By providing extensive insights into how each feature contributes to the model’s conclusions, the approach further improves the model’s accountability. An experiment was conducted to demonstrate the applicability of the proposed method, beginning with the training of the chosen machine learning models, including Random Forest, Adaboost, Support Vector Machine and Artificial Neural Network, for detecting malware, and concluding with the explanation of the decision-making process using XAI techniques. The results showed high accuracy in malware detection, along with comprehensive explanations of the feature contributions, which justifies the outputs produced by the models.</p

Similar works

Full text

thumbnail-image

ARU Anglia Ruskin Research (ARRO)

redirect
Last time updated on 15/05/2025

This paper was published in ARU Anglia Ruskin Research (ARRO).

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.

Licence: CC BY-NC-ND 4.0