Search CORE

1 research outputs found

Data Driven based Malicious URL Detection using Explainable AI

Author: Chowdhury Deepraj
Dwivedi Ashutosh Dhar
Mukkamala Raghava Rao
Poddar Saranda
Publication venue: IEEE Signal Processing Society
Publication date: 01/01/2022
Field of study

With the ever-increasing reach of the internet, and its increasing access through various types of devices, the spread of malware, phishing attempts, etc. have steadily been increasing, along with their level of sophistication. Thus it becomes very important to conduct research on different methods to prevent such harmful attacks on systems and users. Using a malicious URL is the common way for hackers to attack a system, thus, to accommodate the variety attack vectors of malicious websites, 21 features were extracted from 651,191 URLs to train the proposed model. A two-stage stacked ensemble learning model, based on gradient boosting methods and random forest, has been trained and tested in the 70:30 ratio of the 651,191 URLs, and an accuracy of 97% has been achieved. Then Explainable AI (XAI) has been used to clearly explain the working of the model, and study the impact of each of the 21 features on the 4 class predictions (benign, defacement, phishing and malware).</p

VBN