Machine Learning Driven Email Phishing Detection

Ayekple, Selasi Tsatsu; Bekoe, Ransford Fiifi; Addy, Ebenezer; Engmann, Felicia

text

oai:digitalcommons.kennesaw.edu:acist-1282

Machine Learning Driven Email Phishing Detection

Authors: Selasi Tsatsu Ayekple
Ransford Fiifi Bekoe
Ebenezer Addy
Felicia Engmann
Publication date: 28 August 2025
Publisher: DigitalCommons@Kennesaw State University

Abstract

Phishing attacks pose significant risks to cybersecurity, exploiting user trust through deceptive email content. This paper presents a machine learning based framework for detecting phishing emails using a 2024 dataset comprising over 80,000 labeled samples sourced from PhishTank and Kaggle. Features were engineered from URLs, email content, and metadata. Five models— Logistic Regression, Support Vector Machine (SVM), Random Forest, XGBoost, and K-Nearest Neighbors (KNN)—were evaluated. Simulated results demonstrate that ensemble models, particularly Random Forest and XGBoost, delivered optimal results, with near-perfect accuracy and recall. The study highlights the efficacy of combining feature-based engineering with ensemble learning to enhance real-time phishing detection

text

Similar works

Full text

DigitalCommons@Kennesaw State University

oai:digitalcommons.kennesaw.ed...

Last time updated on 24/01/2026

This paper was published in DigitalCommons@Kennesaw State University.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.