A multi-modal framework for improving the accuracy of phishing email detection

International Journal of Electrical and Computer Engineering

Abstract

Phishing emails continue to pose a significant cybersecurity threat, particularly through the increasing use of malicious attachments to evade traditional text-based detection systems. Most existing approaches focus primarily on email content, leaving malicious attachments as a blind spot. This paper proposes a multi-modal phishing email classification model that integrates email header features, body text analysis, and attachment inspection within an ensemble learning framework. An independent machine learning classifier is employed for each email component, and a majority voting mechanism determines the final classification decision. The proposed model is evaluated using publicly available email and attachment datasets that are combined to simulate attachment-bearing phishing emails. Experimental results demonstrate strong detection performance across multiple evaluation metrics. Nevertheless, the study acknowledges the limitation of using synthetically paired email bodies and attachments, which may not fully capture real-world semantic relationships. The findings highlight the importance of incorporating attachment-aware analysis into phishing detection systems and provide a foundation for future research on semantic consistency modeling and transformer-based architectures.
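
The majority voting step described in the abstract can be illustrated with a minimal sketch. It assumes each of the three component classifiers (header, body, attachment) has already produced a label. The names `majority_vote`, `PHISHING`, and `LEGITIMATE` are hypothetical and do not come from the paper, and the tie-breaking rule is an assumption added for illustration, not the authors' stated method.

```python
from collections import Counter
from typing import Mapping

# Hypothetical label names, chosen for illustration only.
PHISHING = "phishing"
LEGITIMATE = "legitimate"


def majority_vote(votes: Mapping[str, str]) -> str:
    """Return the label predicted by most component classifiers.

    `votes` maps a component name (e.g. "header", "body", "attachment")
    to the label that component's classifier predicted.
    """
    if not votes:
        raise ValueError("at least one component vote is required")

    counts = Counter(votes.values())
    (top_label, top_count), *rest = counts.most_common()

    # Assumption: on a tie (possible only with an even number of voters),
    # err on the side of caution and flag the email as phishing.
    if rest and rest[0][1] == top_count:
        return PHISHING
    return top_label


if __name__ == "__main__":
    example = {"header": PHISHING, "body": LEGITIMATE, "attachment": PHISHING}
    print(majority_vote(example))
```

With three voters and two labels a tie cannot occur, so the tie rule only matters if a component is missing, for example an email with no attachment.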
