Robust ML-based Detection of Conventional, LLM-Generated, and Adversarial Phishing Emails Using Advanced Text Preprocessing

Phishing remains a critical cybersecurity threat, especially with the advent of large language models (LLMs) capable of generating highly convincing malicious content. Unlike earlier phishing attempts, which were often identifiable by grammatical errors, misspellings, incorrect phrasing, and inconsistent formatting, LLM-generated emails are grammatically sound, contextually relevant, and linguistically natural. These advancements make phishing emails increasingly difficult to distinguish from legitimate ones, challenging traditional detection mechanisms. Conventional phishing detection systems often fail when faced with emails crafted by LLMs or manipulated using adversarial perturbation techniques. To address this challenge, we propose a robust phishing email detection system featuring an enhanced text preprocessing pipeline. This pipeline includes spelling correction and word splitting to counteract adversarial modifications and improve detection accuracy. Our approach integrates widely adopted natural language processing (NLP) feature extraction techniques and machine learning algorithms. We evaluate our models on publicly available datasets comprising both phishing and legitimate emails, achieving a detection accuracy of 94.26% and an F1-score of 84.39% in a model deployment setting. To assess robustness, we further evaluate our models using adversarial phishing samples generated by four attack methods in the Python TextAttack framework. Additionally, we evaluate the models' performance against phishing emails generated by LLMs, including ChatGPT and Llama. The results highlight the resilience of the models against evolving AI-powered phishing threats.
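The abstract names spelling correction and word splitting as the preprocessing steps that counteract adversarial modifications, but does not specify how they are implemented. The following is a minimal sketch of the general idea only, not the authors' pipeline: it assumes a tiny hypothetical vocabulary (`VOCAB`), uses a dictionary-based dynamic program for word splitting, and uses the standard-library `difflib` for approximate spelling correction. The function names and the `cutoff` value are illustrative assumptions.

```python
import difflib
import re

# Hypothetical toy vocabulary; a real system would load a large dictionary.
VOCAB = {"verify", "your", "account", "password", "now",
         "click", "here", "please", "update", "bank"}


def split_word(token, vocab):
    """Split a run-together token into vocabulary words.

    Returns the split with the fewest words, or None if no split exists.
    """
    n = len(token)
    best = [None] * (n + 1)  # best[i] holds the words covering token[:i]
    best[0] = []
    for i in range(1, n + 1):
        for j in range(i):
            if best[j] is not None and token[j:i] in vocab:
                candidate = best[j] + [token[j:i]]
                if best[i] is None or len(candidate) < len(best[i]):
                    best[i] = candidate
    return best[n]


def correct_word(token, vocab, cutoff=0.75):
    """Replace a token with its closest vocabulary word, if one is close enough."""
    matches = difflib.get_close_matches(token, sorted(vocab), n=1, cutoff=cutoff)
    return matches[0] if matches else token


def normalize(text, vocab=VOCAB):
    """Lowercase, tokenize, then apply word splitting and spelling correction."""
    out = []
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in vocab:
            out.append(token)
            continue
        parts = split_word(token, vocab)
        if parts:
            out.extend(parts)  # e.g. "verifyyour" becomes "verify", "your"
        else:
            out.append(correct_word(token, vocab))
    return " ".join(out)


if __name__ == "__main__":
    print(normalize("Plese verifyyour acount pasword now"))
```

In a full detector, the normalized text would then be passed to a feature extractor and classifier. This sketch only handles alphabetic tokens and does not address other perturbations such as character substitutions with digits or look-alike symbols.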

