Detecting Scams Using Large Language Models

Large Language Models (LLMs) have gained prominence in various applications, including security. This paper explores the utility of LLMs in scam detection, a critical aspect of cybersecurity. Unlike traditional applications, we propose a novel use case for LLMs to identify scams, such as phishing, advance fee fraud, and romance scams. We present notable security applications of LLMs and discuss the unique challenges posed by scams. Specifically, we outline the key steps involved in building an effective scam detector using LLMs, emphasizing data collection, preprocessing, model selection, training, and integration into target systems. Additionally, we conduct a preliminary evaluation using GPT-3.5 and GPT-4 on a duplicated email, highlighting their proficiency in identifying common signs of phishing or scam emails. The results demonstrate the models' effectiveness in recognizing suspicious elements, but we emphasize the need for a comprehensive assessment across various language tasks. The paper concludes by underlining the importance of ongoing refinement and collaboration with cybersecurity experts to adapt to evolving threats.

翻译：大型语言模型（LLMs）已在包括安全领域在内的各类应用中崭露头角。本文探讨了LLMs在网络安全关键环节——诈骗检测中的效用。不同于传统应用，我们提出了一种新型用例：利用LLMs识别钓鱼诈骗、预付费诈骗和浪漫诈骗等欺诈行为。我们展现了LLMs在安全领域的显著应用，并论述了诈骗带来的独特挑战。具体而言，我们系统阐述了使用LLMs构建有效诈骗检测器的关键步骤，重点涵盖数据采集、预处理、模型选择、训练及与目标系统的集成。此外，我们基于重复邮件，使用GPT-3.5和GPT-4进行了初步评估，突出了它们在识别钓鱼或诈骗邮件常见特征方面的能力。结果表明这些模型在识别可疑要素方面具有有效性，但我们也强调需跨各类语言任务进行综合评估的必要性。本文最后指出，必须持续优化并与网络安全专家协作，以适应不断演变的威胁态势。

相关内容

SCAM

关注 0

代码分析与操作（SCAM）国际工作会议的目的是将从事与计算机系统源代码的分析和/或操作有关的理论、技术和应用的研究人员和实践者聚集在一起。虽然在更广泛的软件工程界中，人们的注意力都集中在系统开发和演化的其他方面，如规范、设计和需求工程，但源代码是对系统行为的唯一精确描述。因此，对源代码的分析和操作仍然是一个紧迫的问题。官网链接：http://www.ieee-scam.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日