Spear Phishing With Large Language Models

Recent progress in artificial intelligence (AI), particularly in the domain of large language models (LLMs), has resulted in powerful and versatile dual-use systems. This intelligence can be put towards a wide variety of beneficial tasks, yet it can also be used to cause harm. This study explores one such harm by examining how LLMs can be used for spear phishing, a form of cybercrime that involves manipulating targets into divulging sensitive information. I first explore LLMs' ability to assist with the reconnaissance and message generation stages of a spear phishing attack, where I find that LLMs are capable of assisting with the email generation phase of a spear phishing attack. To explore how LLMs could potentially be harnessed to scale spear phishing campaigns, I then create unique spear phishing messages for over 600 British Members of Parliament using OpenAI's GPT-3.5 and GPT-4 models. My findings provide some evidence that these messages are not only realistic but also cost-effective, with each email costing only a fraction of a cent to generate. Next, I demonstrate how basic prompt engineering can circumvent safeguards installed in LLMs, highlighting the need for further research into robust interventions that can help prevent models from being misused. To further address these evolving risks, I explore two potential solutions: structured access schemes, such as application programming interfaces, and LLM-based defensive systems.

翻译：近期人工智能（AI）的进展，特别是在大语言模型（LLMs）领域，催生了强大且多功能的双重用途系统。这些智能系统可被用于多种有益任务，但也可能被滥用造成危害。本研究聚焦于大语言模型在鱼叉式网络钓鱼攻击中的应用——一种通过操控目标泄露敏感信息的网络犯罪形式。我首先探究了LLMs在鱼叉式网络钓鱼攻击的信息收集与信息生成阶段的辅助能力，发现LLMs能够有效协助攻击者完成邮件撰写环节。为探索LLMs被用于规模化鱼叉式网络钓鱼攻击的潜在可能性，我利用OpenAI的GPT-3.5和GPT-4模型为600多名英国议员生成了个性化网络钓鱼邮件。研究结果表明，这些邮件不仅具有高度真实性，且成本效益显著——每封邮件的生成成本仅需不到一美分。随后，我演示了如何通过基础提示工程绕过LLMs的安全防护机制，这凸显了亟需开展稳健干预措施的进一步研究以防止模型被滥用。为应对这些日益演变的威胁，我探讨了两种潜在解决方案：应用程序接口等结构化访问机制，以及基于LLMs的防御系统。

相关内容

大语言模型

关注 67

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日