As frontier AI models become more capable, evaluating their potential to enable cyberattacks is crucial for ensuring the safe development of Artificial General Intelligence (AGI). Current cyber evaluation efforts are often ad hoc, lacking systematic analysis of attack phases and guidance on targeted defenses. This work introduces a novel evaluation framework that addresses these limitations by: (1) examining the end-to-end attack chain, (2) identifying gaps in AI threat evaluation, and (3) helping defenders prioritize targeted mitigations and conduct AI-enabled adversary emulation for red teaming. Our approach adapts existing cyberattack chain frameworks for AI systems. We analyzed over 12,000 real-world instances of AI use in cyberattacks catalogued by Google's Threat Intelligence Group. Based on this analysis, we curated seven representative cyberattack chain archetypes and conducted a bottleneck analysis to pinpoint potential AI-driven cost disruptions. Our benchmark comprises 50 new challenges spanning various cyberattack phases. Using this benchmark, we devise targeted cybersecurity model evaluations, report on AI's potential to amplify offensive capabilities across specific attack phases, and offer recommendations for prioritizing defenses. We believe this represents the most comprehensive AI cyber risk evaluation framework published to date.