A Systematic Security Evaluation of OpenClaw and Its Variants - 专知论文

会员服务 ·

0

Agent · MoDELS · OpenClaw · Backbone · AI ·

A Systematic Security Evaluation of OpenClaw and Its Variants

翻译：暂无翻译

Yuhang Wang,Haichang Gao,Zhenxing Niu,Zhaoxiang Liu,Wenjing Zhang,Xiang Wang,Shiguo Lian

from arxiv, 39 pages, 14 figures

Tool-augmented AI agents substantially extend the practical capabilities of large language models, but they also introduce security risks that cannot be identified through model-only evaluation. In this paper, we present a systematic security assessment of six representative OpenClaw-series agent frameworks, namely OpenClaw, AutoClaw, QClaw, KimiClaw, MaxClaw, and ArkClaw, under multiple backbone models. To support this study, we construct a benchmark of 205 test cases covering representative attack behaviors across the full agent execution lifecycle, enabling unified evaluation of risk exposure at both the framework and model levels. Our results show that all evaluated agents exhibit substantial security vulnerabilities, and that agentized systems are significantly riskier than their underlying models used in isolation. In particular, reconnaissance and discovery behaviors emerge as the most common weaknesses, while different frameworks expose distinct high-risk profiles, including credential leakage, lateral movement, privilege escalation, and resource development. These findings indicate that the security of modern agent systems is shaped not only by the safety properties of the backbone model, but also by the coupling among model capability, tool use, multi-step planning, and runtime orchestration. We further show that once an agent is granted execution capability and persistent runtime context, weaknesses arising in early stages can be amplified into concrete system-level failures. Overall, our study highlights the need to move beyond prompt-level safeguards toward lifecycle-wide security governance for intelligent agent frameworks.

翻译：暂无翻译

0

相关内容

Agent

AI原生组织：OpenClaw推动组织形态重塑，47页pdf

AI原生组织：OpenClaw推动组织形态重塑，47页pdf

专知会员服务

24+阅读 · 3月27日

OpenClaw完全指南：从入门到精通｜附629页PDF文件下载

OpenClaw完全指南：从入门到精通｜附629页PDF文件下载

专知会员服务

88+阅读 · 3月14日

清华大学：OpenClaw发展研究1.0报告｜附75页PDF文件下载

清华大学：OpenClaw发展研究1.0报告｜附75页PDF文件下载

专知会员服务

121+阅读 · 3月6日

AI 智能体系统：体系架构、应用场景及评估范式

AI 智能体系统：体系架构、应用场景及评估范式

专知会员服务

68+阅读 · 1月6日

自进化人工智能体的全面综述：连接基础模型与终身自主智能系统的新范式

自进化人工智能体的全面综述：连接基础模型与终身自主智能系统的新范式

专知会员服务

33+阅读 · 2025年12月28日

OpenAI 32页《智能体》指南，如何构建首个智能体系统

OpenAI 32页《智能体》指南，如何构建首个智能体系统

专知会员服务

50+阅读 · 2025年4月18日

大规模安全：大模型安全的全面综述

大规模安全：大模型安全的全面综述

专知会员服务

35+阅读 · 2025年2月11日

世界模型：安全性视角

世界模型：安全性视角

专知会员服务

43+阅读 · 2024年11月17日

大型语言模型网络安全综述

大型语言模型网络安全综述

专知会员服务

68+阅读 · 2024年5月12日

【2024新书】大型语言模型安全开发者手册，250页pdf

【2024新书】大型语言模型安全开发者手册，250页pdf

专知会员服务

76+阅读 · 2024年2月12日

【AI+军事】《用于威胁评估的人工智能工具》加拿大国防研究和发展部技术报告，附中文版pdf

【AI+军事】《用于威胁评估的人工智能工具》加拿大国防研究和发展部技术报告，附中文版pdf

专知

91+阅读 · 2022年4月17日

OpenVSLAM：日本新开源”全能“视觉SLAM框架

OpenVSLAM：日本新开源”全能“视觉SLAM框架

计算机视觉life

13+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

BERT 现已开源：最先进的 NLP 预训练技术，支持中文和更多语言

BERT 现已开源：最先进的 NLP 预训练技术，支持中文和更多语言

谷歌开发者

16+阅读 · 2018年11月6日

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

泡泡机器人SLAM

11+阅读 · 2018年10月6日

变分自编码器VAE：原来是这么一回事 | 附开源代码

变分自编码器VAE：原来是这么一回事 | 附开源代码

PaperWeekly

12+阅读 · 2018年3月23日

基础 | 基于注意力机制的seq2seq网络

基础 | 基于注意力机制的seq2seq网络

黑龙江大学自然语言处理实验室

16+阅读 · 2018年3月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

资源 | 清华大学开源OpenKE：知识表示学习平台

资源 | 清华大学开源OpenKE：知识表示学习平台

机器之心

10+阅读 · 2017年11月4日

基于动态网络结构的膜计算系统及其算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

复杂系统中多密码算法密钥协同安全研究

国家自然科学基金

0+阅读 · 2015年12月31日

可证明安全的确定性公钥加密体制研究

国家自然科学基金

0+阅读 · 2015年12月31日

输入约束下的多智能体系统完全分布式协调控制研究

国家自然科学基金

5+阅读 · 2015年12月31日

属性驱动的自适应多agent系统设计关键技术研究

国家自然科学基金

2+阅读 · 2015年12月31日

移动与可穿戴计算中Eyes-Free交互界面研究

国家自然科学基金

0+阅读 · 2014年12月31日

网络化控制系统安全理论与关键技术

国家自然科学基金

1+阅读 · 2014年12月31日

Android移动终端多语种基础软件组合的安全技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

多域网络安全的异构策略语义形态与验证机制

国家自然科学基金

0+阅读 · 2014年12月31日

物联网关键技术RFID系统安全测试的仿真架构.评估模型和受攻击模式的研究和实践

国家自然科学基金

2+阅读 · 2014年12月31日

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Arxiv

0+阅读 · 4月29日

Security Considerations for Multi-agent Systems

Arxiv

0+阅读 · 4月26日

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

Arxiv

0+阅读 · 4月21日

Security Threat Modeling for Emerging AI-Agent Protocols: A Comparative Analysis of MCP, A2A, Agora, and ANP

Arxiv

0+阅读 · 4月17日

Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems

Arxiv

0+阅读 · 4月15日

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Arxiv

0+阅读 · 4月7日

Foundations for Agentic AI Investigations from the Forensic Analysis of OpenClaw

Arxiv

0+阅读 · 4月7日

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Arxiv

0+阅读 · 4月6日

A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework

Arxiv

0+阅读 · 3月29日

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Arxiv

0+阅读 · 3月19日

VIP会员

文章信息

相关主题

最新内容

DeepSeek 版Claude Code，免费小白安装教程来了！

DeepSeek 版Claude Code，免费小白安装教程来了！

专知会员服务

6+阅读 · 5月5日

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

专知会员服务

2+阅读 · 5月5日

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

专知会员服务

0+阅读 · 5月5日

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

专知会员服务

3+阅读 · 5月5日

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

专知会员服务

4+阅读 · 5月5日

《美空军条令出版物 2-0：情报（2026版）》

《美空军条令出版物 2-0：情报（2026版）》

专知会员服务

9+阅读 · 5月5日

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

专知会员服务

3+阅读 · 5月5日

帕兰提尔 Gotham：一个游戏规则改变器

帕兰提尔 Gotham：一个游戏规则改变器

专知会员服务

5+阅读 · 5月5日

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

专知会员服务

2+阅读 · 5月5日

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

专知会员服务

2+阅读 · 5月5日

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

专知会员服务

8+阅读 · 5月4日

【综述】机器人学习中的世界模型：全面综述

【综述】机器人学习中的世界模型：全面综述

专知会员服务

10+阅读 · 5月4日

伊朗的导弹-无人机行动及其对美国威慑的影响

伊朗的导弹-无人机行动及其对美国威慑的影响

专知会员服务

8+阅读 · 5月4日

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

专知会员服务

8+阅读 · 5月4日

战争贩子：2026年第一季度美国对中东潜在军售激增

战争贩子：2026年第一季度美国对中东潜在军售激增

专知会员服务

6+阅读 · 5月4日

相关VIP内容

AI原生组织：OpenClaw推动组织形态重塑，47页pdf

AI原生组织：OpenClaw推动组织形态重塑，47页pdf

专知会员服务

24+阅读 · 3月27日

OpenClaw完全指南：从入门到精通｜附629页PDF文件下载

OpenClaw完全指南：从入门到精通｜附629页PDF文件下载

专知会员服务

88+阅读 · 3月14日

清华大学：OpenClaw发展研究1.0报告｜附75页PDF文件下载

清华大学：OpenClaw发展研究1.0报告｜附75页PDF文件下载

专知会员服务

121+阅读 · 3月6日

AI 智能体系统：体系架构、应用场景及评估范式

AI 智能体系统：体系架构、应用场景及评估范式

专知会员服务

68+阅读 · 1月6日

自进化人工智能体的全面综述：连接基础模型与终身自主智能系统的新范式

自进化人工智能体的全面综述：连接基础模型与终身自主智能系统的新范式

专知会员服务

33+阅读 · 2025年12月28日

OpenAI 32页《智能体》指南，如何构建首个智能体系统

OpenAI 32页《智能体》指南，如何构建首个智能体系统

专知会员服务

50+阅读 · 2025年4月18日

大规模安全：大模型安全的全面综述

大规模安全：大模型安全的全面综述

专知会员服务

35+阅读 · 2025年2月11日

世界模型：安全性视角

世界模型：安全性视角

专知会员服务

43+阅读 · 2024年11月17日

大型语言模型网络安全综述

大型语言模型网络安全综述

专知会员服务

68+阅读 · 2024年5月12日

【2024新书】大型语言模型安全开发者手册，250页pdf

【2024新书】大型语言模型安全开发者手册，250页pdf

专知会员服务

76+阅读 · 2024年2月12日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

DeepSeek 版Claude Code，免费小白安装教程来了！

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

相关资讯

【AI+军事】《用于威胁评估的人工智能工具》加拿大国防研究和发展部技术报告，附中文版pdf

【AI+军事】《用于威胁评估的人工智能工具》加拿大国防研究和发展部技术报告，附中文版pdf

专知

91+阅读 · 2022年4月17日

OpenVSLAM：日本新开源”全能“视觉SLAM框架

OpenVSLAM：日本新开源”全能“视觉SLAM框架

计算机视觉life

13+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

BERT 现已开源：最先进的 NLP 预训练技术，支持中文和更多语言

BERT 现已开源：最先进的 NLP 预训练技术，支持中文和更多语言

谷歌开发者

16+阅读 · 2018年11月6日

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

泡泡机器人SLAM

11+阅读 · 2018年10月6日

变分自编码器VAE：原来是这么一回事 | 附开源代码

变分自编码器VAE：原来是这么一回事 | 附开源代码

PaperWeekly

12+阅读 · 2018年3月23日

基础 | 基于注意力机制的seq2seq网络

基础 | 基于注意力机制的seq2seq网络

黑龙江大学自然语言处理实验室

16+阅读 · 2018年3月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

资源 | 清华大学开源OpenKE：知识表示学习平台

资源 | 清华大学开源OpenKE：知识表示学习平台

机器之心

10+阅读 · 2017年11月4日

相关论文

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Arxiv

0+阅读 · 4月29日

Security Considerations for Multi-agent Systems

Arxiv

0+阅读 · 4月26日

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

Arxiv

0+阅读 · 4月21日

Security Threat Modeling for Emerging AI-Agent Protocols: A Comparative Analysis of MCP, A2A, Agora, and ANP

Arxiv

0+阅读 · 4月17日

Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems

Arxiv

0+阅读 · 4月15日

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Arxiv

0+阅读 · 4月7日

Foundations for Agentic AI Investigations from the Forensic Analysis of OpenClaw

Arxiv

0+阅读 · 4月7日

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Arxiv

0+阅读 · 4月6日

A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework

Arxiv

0+阅读 · 3月29日

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Arxiv

0+阅读 · 3月19日

相关基金

基于动态网络结构的膜计算系统及其算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

复杂系统中多密码算法密钥协同安全研究

国家自然科学基金

0+阅读 · 2015年12月31日

可证明安全的确定性公钥加密体制研究

国家自然科学基金

0+阅读 · 2015年12月31日

输入约束下的多智能体系统完全分布式协调控制研究

国家自然科学基金

5+阅读 · 2015年12月31日

属性驱动的自适应多agent系统设计关键技术研究

国家自然科学基金

2+阅读 · 2015年12月31日

移动与可穿戴计算中Eyes-Free交互界面研究

国家自然科学基金

0+阅读 · 2014年12月31日

网络化控制系统安全理论与关键技术

国家自然科学基金

1+阅读 · 2014年12月31日

Android移动终端多语种基础软件组合的安全技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

多域网络安全的异构策略语义形态与验证机制

国家自然科学基金

0+阅读 · 2014年12月31日

物联网关键技术RFID系统安全测试的仿真架构.评估模型和受攻击模式的研究和实践

国家自然科学基金

2+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员