Agents of Chaos

Natalie Shapira, Chris Wendler, Avery Yen, Gabriele Sarti, Koyena Pal, Olivia Floody, Adam Belfki, Alex Loftus, Aditya Ratan Jannali, Nikhil Prakash, Jasmine Cui, Giordano Rogers, Jannik Brinkmann, Can Rager, Amir Zur, Michael Ripa, Aruna Sankaranarayanan, David Atkinson, Rohit Gandikota, Jaden Fiotto-Kaufman, EunJeong Hwang, Hadas Orgad, P Sam Sahil, Negev Taglicht, Tomer Shabtay, Atai Ambus, Nitay Alon, Shiri Oron, Ayelet Gordon-Tapiero, Yotam Kaplan, Vered Shwartz, Tamar Rott Shaham, Christoph Riedl, Reuth Mirsky, Maarten Sap, David Manheim, Tomer Ullman, David Bau

We report an exploratory red-teaming study of autonomous language-model-powered agents deployed in a live laboratory environment with persistent memory, email accounts, Discord access, file systems, and shell execution. Over a two-week period, twenty AI researchers interacted with the agents under benign and adversarial conditions. Focusing on failures emerging from the integration of language models with autonomy, tool use, and multi-party communication, we document eleven representative case studies. Observed behaviors include unauthorized compliance with non-owners, disclosure of sensitive information, execution of destructive system-level actions, denial-of-service conditions, uncontrolled resource consumption, identity spoofing vulnerabilities, cross-agent propagation of unsafe practices, and partial system takeover. In several cases, agents reported task completion while the underlying system state contradicted those reports. We also report on some of the failed attempts. Our findings establish the existence of security-, privacy-, and governance-relevant vulnerabilities in realistic deployment settings. These behaviors raise unresolved questions regarding accountability, delegated authority, and responsibility for downstream harms, and warrant urgent attention from legal scholars, policymakers, and researchers across disciplines. This report serves as an initial empirical contribution to that broader conversation.
