SWE-chat: Coding Agent Interactions From Real Users in the Wild - 专知论文

会员服务 ·

0

SWE-chat: Coding Agent Interactions From Real Users in the Wild

翻译：暂无翻译

Joachim Baumann,Vishakh Padmakumar,Xiang Li,John Yang,Diyi Yang,Sanmi Koyejo

AI coding agents are being adopted at scale, yet we lack empirical evidence on how people actually use them and how much of their output is useful in practice. We present SWE-chat, the first large-scale dataset of real coding agent sessions collected from open-source developers in the wild. The dataset currently contains 6,000 sessions, comprising more than 63,000 user prompts and 355,000 agent tool calls. SWE-chat is a living dataset; our collection pipeline automatically and continually discovers and processes sessions from public repositories. Leveraging SWE-chat, we provide an initial empirical characterization of real-world coding agent usage and failure modes. We find that coding patterns are bimodal: in 41% of sessions, agents author virtually all committed code ("vibe coding"), while in 23%, humans write all code themselves. Despite rapidly improving capabilities, coding agents remain inefficient in natural settings. Just 44% of all agent-produced code survives into user commits, and agent-written code introduces more security vulnerabilities than code authored by humans. Furthermore, users push back against agent outputs -- through corrections, failure reports, and interruptions -- in 44% of all turns. By capturing complete interaction traces with human vs. agent code authorship attribution, SWE-chat provides an empirical foundation for moving beyond curated benchmarks towards an evidence-based understanding of how AI agents perform in real developer workflows.

翻译：暂无翻译

0

相关内容

《Hello-Agents》项目正式发布，一起从零学习智能体！

《Hello-Agents》项目正式发布，一起从零学习智能体！

专知会员服务

31+阅读 · 1月2日

Agent有望定义万亿劳动力市场

Agent有望定义万亿劳动力市场

专知会员服务

19+阅读 · 2025年6月11日

AI Agent深度（二）：2025 Agent元年，AI从L2向L3发展

AI Agent深度（二）：2025 Agent元年，AI从L2向L3发展

专知会员服务

42+阅读 · 2025年5月5日

人工智能专题报告：Operator和Manus打开AI Agent时代

人工智能专题报告：Operator和Manus打开AI Agent时代

专知会员服务

62+阅读 · 2025年3月12日

再谈工业AI：立足跨模型架构AI中台，落地垂类Agent场景

再谈工业AI：立足跨模型架构AI中台，落地垂类Agent场景

专知会员服务

45+阅读 · 2025年3月9日

AI Agent，大模型时代重要落地方向, 42页ppt

AI Agent，大模型时代重要落地方向, 42页ppt

专知会员服务

290+阅读 · 2023年10月12日

AI Agent下一个热点？复旦最新86页《大型语言模型智能体的崛起与潜力》综述，详述LLM Agent: 大脑、感知和行动

AI Agent下一个热点？复旦最新86页《大型语言模型智能体的崛起与潜力》综述，详述LLM Agent: 大脑、感知和行动

专知会员服务

170+阅读 · 2023年9月15日

工业机器人及工控系统行业深度分析：从ChatGPT到RobotGPT，回答人形机器人八个问题

工业机器人及工控系统行业深度分析：从ChatGPT到RobotGPT，回答人形机器人八个问题

专知会员服务

46+阅读 · 2023年6月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

微软亚研提出VL-BERT：通用的视觉-语言预训练模型

微软亚研提出VL-BERT：通用的视觉-语言预训练模型

机器之心

15+阅读 · 2019年9月3日

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

机器之心

15+阅读 · 2019年7月13日

利用上下文常识，让AI读懂不完整人类指令 | 一周AI最火论文

利用上下文常识，让AI读懂不完整人类指令 | 一周AI最火论文

大数据文摘

12+阅读 · 2019年5月6日

从虚拟到现实，北大等提出基于强化学习的端到端主动目标跟踪方法

从虚拟到现实，北大等提出基于强化学习的端到端主动目标跟踪方法

机器之心

23+阅读 · 2019年4月13日

多图带你读懂 Transformers 的工作原理

多图带你读懂 Transformers 的工作原理

AI研习社

10+阅读 · 2019年3月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

泡泡机器人SLAM

11+阅读 · 2018年10月6日

谷歌 AI：语义文本相似度研究进展

谷歌 AI：语义文本相似度研究进展

AI研习社

22+阅读 · 2018年6月13日

腾讯：机器学习构建通用的数据异常检测平台

腾讯：机器学习构建通用的数据异常检测平台

全球人工智能

11+阅读 · 2018年5月1日

Representation Learning on Network 网络表示学习

Representation Learning on Network 网络表示学习

全球人工智能

10+阅读 · 2017年10月19日

共融机器人战略规划研究和学术交流

国家自然科学基金

15+阅读 · 2016年12月31日

基于人机交互的数据驱动式人群行为建模与仿真研究

国家自然科学基金

4+阅读 · 2015年12月31日

非结构环境下基于三维肢体动作理解的工业机器人交互技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于主题网络的用户内在兴趣发现及演进研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向多用户行为的无线识别关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于跨层网络编码感知的无线传感器网络节能路由协议研究

国家自然科学基金

0+阅读 · 2015年12月31日

机器灵巧手基于触滑觉信息协同的自适应力控制方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

面向人与Agent混合的多团队协作仿真训练方法研究

国家自然科学基金

19+阅读 · 2012年12月31日

数据和模型混合驱动的虚拟人群行为仿真技术研究及其在军事中的应用

国家自然科学基金

10+阅读 · 2011年12月31日

基于群体智能的多Agent协作模型与适应性研究

国家自然科学基金

18+阅读 · 2009年12月31日

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

Arxiv

0+阅读 · 4月30日

SWE-Bench 5G: Benchmarking AI Coding Agents on Telecom Network Engineering Tasks

Arxiv

0+阅读 · 4月29日

Asymmetric Goal Drift in Coding Agents Under Value Conflict

Arxiv

0+阅读 · 4月24日

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Arxiv

0+阅读 · 4月10日

When Agent Markets Arrive

Arxiv

0+阅读 · 4月8日

DoubleAgents: Human-Agent Alignment in a Socially Embedded Workflow

Arxiv

0+阅读 · 4月6日

"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction

Arxiv

0+阅读 · 4月5日

Synergy: A Next-Generation General-Purpose Agent for Open Agentic Web

Arxiv

0+阅读 · 3月30日

Regulating AI Agents

Arxiv

0+阅读 · 3月24日

Verifiable Semantics for Agent-to-Agent Communication

Arxiv

0+阅读 · 3月19日

VIP会员

文章信息

相关主题

最新内容

DeepSeek 版Claude Code，免费小白安装教程来了！

DeepSeek 版Claude Code，免费小白安装教程来了！

专知会员服务

9+阅读 · 5月5日

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

专知会员服务

5+阅读 · 5月5日

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

专知会员服务

5+阅读 · 5月5日

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

专知会员服务

6+阅读 · 5月5日

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

专知会员服务

9+阅读 · 5月5日

《美空军条令出版物 2-0：情报（2026版）》

《美空军条令出版物 2-0：情报（2026版）》

专知会员服务

14+阅读 · 5月5日

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

专知会员服务

6+阅读 · 5月5日

帕兰提尔 Gotham：一个游戏规则改变器

帕兰提尔 Gotham：一个游戏规则改变器

专知会员服务

9+阅读 · 5月5日

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

专知会员服务

3+阅读 · 5月5日

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

专知会员服务

3+阅读 · 5月5日

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

专知会员服务

8+阅读 · 5月4日

【综述】机器人学习中的世界模型：全面综述

【综述】机器人学习中的世界模型：全面综述

专知会员服务

12+阅读 · 5月4日

伊朗的导弹-无人机行动及其对美国威慑的影响

伊朗的导弹-无人机行动及其对美国威慑的影响

专知会员服务

9+阅读 · 5月4日

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

专知会员服务

9+阅读 · 5月4日

战争贩子：2026年第一季度美国对中东潜在军售激增

战争贩子：2026年第一季度美国对中东潜在军售激增

专知会员服务

7+阅读 · 5月4日

相关VIP内容

《Hello-Agents》项目正式发布，一起从零学习智能体！

《Hello-Agents》项目正式发布，一起从零学习智能体！

专知会员服务

31+阅读 · 1月2日

Agent有望定义万亿劳动力市场

Agent有望定义万亿劳动力市场

专知会员服务

19+阅读 · 2025年6月11日

AI Agent深度（二）：2025 Agent元年，AI从L2向L3发展

AI Agent深度（二）：2025 Agent元年，AI从L2向L3发展

专知会员服务

42+阅读 · 2025年5月5日

人工智能专题报告：Operator和Manus打开AI Agent时代

人工智能专题报告：Operator和Manus打开AI Agent时代

专知会员服务

62+阅读 · 2025年3月12日

再谈工业AI：立足跨模型架构AI中台，落地垂类Agent场景

再谈工业AI：立足跨模型架构AI中台，落地垂类Agent场景

专知会员服务

45+阅读 · 2025年3月9日

AI Agent，大模型时代重要落地方向, 42页ppt

AI Agent，大模型时代重要落地方向, 42页ppt

专知会员服务

290+阅读 · 2023年10月12日

AI Agent下一个热点？复旦最新86页《大型语言模型智能体的崛起与潜力》综述，详述LLM Agent: 大脑、感知和行动

AI Agent下一个热点？复旦最新86页《大型语言模型智能体的崛起与潜力》综述，详述LLM Agent: 大脑、感知和行动

专知会员服务

170+阅读 · 2023年9月15日

工业机器人及工控系统行业深度分析：从ChatGPT到RobotGPT，回答人形机器人八个问题

工业机器人及工控系统行业深度分析：从ChatGPT到RobotGPT，回答人形机器人八个问题

专知会员服务

46+阅读 · 2023年6月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

DeepSeek 版Claude Code，免费小白安装教程来了！

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

相关资讯

微软亚研提出VL-BERT：通用的视觉-语言预训练模型

微软亚研提出VL-BERT：通用的视觉-语言预训练模型

机器之心

15+阅读 · 2019年9月3日

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

机器之心

15+阅读 · 2019年7月13日

利用上下文常识，让AI读懂不完整人类指令 | 一周AI最火论文

利用上下文常识，让AI读懂不完整人类指令 | 一周AI最火论文

大数据文摘

12+阅读 · 2019年5月6日

从虚拟到现实，北大等提出基于强化学习的端到端主动目标跟踪方法

从虚拟到现实，北大等提出基于强化学习的端到端主动目标跟踪方法

机器之心

23+阅读 · 2019年4月13日

多图带你读懂 Transformers 的工作原理

多图带你读懂 Transformers 的工作原理

AI研习社

10+阅读 · 2019年3月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

【泡泡图灵智库】密集相关的自监督视觉描述学习（RAL）

泡泡机器人SLAM

11+阅读 · 2018年10月6日

谷歌 AI：语义文本相似度研究进展

谷歌 AI：语义文本相似度研究进展

AI研习社

22+阅读 · 2018年6月13日

腾讯：机器学习构建通用的数据异常检测平台

腾讯：机器学习构建通用的数据异常检测平台

全球人工智能

11+阅读 · 2018年5月1日

Representation Learning on Network 网络表示学习

Representation Learning on Network 网络表示学习

全球人工智能

10+阅读 · 2017年10月19日

相关论文

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

Arxiv

0+阅读 · 4月30日

SWE-Bench 5G: Benchmarking AI Coding Agents on Telecom Network Engineering Tasks

Arxiv

0+阅读 · 4月29日

Asymmetric Goal Drift in Coding Agents Under Value Conflict

Arxiv

0+阅读 · 4月24日

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Arxiv

0+阅读 · 4月10日

When Agent Markets Arrive

Arxiv

0+阅读 · 4月8日

DoubleAgents: Human-Agent Alignment in a Socially Embedded Workflow

Arxiv

0+阅读 · 4月6日

"When to Hand Off, When to Work Together": Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction

Arxiv

0+阅读 · 4月5日

Synergy: A Next-Generation General-Purpose Agent for Open Agentic Web

Arxiv

0+阅读 · 3月30日

Regulating AI Agents

Arxiv

0+阅读 · 3月24日

Verifiable Semantics for Agent-to-Agent Communication

Arxiv

0+阅读 · 3月19日

相关基金

共融机器人战略规划研究和学术交流

国家自然科学基金

15+阅读 · 2016年12月31日

基于人机交互的数据驱动式人群行为建模与仿真研究

国家自然科学基金

4+阅读 · 2015年12月31日

非结构环境下基于三维肢体动作理解的工业机器人交互技术研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于主题网络的用户内在兴趣发现及演进研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向多用户行为的无线识别关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于跨层网络编码感知的无线传感器网络节能路由协议研究

国家自然科学基金

0+阅读 · 2015年12月31日

机器灵巧手基于触滑觉信息协同的自适应力控制方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

面向人与Agent混合的多团队协作仿真训练方法研究

国家自然科学基金

19+阅读 · 2012年12月31日

数据和模型混合驱动的虚拟人群行为仿真技术研究及其在军事中的应用

国家自然科学基金

10+阅读 · 2011年12月31日

基于群体智能的多Agent协作模型与适应性研究

国家自然科学基金

18+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员