Synthetic Computers at Scale for Long-Horizon Productivity Simulation - 专知论文

会员服务 ·

0

Synthetic Computers at Scale for Long-Horizon Productivity Simulation

翻译：暂无翻译

Tao Ge,Baolin Peng,Hao Cheng,Jianfeng Gao

from arxiv, Preview version; work in progress

Realistic long-horizon productivity work is strongly conditioned on user-specific computer environments, where much of the work context is stored and organized through directory structures and content-rich artifacts. To scale synthetic data creation for such productivity scenarios, we introduce Synthetic Computers at Scale, a scalable methodology for creating such environments with realistic folder hierarchies and content-rich artifacts (e.g., documents, spreadsheets, and presentations). Conditioned on each synthetic computer, we run long-horizon simulations: one agent creates productivity objectives that are specific to the computer's user and require multiple professional deliverables and about a month of human work; another agent then acts as that user and keeps working across the computer -- for example, navigating the filesystem for grounding, coordinating with simulated collaborators, and producing professional artifacts -- until these objectives are completed. In preliminary experiments, we create 1,000 synthetic computers and run long-horizon simulations on them; each run requires over 8 hours of agent runtime and spans more than 2,000 turns on average. These simulations produce rich experiential learning signals, whose effectiveness is validated by significant improvements in agent performance on both in-domain and out-of-domain productivity evaluations. Given that personas are abundant at billion scale, this methodology can in principle scale to millions or even billions of synthetic user worlds with sufficient compute, enabling broader coverage of diverse professions, roles, contexts, environments, and productivity needs. We argue that scalable synthetic computer creation, together with at-scale simulations, is highly promising as a foundational substrate for agent self-improvement and agentic reinforcement learning in long-horizon productivity scenarios.

翻译：暂无翻译

0

相关内容

《用计算图变换加速实际工程设计优化》MIT 400页

《用计算图变换加速实际工程设计优化》MIT 400页

专知会员服务

17+阅读 · 2025年11月7日

深度学习在农业领域的研究与应用

深度学习在农业领域的研究与应用

专知会员服务

23+阅读 · 2024年5月3日

最新《工业大模型应用报告》

最新《工业大模型应用报告》

专知会员服务

121+阅读 · 2024年4月5日

AI大模型落地终端，AIPC驱动PC行业新增长

AI大模型落地终端，AIPC驱动PC行业新增长

专知会员服务

48+阅读 · 2024年2月25日

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

专知会员服务

50+阅读 · 2022年4月8日

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

专知会员服务

11+阅读 · 2022年3月12日

【KDD2019|讲座推荐】从生产规模神经网络中发现知识的统计学习方法：Statistical Mechanics Methods for Discovering Knowledge from Production-Scale Neural Networks

【KDD2019|讲座推荐】从生产规模神经网络中发现知识的统计学习方法：Statistical Mechanics Methods for Discovering Knowledge from Production-Scale Neural Networks

专知会员服务

18+阅读 · 2019年12月4日

【中科院计算所】边缘计算与工具综述论文，A Survey on Edge Computing Systems and Tools

【中科院计算所】边缘计算与工具综述论文，A Survey on Edge Computing Systems and Tools

专知会员服务

96+阅读 · 2019年11月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

时间序列预测：一课掌握亚马逊开源算法DeepAR

时间序列预测：一课掌握亚马逊开源算法DeepAR

机器之心

13+阅读 · 2020年6月3日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

机器之心

15+阅读 · 2019年7月13日

【数字孪生】数字孪生是制造业实现“智能+”的技术接口

【数字孪生】数字孪生是制造业实现“智能+”的技术接口

产业智能官

35+阅读 · 2019年4月30日

【数字孪生】数字孪生是工业互联网关键技术和重要场景

【数字孪生】数字孪生是工业互联网关键技术和重要场景

产业智能官

39+阅读 · 2019年4月9日

多图带你读懂 Transformers 的工作原理

多图带你读懂 Transformers 的工作原理

AI研习社

10+阅读 · 2019年3月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【仿真】生产系统仿真软件，实现数字化工厂的利器！

【仿真】生产系统仿真软件，实现数字化工厂的利器！

产业智能官

15+阅读 · 2018年11月1日

【边缘计算】工业互联网正确打开方式系列（四）：边缘计算

【边缘计算】工业互联网正确打开方式系列（四）：边缘计算

产业智能官

19+阅读 · 2018年8月31日

【论文推荐】最新六篇用户建模精选论文推荐—深度多模态融合、跨平台、时序性RNN、ATRank、嵌入因子分解、异构信息网络

【论文推荐】最新六篇用户建模精选论文推荐—深度多模态融合、跨平台、时序性RNN、ATRank、嵌入因子分解、异构信息网络

专知

10+阅读 · 2018年3月10日

3D打印过程能耗建模与智能数据模型研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于工业大数据挖掘的复杂产品总完工时间动态预测

国家自然科学基金

4+阅读 · 2015年12月31日

基于感性工学与视觉感知协同优化的产品设计理论及应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

“数据-知识”驱动的大区域高分辨率遥感影像多尺度分割并行计算方法

国家自然科学基金

0+阅读 · 2015年12月31日

工业过程动态数据的多模型在线重构研究

国家自然科学基金

1+阅读 · 2015年12月31日

考虑缓冲区大小及在制品数量的多工序生产系统预测维护方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于规范的企业竞争力演化的计算实验研究

国家自然科学基金

1+阅读 · 2014年12月31日

形状先验和数据驱动的高分辨遥感影像目标提取

国家自然科学基金

3+阅读 · 2014年12月31日

云制造资源优化调度理论与方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Arxiv

0+阅读 · 5月1日

What Makes Software Bugs Escape Testing? Evidence from a Large-Scale Empirical Study

Arxiv

0+阅读 · 4月29日

Toward Scalable Terminal Task Synthesis via Skill Graphs

Arxiv

0+阅读 · 4月28日

Statistical Validation of Computer Models: Global and Subdomain Hypothesis Testing

Arxiv

0+阅读 · 4月18日

The Principle of Maximum Heterogeneity Optimises Productivity in Distributed Production Systems Across Biology, Economics, and Computing

Arxiv

0+阅读 · 4月8日

Enabling Deterministic User-Level Interrupts in Real-Time Processors via Hardware Extension

Arxiv

0+阅读 · 4月5日

Towards Industrial-scale Product Configuration

Arxiv

0+阅读 · 3月24日

SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation

Arxiv

0+阅读 · 3月19日

Multifidelity Simulation-based Inference for Computationally Expensive Simulators

Arxiv

0+阅读 · 3月19日

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Arxiv

21+阅读 · 2020年12月17日

VIP会员

文章信息

相关主题

最新内容

DeepSeek 版Claude Code，免费小白安装教程来了！

DeepSeek 版Claude Code，免费小白安装教程来了！

专知会员服务

7+阅读 · 5月5日

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

专知会员服务

3+阅读 · 5月5日

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

专知会员服务

3+阅读 · 5月5日

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

专知会员服务

4+阅读 · 5月5日

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

《火炮弹药快速效能建模：提升互操作性与技术优势》（报告）

专知会员服务

6+阅读 · 5月5日

《美空军条令出版物 2-0：情报（2026版）》

《美空军条令出版物 2-0：情报（2026版）》

专知会员服务

12+阅读 · 5月5日

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

美陆军“飞蝇陷阱5.0”项目将新兴技术交到作战人员手中

专知会员服务

4+阅读 · 5月5日

帕兰提尔 Gotham：一个游戏规则改变器

帕兰提尔 Gotham：一个游戏规则改变器

专知会员服务

6+阅读 · 5月5日

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

【ICML 2026】用测试时训练线性化视觉Transformer：T⁵ 实现 Softmax 注意力到线性复杂度的快速转换

专知会员服务

2+阅读 · 5月5日

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

【AAAI 2026】大模型做知识蒸馏：CMM将LLM特征拆解给小模型协同学习

专知会员服务

2+阅读 · 5月5日

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

【ICML Spotlight 2026 】NonZero：交互引导探索的多智能体蒙特卡洛树搜索

专知会员服务

8+阅读 · 5月4日

【综述】机器人学习中的世界模型：全面综述

【综述】机器人学习中的世界模型：全面综述

专知会员服务

11+阅读 · 5月4日

伊朗的导弹-无人机行动及其对美国威慑的影响

伊朗的导弹-无人机行动及其对美国威慑的影响

专知会员服务

9+阅读 · 5月4日

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

《未来战术无人机系统案例研究：量身定制采办策略方法》100页报告

专知会员服务

9+阅读 · 5月4日

战争贩子：2026年第一季度美国对中东潜在军售激增

战争贩子：2026年第一季度美国对中东潜在军售激增

专知会员服务

7+阅读 · 5月4日

相关VIP内容

《用计算图变换加速实际工程设计优化》MIT 400页

《用计算图变换加速实际工程设计优化》MIT 400页

专知会员服务

17+阅读 · 2025年11月7日

深度学习在农业领域的研究与应用

深度学习在农业领域的研究与应用

专知会员服务

23+阅读 · 2024年5月3日

最新《工业大模型应用报告》

最新《工业大模型应用报告》

专知会员服务

121+阅读 · 2024年4月5日

AI大模型落地终端，AIPC驱动PC行业新增长

AI大模型落地终端，AIPC驱动PC行业新增长

专知会员服务

48+阅读 · 2024年2月25日

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

清华49页长文全方位分析参数高效微调方案Delta Tuning，揭秘大模型背后的机理

专知会员服务

50+阅读 · 2022年4月8日

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

专知会员服务

11+阅读 · 2022年3月12日

【KDD2019|讲座推荐】从生产规模神经网络中发现知识的统计学习方法：Statistical Mechanics Methods for Discovering Knowledge from Production-Scale Neural Networks

【KDD2019|讲座推荐】从生产规模神经网络中发现知识的统计学习方法：Statistical Mechanics Methods for Discovering Knowledge from Production-Scale Neural Networks

专知会员服务

18+阅读 · 2019年12月4日

【中科院计算所】边缘计算与工具综述论文，A Survey on Edge Computing Systems and Tools

【中科院计算所】边缘计算与工具综述论文，A Survey on Edge Computing Systems and Tools

专知会员服务

96+阅读 · 2019年11月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML Spotlight 2026】 T²PO: 不确定性引导的探索控制框架，实现稳定多轮Agentic强化学习

《机动炮兵的演进与未来：技术进步、历史沿革与炮兵作战前瞻》

DeepSeek 版Claude Code，免费小白安装教程来了！

基础模型驱动的工业智能体：技术成熟度、能力变迁与未竟之挑战

相关资讯

时间序列预测：一课掌握亚马逊开源算法DeepAR

时间序列预测：一课掌握亚马逊开源算法DeepAR

机器之心

13+阅读 · 2020年6月3日

初学者系列：Attentional Factorization Machines（AFM）详解

初学者系列：Attentional Factorization Machines（AFM）详解

专知

82+阅读 · 2019年9月16日

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

参数少一半，效果还更好，天津大学和微软提出Transformer压缩模型

机器之心

15+阅读 · 2019年7月13日

【数字孪生】数字孪生是制造业实现“智能+”的技术接口

【数字孪生】数字孪生是制造业实现“智能+”的技术接口

产业智能官

35+阅读 · 2019年4月30日

【数字孪生】数字孪生是工业互联网关键技术和重要场景

【数字孪生】数字孪生是工业互联网关键技术和重要场景

产业智能官

39+阅读 · 2019年4月9日

多图带你读懂 Transformers 的工作原理

多图带你读懂 Transformers 的工作原理

AI研习社

10+阅读 · 2019年3月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【仿真】生产系统仿真软件，实现数字化工厂的利器！

【仿真】生产系统仿真软件，实现数字化工厂的利器！

产业智能官

15+阅读 · 2018年11月1日

【边缘计算】工业互联网正确打开方式系列（四）：边缘计算

【边缘计算】工业互联网正确打开方式系列（四）：边缘计算

产业智能官

19+阅读 · 2018年8月31日

【论文推荐】最新六篇用户建模精选论文推荐—深度多模态融合、跨平台、时序性RNN、ATRank、嵌入因子分解、异构信息网络

【论文推荐】最新六篇用户建模精选论文推荐—深度多模态融合、跨平台、时序性RNN、ATRank、嵌入因子分解、异构信息网络

专知

10+阅读 · 2018年3月10日

相关论文

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Arxiv

0+阅读 · 5月1日

What Makes Software Bugs Escape Testing? Evidence from a Large-Scale Empirical Study

Arxiv

0+阅读 · 4月29日

Toward Scalable Terminal Task Synthesis via Skill Graphs

Arxiv

0+阅读 · 4月28日

Statistical Validation of Computer Models: Global and Subdomain Hypothesis Testing

Arxiv

0+阅读 · 4月18日

The Principle of Maximum Heterogeneity Optimises Productivity in Distributed Production Systems Across Biology, Economics, and Computing

Arxiv

0+阅读 · 4月8日

Enabling Deterministic User-Level Interrupts in Real-Time Processors via Hardware Extension

Arxiv

0+阅读 · 4月5日

Towards Industrial-scale Product Configuration

Arxiv

0+阅读 · 3月24日

SwiftTailor: Efficient 3D Garment Generation with Geometry Image Representation

Arxiv

0+阅读 · 3月19日

Multifidelity Simulation-based Inference for Computationally Expensive Simulators

Arxiv

0+阅读 · 3月19日

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Arxiv

21+阅读 · 2020年12月17日

相关基金

3D打印过程能耗建模与智能数据模型研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于工业大数据挖掘的复杂产品总完工时间动态预测

国家自然科学基金

4+阅读 · 2015年12月31日

基于感性工学与视觉感知协同优化的产品设计理论及应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

“数据-知识”驱动的大区域高分辨率遥感影像多尺度分割并行计算方法

国家自然科学基金

0+阅读 · 2015年12月31日

工业过程动态数据的多模型在线重构研究

国家自然科学基金

1+阅读 · 2015年12月31日

考虑缓冲区大小及在制品数量的多工序生产系统预测维护方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于规范的企业竞争力演化的计算实验研究

国家自然科学基金

1+阅读 · 2014年12月31日

形状先验和数据驱动的高分辨遥感影像目标提取

国家自然科学基金

3+阅读 · 2014年12月31日

云制造资源优化调度理论与方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员