RoBoSR: Structured Scene Representations for Embodied Robotic Reasoning - 专知论文

会员服务 ·

0

表示 · 机器人 · MoDELS · Learning · 监督 ·

RoBoSR: Structured Scene Representations for Embodied Robotic Reasoning

翻译：暂无翻译

Kewei Hu,Wanchan Yu,Fangwen Chen,Jing Jiajian,Zimeng Li,Ying Wei,Tianhao Liu,Michael Zhang,Hanwen Kang

Despite rapid progress, embodied reasoning under real-world variability remains challenging. Existing approaches rely on demonstration-driven sequential biases, limiting flexibility in open-ended and long-horizon tasks that require structured reasoning over evolving states. We introduce RoBoSR, an intermediate structural representation that formulates manipulation as step-wise state transitions over semantically grounded, object-centric scene graphs. By modeling object states and their spatial relations at the perception-action interface, RoBoSR disentangles high-level task reasoning from raw inputs and enables structured reasoning over preconditions, effects, and goal states. This representation endows the agent with causal reasoning capability, enforcing subtask dependencies and supporting coherent long-horizon task planning. To learn such structure-aware reasoning, we construct Manip-Cognition-1.6M, an open-world dataset that jointly supervises scene understanding, instruction interpretation, and subtask planning across diverse tasks. Across several benchmarks and real-world demonstrations, our method consistently outperforms prompting-based methods and classical TAMP baselines in zero-shot generalization and long-horizon tasks. The results underscore structured intermediate representations as a critical inductive bias for scalable embodied reasoning.

翻译：暂无翻译

0

相关内容

《人工智能武器化：恐怖主义与战争的新阶段》2025最新134页

《人工智能武器化：恐怖主义与战争的新阶段》2025最新134页

专知会员服务

26+阅读 · 2025年5月3日

Robotaxi的商业模式前景展望

Robotaxi的商业模式前景展望

专知会员服务

17+阅读 · 2024年9月21日

上海交大姚振鹏副教授团队在《Nature Reviews Materials》发表人工智能加速材料发现综述论文

上海交大姚振鹏副教授团队在《Nature Reviews Materials》发表人工智能加速材料发现综述论文

专知会员服务

24+阅读 · 2022年10月31日

《智慧城市知识图谱模型与本体构建方法》拓尔思知识图谱研究院等

《智慧城市知识图谱模型与本体构建方法》拓尔思知识图谱研究院等

专知会员服务

50+阅读 · 2022年3月27日

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

专知会员服务

11+阅读 · 2022年3月12日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【KDD2021】重思考异构图神经网络

专知会员服务

27+阅读 · 2021年7月21日

【综述论文】A Survey on Dynamic Network Embedding，动态网络嵌入综述论文

【综述论文】A Survey on Dynamic Network Embedding，动态网络嵌入综述论文

专知会员服务

102+阅读 · 2020年6月16日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

KG 高引论文解读两篇 | 两种模型：多层卷积神经网络、知识感知路径递归网络

KG 高引论文解读两篇 | 两种模型：多层卷积神经网络、知识感知路径递归网络

学术头条

18+阅读 · 2019年12月8日

赛尔原创 | EMNLP 2019 基于上下文感知的变分自编码器建模事件背景知识进行If-Then类型常识推理

赛尔原创 | EMNLP 2019 基于上下文感知的变分自编码器建模事件背景知识进行If-Then类型常识推理

哈工大SCIR

17+阅读 · 2019年9月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

重构 Palantir 数据模型

重构 Palantir 数据模型

待字闺中

34+阅读 · 2018年12月27日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

每日论文 | 用循环世界模型改良策略进化；轻量级CNN：ChannelNets；强化学习知识点总结

每日论文 | 用循环世界模型改良策略进化；轻量级CNN：ChannelNets；强化学习知识点总结

论智

14+阅读 · 2018年9月7日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

【论文推荐】最新八篇主题模型相关论文—主题建模优化、变分推断、情绪强度、神经语言模型、搜索、社区聚合、主题建模的问题、光谱学习

【论文推荐】最新八篇主题模型相关论文—主题建模优化、变分推断、情绪强度、神经语言模型、搜索、社区聚合、主题建模的问题、光谱学习

专知

13+阅读 · 2018年3月8日

多层动态网络的建模、群体动力学分析与控制

国家自然科学基金

3+阅读 · 2015年12月31日

旋转式水稻钵苗有序抛栽机构创新，参数优化及设计方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

用于反演自然生物关节结构及力学性能的柔性机构设计理论与方法

国家自然科学基金

0+阅读 · 2015年12月31日

具有重构特征的系统可靠性建模方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

工业过程动态数据的多模型在线重构研究

国家自然科学基金

1+阅读 · 2015年12月31日

信息物理系统动力学演化融合机制与行为建模研究

国家自然科学基金

0+阅读 · 2014年12月31日

拼装式盾构结构服役性能退化模型及在隧道维养应用中研究

国家自然科学基金

0+阅读 · 2014年12月31日

不同重构措施复垦土壤水氮运移和作物生长模拟与响应机制

国家自然科学基金

0+阅读 · 2014年12月31日

城市群空间交互情景分析与多尺度协同模拟

国家自然科学基金

0+阅读 · 2014年12月31日

基于结构化方法的复杂研发项目多领域集成分析与优化研究

国家自然科学基金

2+阅读 · 2014年12月31日

SC3-Eval: Evaluating Robot Foundation Models via Self-Consistent Video Generation

Arxiv

0+阅读 · 6月23日

Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters

Arxiv

0+阅读 · 6月22日

ARP: Enhancing Quantized Skill Abstractions via Visual Alignment and Iterative Refinement for Robotic Manipulation

Arxiv

0+阅读 · 6月21日

A Taxonomy of Conceptual Alignment in Human-Robot Dialogue

Arxiv

0+阅读 · 6月21日

Towards Considerate Human-Robot Coexistence: A Dual-Space Framework of Robot Design and Human Perception in Healthcare

Arxiv

0+阅读 · 6月21日

Robot Critics that Sweat the Small Stuff

Arxiv

0+阅读 · 6月19日

SARIF: Segment Anything for Robust Image Forensics

Arxiv

0+阅读 · 6月19日

Learning-Based Modeling of Soft Robots via Cosserat Rod Theory

Arxiv

0+阅读 · 6月18日

ROBOSHACKLES: A Safety Dataset for Human-Injury Prevention in Embodied Foundation Models

Arxiv

0+阅读 · 6月17日

WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

Arxiv

0+阅读 · 6月16日

VIP会员

文章信息

相关主题

最新内容

无人机自主控制与人工智能：系统性综述

无人机自主控制与人工智能：系统性综述

专知会员服务

5+阅读 · 今天7:25

巡飞弹与反无人机系统——现代战场的两大支柱

巡飞弹与反无人机系统——现代战场的两大支柱

专知会员服务

2+阅读 · 今天6:54

《打造“黄金舰队”》57页报告

《打造“黄金舰队”》57页报告

专知会员服务

1+阅读 · 今天6:52

《北约数字教官网络发展路径》128页报告

《北约数字教官网络发展路径》128页报告

专知会员服务

1+阅读 · 今天6:33

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

ECCV 2026 | MIMFlow：MIM与归一化流统一图像生成

专知会员服务

6+阅读 · 6月25日

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

超越自回归边界：扩散模型、世界模型与SSM如何重塑代码智能

专知会员服务

5+阅读 · 6月25日

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

重塑决策优势：美军作战艺术与多域作战中联盟联合全域指挥控制（CJADC2）体系的融合

专知会员服务

9+阅读 · 6月25日

网状网络及其在军事领域的运用

网状网络及其在军事领域的运用

专知会员服务

7+阅读 · 6月25日

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

《意识即战场——全球安全体系中认知战的演进：乌克兰构建认知作战体系的展望》

专知会员服务

8+阅读 · 6月25日

无美国参与的欧洲战争方式（万字长文）

无美国参与的欧洲战争方式（万字长文）

专知会员服务

8+阅读 · 6月25日

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

重构“下一场战争”的制胜理论：超越兰彻斯特方程与现代系统

专知会员服务

9+阅读 · 6月25日

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

《国防工业中基于模型定义的实施：产品定义数字化转型的战略路径》90页

专知会员服务

9+阅读 · 6月25日

《国防领域敏感性分析白皮书》

《国防领域敏感性分析白皮书》

专知会员服务

9+阅读 · 6月25日

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

9+阅读 · 6月24日

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

10+阅读 · 6月24日

相关VIP内容

《人工智能武器化：恐怖主义与战争的新阶段》2025最新134页

《人工智能武器化：恐怖主义与战争的新阶段》2025最新134页

专知会员服务

26+阅读 · 2025年5月3日

Robotaxi的商业模式前景展望

Robotaxi的商业模式前景展望

专知会员服务

17+阅读 · 2024年9月21日

上海交大姚振鹏副教授团队在《Nature Reviews Materials》发表人工智能加速材料发现综述论文

上海交大姚振鹏副教授团队在《Nature Reviews Materials》发表人工智能加速材料发现综述论文

专知会员服务

24+阅读 · 2022年10月31日

《智慧城市知识图谱模型与本体构建方法》拓尔思知识图谱研究院等

《智慧城市知识图谱模型与本体构建方法》拓尔思知识图谱研究院等

专知会员服务

50+阅读 · 2022年3月27日

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

【CVPR 2022】连续驾驶场景与不断增长的建筑的连续立体匹配，Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture

专知会员服务

11+阅读 · 2022年3月12日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【KDD2021】重思考异构图神经网络

专知会员服务

27+阅读 · 2021年7月21日

【综述论文】A Survey on Dynamic Network Embedding，动态网络嵌入综述论文

【综述论文】A Survey on Dynamic Network Embedding，动态网络嵌入综述论文

专知会员服务

102+阅读 · 2020年6月16日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

巡飞弹与反无人机系统——现代战场的两大支柱

《北约数字教官网络发展路径》128页报告

无人机自主控制与人工智能：系统性综述

《打造“黄金舰队”》57页报告

相关资讯

KG 高引论文解读两篇 | 两种模型：多层卷积神经网络、知识感知路径递归网络

KG 高引论文解读两篇 | 两种模型：多层卷积神经网络、知识感知路径递归网络

学术头条

18+阅读 · 2019年12月8日

赛尔原创 | EMNLP 2019 基于上下文感知的变分自编码器建模事件背景知识进行If-Then类型常识推理

赛尔原创 | EMNLP 2019 基于上下文感知的变分自编码器建模事件背景知识进行If-Then类型常识推理

哈工大SCIR

17+阅读 · 2019年9月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

重构 Palantir 数据模型

重构 Palantir 数据模型

待字闺中

34+阅读 · 2018年12月27日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

每日论文 | 用循环世界模型改良策略进化；轻量级CNN：ChannelNets；强化学习知识点总结

每日论文 | 用循环世界模型改良策略进化；轻量级CNN：ChannelNets；强化学习知识点总结

论智

14+阅读 · 2018年9月7日

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

【论文推荐】最新六篇主题模型相关论文—动态主题模型、主题趋势、大规模并行采样、随机采样、非参主题建模

专知

14+阅读 · 2018年6月24日

【论文推荐】最新八篇主题模型相关论文—主题建模优化、变分推断、情绪强度、神经语言模型、搜索、社区聚合、主题建模的问题、光谱学习

【论文推荐】最新八篇主题模型相关论文—主题建模优化、变分推断、情绪强度、神经语言模型、搜索、社区聚合、主题建模的问题、光谱学习

专知

13+阅读 · 2018年3月8日

相关论文

SC3-Eval: Evaluating Robot Foundation Models via Self-Consistent Video Generation

Arxiv

0+阅读 · 6月23日

Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters

Arxiv

0+阅读 · 6月22日

ARP: Enhancing Quantized Skill Abstractions via Visual Alignment and Iterative Refinement for Robotic Manipulation

Arxiv

0+阅读 · 6月21日

A Taxonomy of Conceptual Alignment in Human-Robot Dialogue

Arxiv

0+阅读 · 6月21日

Towards Considerate Human-Robot Coexistence: A Dual-Space Framework of Robot Design and Human Perception in Healthcare

Arxiv

0+阅读 · 6月21日

Robot Critics that Sweat the Small Stuff

Arxiv

0+阅读 · 6月19日

SARIF: Segment Anything for Robust Image Forensics

Arxiv

0+阅读 · 6月19日

Learning-Based Modeling of Soft Robots via Cosserat Rod Theory

Arxiv

0+阅读 · 6月18日

ROBOSHACKLES: A Safety Dataset for Human-Injury Prevention in Embodied Foundation Models

Arxiv

0+阅读 · 6月17日

WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

Arxiv

0+阅读 · 6月16日

相关基金

多层动态网络的建模、群体动力学分析与控制

国家自然科学基金

3+阅读 · 2015年12月31日

旋转式水稻钵苗有序抛栽机构创新，参数优化及设计方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

用于反演自然生物关节结构及力学性能的柔性机构设计理论与方法

国家自然科学基金

0+阅读 · 2015年12月31日

具有重构特征的系统可靠性建模方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

工业过程动态数据的多模型在线重构研究

国家自然科学基金

1+阅读 · 2015年12月31日

信息物理系统动力学演化融合机制与行为建模研究

国家自然科学基金

0+阅读 · 2014年12月31日

拼装式盾构结构服役性能退化模型及在隧道维养应用中研究

国家自然科学基金

0+阅读 · 2014年12月31日

不同重构措施复垦土壤水氮运移和作物生长模拟与响应机制

国家自然科学基金

0+阅读 · 2014年12月31日

城市群空间交互情景分析与多尺度协同模拟

国家自然科学基金

0+阅读 · 2014年12月31日

基于结构化方法的复杂研发项目多领域集成分析与优化研究

国家自然科学基金

2+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员