Pragmatic Reasoning in Structured Signaling Games - 专知论文

会员服务 ·

0

Agent · Learning · INFORMS · RSA 加密 · Color ·

2023 年 5 月 17 日

Pragmatic Reasoning in Structured Signaling Games

翻译：结构化信号博弈中的语用推理

Emil Carlsson,Devdatt Dubhashi

from arxiv, CogSci 2022

In this work we introduce a structured signaling game, an extension of the classical signaling game with a similarity structure between meanings in the context, along with a variant of the Rational Speech Act (RSA) framework which we call structured-RSA (sRSA) for pragmatic reasoning in structured domains. We explore the behavior of the sRSA in the domain of color and show that pragmatic agents using sRSA on top of semantic representations, derived from the World Color Survey, attain efficiency very close to the information theoretic limit after only 1 or 2 levels of recursion. We also explore the interaction between pragmatic reasoning and learning in multi-agent reinforcement learning framework. Our results illustrate that artificial agents using sRSA develop communication closer to the information theoretic frontier compared to agents using RSA and just reinforcement learning. We also find that the ambiguity of the semantic representation increases as the pragmatic agents are allowed to perform deeper reasoning about each other during learning.

翻译：本文引入了一种结构化信号博弈（structured signaling game），这是经典信号博弈的扩展，其中语境中含义之间具有相似性结构；同时提出了一种理性言语行为（Rational Speech Act, RSA）框架的变体，称为结构化RSA（sRSA），用于结构化领域中的语用推理。我们探究了sRSA在颜色领域中的行为，并表明：在使用源于世界颜色调查（World Color Survey）的语义表示之上，基于sRSA的语用智能体在仅1至2层递归后便能达到接近信息论极限的效率。我们还探究了多智能体强化学习框架中语用推理与学习之间的交互作用。结果表明，与仅使用RSA和强化学习的智能体相比，采用sRSA的人工智能体所发展出的通信更接近信息论前沿。我们还发现，当语用智能体在学习过程中被允许对彼此进行更深层次的推理时，语义表示的模糊性会增加。

0

相关内容

Agent

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

32+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

Fe3O4-C双壳中空纳米球的可控合成及对重金属离子的吸附研究

国家自然科学基金

0+阅读 · 2015年12月31日

mTOR-STAT3-Notch信号通路介导的自噬在ALDH2改善阿尔茨海默病认知障碍中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

内质网应激和线粒体通路交联介导微囊藻毒素致斑马鱼雄性生殖细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

分支极限糊精调控小麦淀粉回生机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

微囊藻毒素的光催化降解技术和机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

管束间气液两相绕流特性及流型演变机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

可见光诱导型光催化剂的制备及光电催化降解微囊藻毒素研究

国家自然科学基金

0+阅读 · 2009年12月31日

Ti3SiC2 MAX 相薄膜的合成及其氦损伤特性研究

国家自然科学基金

0+阅读 · 2009年12月31日

检测β#20869;酰胺酶体内体外表达的荧光小分子探针的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Failures of Contingent Thinking

Arxiv

0+阅读 · 2023年7月3日

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

0+阅读 · 2023年6月30日

Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures

Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures

Arxiv

0+阅读 · 2023年6月30日

Acquisition of Chess Knowledge in AlphaZero

Arxiv

14+阅读 · 2021年11月27日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

Neural Collaborative Reasoning

Arxiv

13+阅读 · 2021年5月3日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

Learning Discrete Structures for Graph Neural Networks

Arxiv

18+阅读 · 2019年3月28日

Multimodal Sentiment Analysis To Explore the Structure of Emotions

Arxiv

19+阅读 · 2018年5月25日

Variational Knowledge Graph Reasoning

Arxiv

15+阅读 · 2018年4月5日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | SARDI：扩散语言模型的自增强检索

ICML 2026 | SARDI：扩散语言模型的自增强检索

专知会员服务

4+阅读 · 6月6日

长时程具身智能安全综述：机器人操作的跨层分析

长时程具身智能安全综述：机器人操作的跨层分析

专知会员服务

4+阅读 · 6月6日

从“杀伤链”到“杀伤网”：新时代防空反导体系的真正需求

从“杀伤链”到“杀伤网”：新时代防空反导体系的真正需求

专知会员服务

10+阅读 · 6月6日

《锻造军官能力：军官发展的军事训练、学术教育及设计思维导向创新的多维度研究》最新300页

《锻造军官能力：军官发展的军事训练、学术教育及设计思维导向创新的多维度研究》最新300页

专知会员服务

6+阅读 · 6月6日

《国防领域安全采用大语言模型的战略蓝图》

《国防领域安全采用大语言模型的战略蓝图》

专知会员服务

7+阅读 · 6月6日

《对抗性电磁环境下远程巡飞弹作战的保密指挥控制数据链》

《对抗性电磁环境下远程巡飞弹作战的保密指挥控制数据链》

专知会员服务

7+阅读 · 6月6日

CVPR2026奖项公布，谷歌D4RT最佳论文获奖，何恺明ResNet、YOLO获时间检验奖！

CVPR2026奖项公布，谷歌D4RT最佳论文获奖，何恺明ResNet、YOLO获时间检验奖！

专知会员服务

5+阅读 · 6月6日

ICML 2026 | 演化选择的因果建模

ICML 2026 | 演化选择的因果建模

专知会员服务

7+阅读 · 6月5日

综述｜学习式3D表征最新进展与趋势

综述｜学习式3D表征最新进展与趋势

专知会员服务

7+阅读 · 6月5日

《武器作战效能分析：基于虚拟构造仿真大数据与深度学习的初步见解》

《武器作战效能分析：基于虚拟构造仿真大数据与深度学习的初步见解》

专知会员服务

8+阅读 · 6月5日

《自主巡飞弹药系统量子逻辑框架：一种基于不确定模糊集的方法》

《自主巡飞弹药系统量子逻辑框架：一种基于不确定模糊集的方法》

专知会员服务

7+阅读 · 6月5日

人工智能重塑威慑：算法优势的兴起

人工智能重塑威慑：算法优势的兴起

专知会员服务

8+阅读 · 6月5日

【博士论文】基于物理结构与贝叶斯不确定性的可靠神经网络

【博士论文】基于物理结构与贝叶斯不确定性的可靠神经网络

专知会员服务

14+阅读 · 6月4日

AgentOps综述：智能体系统运维框架

AgentOps综述：智能体系统运维框架

专知会员服务

17+阅读 · 6月4日

《美陆军最新条令：兵力防护》

《美陆军最新条令：兵力防护》

专知会员服务

14+阅读 · 6月4日

相关VIP内容

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

32+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

长时程具身智能安全综述：机器人操作的跨层分析

《锻造军官能力：军官发展的军事训练、学术教育及设计思维导向创新的多维度研究》最新300页

ICML 2026 | SARDI：扩散语言模型的自增强检索

从“杀伤链”到“杀伤网”：新时代防空反导体系的真正需求

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Failures of Contingent Thinking

Arxiv

0+阅读 · 2023年7月3日

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

0+阅读 · 2023年6月30日

Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures

Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures

Arxiv

0+阅读 · 2023年6月30日

Acquisition of Chess Knowledge in AlphaZero

Arxiv

14+阅读 · 2021年11月27日

The Confluence of Networks, Games and Learning

Arxiv

94+阅读 · 2021年5月17日

Neural Collaborative Reasoning

Arxiv

13+阅读 · 2021年5月3日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

Learning Discrete Structures for Graph Neural Networks

Arxiv

18+阅读 · 2019年3月28日

Multimodal Sentiment Analysis To Explore the Structure of Emotions

Arxiv

19+阅读 · 2018年5月25日

Variational Knowledge Graph Reasoning

Arxiv

15+阅读 · 2018年4月5日

相关基金

Fe3O4-C双壳中空纳米球的可控合成及对重金属离子的吸附研究

国家自然科学基金

0+阅读 · 2015年12月31日

mTOR-STAT3-Notch信号通路介导的自噬在ALDH2改善阿尔茨海默病认知障碍中的作用

国家自然科学基金

0+阅读 · 2014年12月31日

内质网应激和线粒体通路交联介导微囊藻毒素致斑马鱼雄性生殖细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

分支极限糊精调控小麦淀粉回生机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

微囊藻毒素的光催化降解技术和机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

管束间气液两相绕流特性及流型演变机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

可见光诱导型光催化剂的制备及光电催化降解微囊藻毒素研究

国家自然科学基金

0+阅读 · 2009年12月31日

Ti3SiC2 MAX 相薄膜的合成及其氦损伤特性研究

国家自然科学基金

0+阅读 · 2009年12月31日

检测β#20869;酰胺酶体内体外表达的荧光小分子探针的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员