Habits and goals in synergy: a variational Bayesian framework for behavior - 专知论文

会员服务 ·

0

贝叶斯框架 · 变分贝叶斯 · 贝叶斯 · 变分 · 协同作用 ·

2023 年 4 月 11 日

Habits and goals in synergy: a variational Bayesian framework for behavior

翻译：习惯与目标协同：行为的变分贝叶斯框架

Dongqi Han,Kenji Doya,Dongsheng Li,Jun Tani

How to behave efficiently and flexibly is a central problem for understanding biological agents and creating intelligent embodied AI. It has been well known that behavior can be classified as two types: reward-maximizing habitual behavior, which is fast while inflexible; and goal-directed behavior, which is flexible while slow. Conventionally, habitual and goal-directed behaviors are considered handled by two distinct systems in the brain. Here, we propose to bridge the gap between the two behaviors, drawing on the principles of variational Bayesian theory. We incorporate both behaviors in one framework by introducing a Bayesian latent variable called "intention". The habitual behavior is generated by using prior distribution of intention, which is goal-less; and the goal-directed behavior is generated by the posterior distribution of intention, which is conditioned on the goal. Building on this idea, we present a novel Bayesian framework for modeling behaviors. Our proposed framework enables skill sharing between the two kinds of behaviors, and by leveraging the idea of predictive coding, it enables an agent to seamlessly generalize from habitual to goal-directed behavior without requiring additional training. The proposed framework suggests a fresh perspective for cognitive science and embodied AI, highlighting the potential for greater integration between habitual and goal-directed behaviors.

翻译：如何高效且灵活地执行行为，是理解生物智能体以及创建智能具身AI的核心问题。众所周知，行为可分为两类：追求奖励最大化的习惯性行为，其特点是快速但缺乏灵活性；以及目标导向行为，其特点是灵活但速度较慢。传统上，习惯性和目标导向行为被认为由大脑中的两个不同系统处理。在此，我们基于变分贝叶斯理论原理，提出弥合这两种行为之间的鸿沟。我们通过引入一个名为"意图"的贝叶斯潜变量，将两种行为纳入同一框架。习惯性行为通过意图的先验分布生成，该分布无目标性；而目标导向行为则通过意图的后验分布生成，该分布以目标为条件。基于这一思想，我们提出了一种新颖的贝叶斯框架用于行为建模。该框架支持两类行为之间的技能共享，并借助预测编码的思想，使智能体能够无缝地从习惯性行为泛化到目标导向行为，而无需额外训练。所提出的框架为认知科学与具身AI提供了新视角，凸显了习惯性与目标导向行为之间更深度整合的潜力。

0

相关内容

贝叶斯框架

贝叶斯框架

【nature machine intelligence】终身学习机器的生物基础，Biological underpinnings for lifelong learning machines

【nature machine intelligence】终身学习机器的生物基础，Biological underpinnings for lifelong learning machines

专知会员服务

38+阅读 · 2022年3月24日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

【AAAI2021】元学习器的冷启动序列推荐

【AAAI2021】元学习器的冷启动序列推荐

专知会员服务

41+阅读 · 2020年12月19日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

107+阅读 · 2020年6月21日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

39+阅读 · 2020年5月30日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

《C++ Primer中文版第5版》电子书与学习笔记和课后练习答案

《C++ Primer中文版第5版》电子书与学习笔记和课后练习答案

专知会员服务

276+阅读 · 2020年2月13日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

面向图像网状结构体的蚁群分割算法

国家自然科学基金

0+阅读 · 2017年12月31日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

RNF43介导的AKT/MDM2和NEDL1途径调控肝癌细胞恶性行为的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Dnmt1调控斑马鱼造血干细胞产生、分化及迁移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于电路QED超强耦合机制的动力学演化和量子调控

国家自然科学基金

0+阅读 · 2013年12月31日

双相不锈钢2103连铸坯凝固过程热模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

移动互联网络中的博弈与协同激励机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RGM与neogenin信号调控应激性精神障碍-PTSD杏仁核、海马神经细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

survivin拮抗细胞衰老的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

视频选择性注意机理与语义特征提取

国家自然科学基金

1+阅读 · 2009年12月31日

Bayesian feedback in the framework of ecological sciences

Arxiv

0+阅读 · 2023年5月29日

Maximum Optimality Margin: A Unified Approach for Contextual Linear Programming and Inverse Linear Programming

Arxiv

0+阅读 · 2023年5月28日

Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Arxiv

0+阅读 · 2023年5月26日

HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning

Arxiv

0+阅读 · 2023年5月25日

FemtoDet: An Object Detection Baseline for Energy Versus Performance Tradeoffs

Arxiv

0+阅读 · 2023年5月25日

Neural incomplete factorization: learning preconditioners for the conjugate gradient method

Arxiv

0+阅读 · 2023年5月25日

The Behavior and Convergence of Local Bayesian Optimization

Arxiv

0+阅读 · 2023年5月24日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

VIP会员

文章信息

相关主题

贝叶斯框架

变分贝叶斯

最新内容

【博士论文】基于物理结构与贝叶斯不确定性的可靠神经网络

【博士论文】基于物理结构与贝叶斯不确定性的可靠神经网络

专知会员服务

6+阅读 · 6月4日

AgentOps综述：智能体系统运维框架

AgentOps综述：智能体系统运维框架

专知会员服务

9+阅读 · 6月4日

《美陆军最新条令：兵力防护》

《美陆军最新条令：兵力防护》

专知会员服务

9+阅读 · 6月4日

《军用物联网：架构、应用、挑战与现代战争中的战略意义》

《军用物联网：架构、应用、挑战与现代战争中的战略意义》

专知会员服务

7+阅读 · 6月4日

《人工智能的挑战：算法战的想象与现实》

《人工智能的挑战：算法战的想象与现实》

专知会员服务

10+阅读 · 6月4日

《自适应智能：融合数字孪生精准性与人工智能预见力，实现实时决策》

《自适应智能：融合数字孪生精准性与人工智能预见力，实现实时决策》

专知会员服务

11+阅读 · 6月4日

首场人工智能战争：Maven如何重塑武装冲突

首场人工智能战争：Maven如何重塑武装冲突

专知会员服务

6+阅读 · 6月4日

【博士论文】抽象信息论与安全奖励学习的数学发展

【博士论文】抽象信息论与安全奖励学习的数学发展

专知会员服务

8+阅读 · 6月3日

综述 | 机器人操作世界模型：预测、行动接口与学习生命周期

综述 | 机器人操作世界模型：预测、行动接口与学习生命周期

专知会员服务

5+阅读 · 6月3日

《推进军事决策支持：运用强化学习驱动仿真的稳健作战计划验证》

《推进军事决策支持：运用强化学习驱动仿真的稳健作战计划验证》

专知会员服务

11+阅读 · 6月3日

详解人工智能赋能战争的旗舰软件平台：Maven智能系统

详解人工智能赋能战争的旗舰软件平台：Maven智能系统

专知会员服务

22+阅读 · 6月3日

《发展用于决策支持的化生放核（CBRN）态势理解》

《发展用于决策支持的化生放核（CBRN）态势理解》

专知会员服务

8+阅读 · 6月3日

《通往人工通用智能之路上的均衡策略》

《通往人工通用智能之路上的均衡策略》

专知会员服务

7+阅读 · 6月3日

《人工智能与军事整合：现状与未来风险》报告

《人工智能与军事整合：现状与未来风险》报告

专知会员服务

5+阅读 · 6月3日

《Palantir的科技生态系统》

《Palantir的科技生态系统》

专知会员服务

21+阅读 · 6月2日

相关VIP内容

【nature machine intelligence】终身学习机器的生物基础，Biological underpinnings for lifelong learning machines

【nature machine intelligence】终身学习机器的生物基础，Biological underpinnings for lifelong learning machines

专知会员服务

38+阅读 · 2022年3月24日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

【AAAI2021】元学习器的冷启动序列推荐

【AAAI2021】元学习器的冷启动序列推荐

专知会员服务

41+阅读 · 2020年12月19日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

107+阅读 · 2020年6月21日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

39+阅读 · 2020年5月30日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

《C++ Primer中文版第5版》电子书与学习笔记和课后练习答案

《C++ Primer中文版第5版》电子书与学习笔记和课后练习答案

专知会员服务

276+阅读 · 2020年2月13日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

AgentOps综述：智能体系统运维框架

《军用物联网：架构、应用、挑战与现代战争中的战略意义》

【博士论文】基于物理结构与贝叶斯不确定性的可靠神经网络

《美陆军最新条令：兵力防护》

相关资讯

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Bayesian feedback in the framework of ecological sciences

Arxiv

0+阅读 · 2023年5月29日

Maximum Optimality Margin: A Unified Approach for Contextual Linear Programming and Inverse Linear Programming

Arxiv

0+阅读 · 2023年5月28日

Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming

Arxiv

0+阅读 · 2023年5月26日

HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning

Arxiv

0+阅读 · 2023年5月25日

FemtoDet: An Object Detection Baseline for Energy Versus Performance Tradeoffs

Arxiv

0+阅读 · 2023年5月25日

Neural incomplete factorization: learning preconditioners for the conjugate gradient method

Arxiv

0+阅读 · 2023年5月25日

The Behavior and Convergence of Local Bayesian Optimization

Arxiv

0+阅读 · 2023年5月24日

Bayesian Deep Learning for Graphs

Arxiv

23+阅读 · 2022年2月24日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

相关基金

面向图像网状结构体的蚁群分割算法

国家自然科学基金

0+阅读 · 2017年12月31日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

RNF43介导的AKT/MDM2和NEDL1途径调控肝癌细胞恶性行为的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

Dnmt1调控斑马鱼造血干细胞产生、分化及迁移的分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于电路QED超强耦合机制的动力学演化和量子调控

国家自然科学基金

0+阅读 · 2013年12月31日

双相不锈钢2103连铸坯凝固过程热模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

移动互联网络中的博弈与协同激励机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RGM与neogenin信号调控应激性精神障碍-PTSD杏仁核、海马神经细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

survivin拮抗细胞衰老的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

视频选择性注意机理与语义特征提取

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员