Inverse Reinforcement Learning aims to recover reward models from expert demonstrations, but traditional methods yield "black-box" models that are difficult to interpret and debug. In this work, we introduce GRACE (Generating Rewards As CodE), a method that uses Large Language Models within an evolutionary search to reverse-engineer an interpretable, code-based reward function directly from expert trajectories. The resulting reward function is executable code that can be inspected and verified. We empirically validate GRACE on the BabyAI and AndroidWorld benchmarks, where it efficiently learns highly accurate rewards, even in complex multi-task settings. Further, we demonstrate that the resulting rewards lead to strong policies compared to both competitive Imitation Learning baselines and online RL approaches that use ground-truth rewards. Finally, we show that GRACE can build complex reward APIs in multi-task setups.
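
To make the notion of an executable, inspectable reward concrete, the following is a minimal sketch of what a code-based reward function could look like for a BabyAI-style task. The observation fields, the task, and the shaping terms are hypothetical illustrations chosen for this example, not output produced by GRACE.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Observation:
    """Minimal stand-in for a BabyAI-style observation (hypothetical fields)."""
    agent_pos: Tuple[int, int]   # (x, y) grid position of the agent
    target_pos: Tuple[int, int]  # (x, y) grid position of the goal cell
    carrying: Optional[str]      # name of the object the agent holds, if any


def reward_fn(obs: Observation) -> float:
    """Hypothetical code-based reward for a 'pick up the key, then reach the door' task.

    Because the reward is plain code, each term can be read, unit-tested,
    and debugged directly, which is the interpretability property highlighted above.
    """
    reward = 0.0
    # Dense shaping term: lightly penalize Manhattan distance to the target.
    dist = abs(obs.agent_pos[0] - obs.target_pos[0]) + abs(obs.agent_pos[1] - obs.target_pos[1])
    reward -= 0.01 * dist
    # Sub-goal bonus: the agent is holding the key.
    if obs.carrying == "key":
        reward += 0.5
        # Terminal bonus: at the door cell while holding the key.
        if obs.agent_pos == obs.target_pos:
            reward += 1.0
    return reward


# Example: agent standing on the door cell while carrying the key.
print(reward_fn(Observation(agent_pos=(2, 3), target_pos=(2, 3), carrying="key")))  # 1.5
```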