Automated Feature Selection for Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) is an imitation learning approach to learning reward functions from expert demonstrations. Its use avoids the difficult and tedious procedure of manual reward specification while retaining the generalization power of reinforcement learning. In IRL, the reward is usually represented as a linear combination of features. In continuous state spaces, the state variables alone are not sufficiently rich to be used as features, but which features are good is not known in general. To address this issue, we propose a method that employs polynomial basis functions to form a candidate set of features, which are shown to allow the matching of statistical moments of state distributions. Feature selection is then performed for the candidates by leveraging the correlation between trajectory probabilities and feature expectations. We demonstrate the approach's effectiveness by recovering reward functions that capture expert policies across non-linear control tasks of increasing complexity. Code, data, and videos are available at https://sites.google.com/view/feature4irl.

翻译：逆强化学习（IRL）是一种从专家示范中学习奖励函数的模仿学习方法。该方法避免了手动指定奖励项这一困难且繁琐的过程，同时保留了强化学习的泛化能力。在逆强化学习中，奖励函数通常表示为特征的线性组合。在连续状态空间中，仅凭状态变量本身不足以作为有效特征，但究竟哪些特征具有良好性能通常未知。针对这一问题，我们提出一种采用多项式基函数构建候选特征集的方法，理论证明该方法能实现状态分布统计矩的匹配。通过利用轨迹概率与特征期望之间的相关性，对候选特征进行选择。在复杂度递增的非线性控制任务中，我们通过恢复能够捕捉专家策略的奖励函数，验证了该方法的有效性。代码、数据及视频见https://sites.google.com/view/feature4irl。

相关内容

特征选择

关注 5940

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日