Differentiable Logic Machines

The integration of reasoning, learning, and decision-making is key to build more general artificial intelligence systems. As a step in this direction, we propose a novel neural-logic architecture, called differentiable logic machine (DLM), that can solve both inductive logic programming (ILP) and reinforcement learning (RL) problems, where the solution can be interpreted as a first-order logic program. Our proposition includes several innovations. Firstly, our architecture defines a restricted but expressive continuous relaxation of the space of first-order logic programs by assigning weights to predicates instead of rules, in contrast to most previous neural-logic approaches. Secondly, with this differentiable architecture, we propose several (supervised and RL) training procedures, based on gradient descent, which can recover a fully-interpretable solution (i.e., logic formula). Thirdly, to accelerate RL training, we also design a novel critic architecture that enables actor-critic algorithms. Fourthly, to solve hard problems, we propose an incremental training procedure that can learn a logic program progressively. Compared to state-of-the-art (SOTA) differentiable ILP methods, DLM successfully solves all the considered ILP problems with a higher percentage of successful seeds (up to 3.5$\times$). On RL problems, without requiring an interpretable solution, DLM outperforms other non-interpretable neural-logic RL approaches in terms of rewards (up to 3.9%). When enforcing interpretability, DLM can solve harder RL problems (e.g., Sorting, Path) Moreover, we show that deep logic programs can be learned via incremental supervised training. In addition to this excellent performance, DLM can scale well in terms of memory and computational time, especially during the testing phase where it can deal with much more constants ($>$2$\times$) than SOTA.

翻译：推理、学习与决策的整合是构建更通用人工智能系统的关键。为此，我们提出一种新型神经逻辑架构——可微逻辑机（DLM），该架构可同时解决归纳逻辑编程（ILP）与强化学习（RL）问题，且其解可解释为一阶逻辑程序。本研究包含多项创新：首先，区别于以往神经逻辑方法，我们的架构通过对谓词而非规则赋予权重，对一阶逻辑程序空间进行受限但具表达力的连续松弛；其次，基于该可微架构，我们提出多种基于梯度下降的（监督式与强化学习）训练流程，可恢复完全可解释的解（即逻辑公式）；第三，为加速强化学习训练，我们设计了一种新型评论家架构以支持演员-评论家算法；第四，为解决疑难问题，我们提出增量式训练流程，可逐步学习逻辑程序。与最先进的可微ILP方法相比，DLM成功解决了所有待测ILP问题，且成功种子比例提升高达3.5倍。在强化学习问题上，无需可解释解时，DLM的奖励值超越其他不可解释的神经逻辑强化学习方法（最高提升3.9%）；当强制可解释性时，DLM仍能解决更复杂的强化学习问题（如排序、路径问题）。此外，我们证明深度逻辑程序可通过增量式监督训练习得。除了卓越性能，DLM在内存与计算时间方面具有良好扩展性，尤其在测试阶段可处理比最先进方法多两倍以上的常量。

相关内容

ILP

关注 132

归纳逻辑程序设计（ILP）是机器学习的一个分支，它依赖于逻辑程序作为一种统一的表示语言来表达例子、背景知识和假设。基于一阶逻辑的ILP具有很强的表示形式，为多关系学习和数据挖掘提供了一种很好的方法。International Conference on Inductive Logic Programming系列始于1991年，是学习结构化或半结构化关系数据的首要国际论坛。最初专注于逻辑程序的归纳，多年来，它大大扩展了研究范围，并欢迎在逻辑学习、多关系数据挖掘、统计关系学习、图形和树挖掘等各个方面作出贡献，学习其他（非命题）基于逻辑的知识表示框架，探索统计学习和其他概率方法的交叉点。官网链接：https://ilp2019.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日