Solving Hard Mizar Problems with Instantiation and Strategy Invention

In this work, we prove over 3000 previously ATP-unproved Mizar/MPTP problems by using several ATP and AI methods, raising the number of ATP-solved Mizar problems from 75\% to above 80\%. First, we start to experiment with the cvc5 SMT solver which uses several instantiation-based heuristics that differ from the superposition-based systems, that were previously applied to Mizar,and add many new solutions. Then we use automated strategy invention to develop cvc5 strategies that largely improve cvc5's performance on the hard problems. In particular, the best invented strategy solves over 14\% more problems than the best previously available cvc5 strategy. We also show that different clausification methods have a high impact on such instantiation-based methods, again producing many new solutions. In total, the methods solve 3021 (21.3\%) of the 14163 previously unsolved hard Mizar problems. This is a new milestone over the Mizar large-theory benchmark and a large strengthening of the hammer methods for Mizar.

翻译：本工作中，我们通过结合多种自动定理证明与人工智能方法，证明了超过3000个先前未被ATP证明的Mizar/MPTP问题，将ATP可解的Mizar问题比例从75%提升至80%以上。首先，我们开始实验采用cvc5 SMT求解器，该求解器运用了多种基于实例化的启发式策略，这与先前应用于Mizar的基于超位置的系统不同，从而带来了大量新的解。随后，我们通过自动化策略发明来开发cvc5策略，这些策略显著提升了cvc5在困难问题上的性能。特别地，最优发明策略比先前可用的最优cvc5策略多解决了超过14%的问题。我们还证明了不同的子句化方法对此类基于实例化的方法具有重要影响，再次产生了许多新的解。总体而言，这些方法在14163个先前未解决的困难Mizar问题中解决了3021个（占21.3%）。这是Mizar大理论基准测试的新里程碑，也是对Mizar的“锤子”方法的重要加强。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日