Rango: Adaptive Retrieval-Augmented Proving for Automated Software Verification

Formal verification using proof assistants, such as Coq, enables the creation of high-quality software. However, the verification process requires significant expertise and manual effort to write proofs. Recent work has explored automating proof synthesis using machine learning and large language models (LLMs). This work has shown that identifying relevant premises, such as lemmas and definitions, can aid synthesis. We present Rango, a fully automated proof synthesis tool for Coq that automatically identifies relevant premises and also similar proofs from the current project and uses them during synthesis. Rango uses retrieval augmentation at every step of the proof to automatically determine which proofs and premises to include in the context of its fine-tuned LLM. In this way, Rango adapts to the project and to the evolving state of the proof. We create a new dataset, CoqStoq, of 2,226 open-source Coq projects and 196,929 theorems from GitHub, which includes both training data and a curated evaluation benchmark of well-maintained projects. On this benchmark, Rango synthesizes proofs for 32.0% of the theorems, which is 29% more theorems than the prior state-of-the-art tool Tactician. Our evaluation also shows that Rango adding relevant proofs to its context leads to a 47% increase in the number of theorems proven.

翻译：使用Coq等证明辅助工具进行形式化验证能够创建高质量的软件。然而，验证过程需要大量专业知识和人工努力来编写证明。近期研究探索了利用机器学习和大型语言模型（LLMs）实现证明合成的自动化。这些研究表明，识别相关前提（如引理和定义）有助于合成。我们提出了Rango——一个用于Coq的全自动证明合成工具，它能自动识别当前项目中的相关前提及相似证明，并在合成过程中加以利用。Rango在证明的每个步骤都采用检索增强技术，通过其微调的LLM自动确定将哪些证明和前提纳入上下文。通过这种方式，Rango能够适应具体项目及证明的动态演化状态。我们创建了包含2,226个开源Coq项目和196,929个定理的新数据集CoqStoq（源自GitHub），其中既包含训练数据，也包含来自维护良好项目的精选评估基准。在该基准测试中，Rango成功合成了32.0%定理的证明，相比先前最先进的工具Tactician，可证明定理数量提升了29%。我们的评估还表明，Rango在上下文中添加相关证明可使已验证定理数量增加47%。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日