Dafny is a popular verification language that automates proofs by outsourcing them to an SMT solver. This automation is not perfect, however, and the solver often requires guidance in the form of helper assertions, creating a burden for the proof engineer. In this paper, we propose Laurel, a tool that uses large language models (LLMs) to automatically generate helper assertions for Dafny programs. To improve the success rate of LLMs on this task, we design two domain-specific prompting techniques. First, we help the LLM determine the location of the missing assertion by analyzing the verifier's error message and inserting an assertion placeholder at that location. Second, we provide the LLM with example assertions from the same codebase, selected using a new lemma similarity metric. We evaluate these techniques on a dataset of helper assertions extracted from three real-world Dafny codebases. Our evaluation shows that Laurel generates over 50% of the required helper assertions within only a few attempts, making LLMs a usable and affordable tool for further automating practical program verification.
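The placeholder-insertion step described above can be illustrated with a minimal sketch. This is not Laurel's actual implementation: the error-message pattern follows Dafny's usual `file.dfy(line,col): Error: ...` format, and the placeholder token `<assertion>` is a hypothetical choice for this example.

```python
import re

def insert_assertion_placeholder(source: str, error_message: str,
                                 placeholder: str = "assert <assertion>;") -> str:
    """Insert a placeholder assertion at the line the verifier complained about.

    Assumes the Dafny error message contains a location of the form
    "(line,col)", e.g. "test.dfy(2,2): Error: assertion might not hold".
    """
    m = re.search(r"\((\d+),(\d+)\)", error_message)
    if m is None:
        return source  # no recognizable location; leave the program unchanged
    line = int(m.group(1))  # Dafny line numbers are 1-indexed
    lines = source.splitlines()
    # Reuse the failing line's indentation so the placeholder fits the code style.
    indent = len(lines[line - 1]) - len(lines[line - 1].lstrip())
    lines.insert(line - 1, " " * indent + placeholder)
    return "\n".join(lines)
```

An LLM prompted with the resulting program can then be asked to replace the `<assertion>` token with a concrete helper assertion, rather than having to locate the failure site itself.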