VeCoGen: Automating Generation of Formally Verified C Code with Large Language Models

Large Language Models (LLMs) have demonstrated impressive capabilities in generating code, yet they often produce programs that contain flaws or deviate from the intended behavior, which limits their suitability for safety-critical applications. To address this limitation, this paper introduces VeCoGen, a novel tool that combines LLMs with formal verification to automate the generation of formally verified C programs. VeCoGen takes three inputs: a formal specification in the ANSI/ISO C Specification Language (ACSL), a natural language specification, and a set of test cases. From these it attempts to generate a program in two steps. First, VeCoGen generates an initial set of candidate programs. Second, the tool iteratively improves on previously generated candidates. If a candidate program is verified against the formal specification, it is guaranteed to be correct with respect to that specification. We evaluate VeCoGen on 15 problems from Codeforces competitions, of which it solves 13. This work shows the potential of combining LLMs with formal verification to automate program generation.
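To make the three inputs concrete, the following is a minimal sketch of what an ACSL-specified C function can look like. It is an illustration only and is not taken from the paper's benchmark set: the function `max_int`, its natural language description, and the test cases in `run_tests` are all hypothetical. The ACSL contract sits in the `/*@ ... */` comment, and the function body stands in for a candidate program that an LLM might produce.

```c
#include <assert.h>

/* Natural language specification (hypothetical example):
 * return the larger of the two integers a and b. */

/* Formal specification in ACSL (hypothetical example).
 * "assigns \nothing" states that the function has no side effects;
 * the two "ensures" clauses together pin down the maximum. */
/*@ assigns \nothing;
    ensures \result >= a && \result >= b;
    ensures \result == a || \result == b;
*/
int max_int(int a, int b) {
    /* Candidate implementation, standing in for generated code. */
    return a > b ? a : b;
}

/* Test cases (hypothetical example). They only sample the behavior;
 * the ACSL contract above covers every input. */
void run_tests(void) {
    assert(max_int(3, 7) == 7);
    assert(max_int(-2, -5) == -2);
    assert(max_int(4, 4) == 4);
}
```

The abstract does not name the verifier. ACSL contracts of this kind are typically checked with Frama-C, and a candidate that passes such a check satisfies the contract for all inputs, whereas the test cases can only reject candidates that fail on the sampled inputs.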

