A Rust-to-Lean Verification Pipeline with AI Provers: An Experience Report

We describe a verification pipeline that takes production Rust cryptographic code and produces machine-checked correctness proofs in Lean 4. The pipeline combines three components: symbolic extraction tools (Charon and Aeneas, or Hax) that lift Rust into Lean 4; formal cryptographic specification libraries (ArkLib and CompPoly, from the Verified zkEVM project) that provide the mathematical targets; and AI provers (Aristotle from Harmonic AI and Aleph from Logical Intelligence) that close the resulting proof obligations. Every proof is checked by the Lean kernel, so AI output cannot compromise soundness. Within the scope of the Ethereum Foundation's zkEVM Verification Project, we applied the pipeline to cryptographic primitives in Plonky3 (FRI folding, Mersenne31 and KoalaBear field arithmetic, Horner polynomial evaluation) and RISC Zero (Merkle inclusion verification). In addition, Aleph authored proofs of two bounds-style theorems in Plonky3's compute_log_arity_for_round that previously stood as sorry. The paper describes the architecture, walks through a running example based on Aleph's two proofs, reports which classes of proof obligations AI closed and which required manual work, and discusses the engineering gaps we encountered: Lean 4 toolchain drift across tools and specific Aeneas/Hax extraction limits. We also document concrete missing lemmas, tactic gaps, and code-generation friction points discovered during proof development. We hope this contribution lowers the barrier to adoption of formal verification and facilitates more effective use of AI in this pipeline. The result is a working pipeline for formal verification of Rust, with kernel-checked proofs and reproducible artefacts.

翻译：我们描述了一条验证流水线，该流水线将生产级Rust密码学代码转化为Lean 4中经机器检查的正确性证明。该流水线整合了三个组件：符号提取工具（Charon和Aeneas，或Hax），将Rust提升到Lean 4；形式化密码学规范库（来自Verified zkEVM项目的ArkLib和CompPoly），提供数学目标；以及AI证明器（Harmonic AI的Aristotle和Logical Intelligence的Aleph），用于闭合由此产生的证明义务。每个证明均由Lean内核检查，因此AI输出不会损害可靠性。在以太坊基金会zkEVM验证项目的范围内，我们将该流水线应用于Plonky3（FRI折叠、Mersenne31和KoalaBear域算术、Horner多项式求值）和RISC Zero（Merkle包含验证）中的密码学原语。此外，Aleph还为Plonky3中`compute_log_arity_for_round`的两个界限式定理撰写了证明，这些定理此前被标记为`sorry`。本文描述了该架构，以Aleph的两个证明为基础示例进行讲解，报告了AI闭合了哪类证明义务、哪些需要人工操作，并讨论了我们遇到的技术缺口：跨工具的Lean 4工具链漂移以及特定的Aeneas/Hax提取限制。我们还记录了在证明开发过程中发现的具体缺失引理、策略缺口和代码生成摩擦点。我们希望这一贡献能降低形式化验证的采用门槛，并促进AI在此流水线中的更有效使用。最终成果是一条用于Rust形式化验证的可用流水线，包含内核检查的证明和可复现的工件。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

工程可信赖的机器学习运维——基于零知识证明

专知会员服务

9+阅读 · 2025年5月27日

《美国防部人工智能功能的数据验证和认证》2024最新25页slides

专知会员服务

37+阅读 · 2024年5月23日

【2023新书】程序证明，Program Proofs，642页pdf

专知会员服务

67+阅读 · 2023年3月29日

【ICML2022】Branchformer:并行MLP-Attention架构，捕捉局部和全局上下文，用于语音识别和理解

专知会员服务

25+阅读 · 2022年7月8日