PAL*M: Property Attestation for Large Generative Models

Machine learning property attestations allow provers (e.g., model providers or owners) to attest properties of their models/datasets to verifiers (e.g., regulators, customers), enabling accountability towards regulations and policies. But, current approaches do not support generative models or large datasets. We present PAL*M, a property attestation framework for large generative models, illustrated using large language models. PAL*M defines properties across training and inference, leverages confidential virtual machines with security-aware GPUs for coverage of CPU-GPU operations, and proposes using incremental multiset hashing over memory-mapped datasets to efficiently track their integrity. We implement PAL*M on Intel TDX+NVIDIA H100 and evaluate it using state-of-the-art models and datasets, showing PAL*M is efficient, incurring < 11% overhead for common operations. Finally, we use the Tamarin Prover symbolic verification tool to formally model PAL*M's property attestation protocol, confirming that its security guarantees are upheld under the defined threat model.

翻译：机器学习属性证明允许证明方（如模型提供者或所有者）向验证方（如监管机构、客户）证明其模型/数据集的属性，从而实现对法规和政策的可问责性。然而，现有方法不支持生成式模型或大规模数据集。我们提出PAL*M，一个面向大型生成模型的属性证明框架，并以大语言模型为例进行说明。PAL*M定义了涵盖训练与推理阶段的属性，利用配备安全感知GPU的机密虚拟机覆盖CPU-GPU操作，并提出基于内存映射数据集的增量多重集哈希方法以高效追踪数据完整性。我们在Intel TDX+NVIDIA H100平台上实现PAL*M，并使用最先进的模型与数据集进行评估，结果表明PAL*M具有高效性，常见操作的开销低于11%。最后，我们利用Tamarin Prover符号验证工具对PAL*M的属性证明协议进行形式化建模，确认其在所定义威胁模型下能维持安全保证。

相关内容

属性

关注 2

一个具体事物，总是有许许多多的性质与关系，我们把一个事物的性质与关系，都叫作事物的属性。事物与属性是不可分的，事物都是有属性的事物，属性也都是事物的属性。一个事物与另一个事物的相同或相异，也就是一个事物的属性与另一事物的属性的相同或相异。由于事物属性的相同或相异，客观世界中就形成了许多不同的事物类。具有相同属性的事物就形成一类，具有不同属性的事物就分别地形成不同的类。

大型语言模型对齐技术综述：RLHF、RLAIF、PPO、DPO 等

专知会员服务

55+阅读 · 2024年7月24日

基于大语言模型（LLM）的合成数据生成、策展和评估的综述

专知会员服务

62+阅读 · 2024年7月5日

大模型如何统一生成和嵌入？最新《生成式表示指令微调》论文详细解答

专知会员服务

44+阅读 · 2024年2月18日