PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation

AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet the rapid expansion of AI policies creates substantial burdens for resource-constrained practitioners lacking policy expertise. Existing approaches typically address one policy at a time, making multi-policy compliance costly. We present PASTA, a scalable compliance tool integrating four innovations: (1) a comprehensive model-card format supporting descriptive inputs across development stages; (2) a policy normalization scheme; (3) an efficient LLM-powered pairwise evaluation engine with cost-saving strategies; and (4) an interface delivering interpretable evaluations via compliance heatmaps and actionable recommendations. Expert evaluation shows PASTA's judgments closely align with human experts ($ρ\geq .626$). The system evaluates five major policies in under two minutes at approximately \$3. A user study (N = 12) confirms practitioners found outputs easy-to-understand and actionable, introducing a novel framework for scalable automated AI governance.

翻译：随着AI系统日益强大和普及，AI合规性正变得愈发关键。然而，AI政策的快速扩张给缺乏政策专业知识且资源受限的实践者带来了沉重负担。现有方法通常一次仅处理单一政策，导致多政策合规成本高昂。本文提出PASTA，一种可扩展的合规性工具，其整合了四项创新：(1) 支持跨开发阶段描述性输入的综合性模型卡片格式；(2) 政策规范化方案；(3) 采用成本节约策略的高效LLM驱动配对评估引擎；(4) 通过合规性热力图与可操作建议提供可解释评估的交互界面。专家评估表明，PASTA的判断与人类专家高度一致（$ρ\geq .626$）。该系统可在约两分钟内以3美元左右的成本完成对五项主要政策的评估。一项用户研究（N = 12）证实实践者认为其输出易于理解且具备可操作性，从而为可扩展的自动化AI治理引入了新颖的框架。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

一种Agent自主性风险评估框架 | 最新文献

专知会员服务

24+阅读 · 2025年10月24日

多智能体协同决策研究全景透视：应用场景、研究方法、挑战与未来展望

专知会员服务

47+阅读 · 2025年3月20日

可解释人工智能（XAI）：从内在可解释性到大语言模型

专知会员服务

34+阅读 · 2025年1月20日

聚焦AIGC高质量发展新未来高金智库发布《生成式人工智能服务合规发展白皮书》，88页pdf

专知会员服务

69+阅读 · 2023年10月1日