SAIF: A Comprehensive Framework for Evaluating the Risks of Generative AI in the Public Sector

The rapid adoption of generative AI in the public sector, encompassing diverse applications ranging from automated public assistance to welfare services and immigration processes, highlights its transformative potential while underscoring the pressing need for thorough risk assessments. Despite its growing presence, evaluations of risks associated with AI-driven systems in the public sector remain insufficiently explored. Building upon an established taxonomy of AI risks derived from diverse government policies and corporate guidelines, we investigate the critical risks posed by generative AI in the public sector while extending the scope to account for its multimodal capabilities. In addition, we propose a Systematic dAta generatIon Framework for evaluating the risks of generative AI (SAIF). SAIF involves four key stages: breaking down risks, designing scenarios, applying jailbreak methods, and exploring prompt types. It ensures the systematic and consistent generation of prompt data, facilitating a comprehensive evaluation while providing a solid foundation for mitigating the risks. Furthermore, SAIF is designed to accommodate emerging jailbreak methods and evolving prompt types, thereby enabling effective responses to unforeseen risk scenarios. We believe that this study can play a crucial role in fostering the safe and responsible integration of generative AI into the public sector.
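The four stages above can be read as a systematic enumeration: each decomposed risk is paired with every scenario, jailbreak method, and prompt type, so that coverage is consistent and new methods or types can be added without changing the procedure. The following is a minimal sketch of that reading, not the authors' implementation. The `PromptSpec` class, the `generate_specs` function, and the example risk, jailbreak, and prompt-type labels are all hypothetical; only the scenario names are taken from the abstract.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class PromptSpec:
    """One cell of the evaluation grid (hypothetical structure)."""
    risk: str
    scenario: str
    jailbreak: str
    prompt_type: str


def generate_specs(risks, scenarios, jailbreaks, prompt_types):
    """Enumerate every combination of the four stages.

    risks maps a top-level risk to its sub-risks (stage 1: breaking down risks);
    the remaining arguments are flat lists for stages 2 to 4.
    """
    specs = []
    for risk, sub_risks in risks.items():
        for sub_risk in sub_risks:
            for scenario, jailbreak, prompt_type in product(
                scenarios, jailbreaks, prompt_types
            ):
                specs.append(
                    PromptSpec(f"{risk}/{sub_risk}", scenario, jailbreak, prompt_type)
                )
    return specs


# Illustrative inputs; the risk, jailbreak, and prompt-type labels are assumptions.
RISKS = {"privacy": ["personal data disclosure", "profiling"]}
SCENARIOS = ["welfare services", "immigration processes"]
JAILBREAKS = ["none", "role-play"]
PROMPT_TYPES = ["text", "text+image"]

specs = generate_specs(RISKS, SCENARIOS, JAILBREAKS, PROMPT_TYPES)
print(len(specs))
```

Under this reading, accommodating an emerging jailbreak method or a new prompt type amounts to appending one entry to the corresponding list; the grid then grows without any change to the generation logic.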

