An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

Ross Gruetzemacher,Alan Chan,Kevin Frazier,Christy Manning,Štěpán Los,James Fox,José Hernández-Orallo,John Burden,Matija Franklin,Clíodhna Ní Ghuidhir,Mark Bailey,Daniel Eth,Toby Pilditch,Kyle Kilian

from arxiv, 50 pages, 2 figures

Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, development, and deployment activities with minimal oversight. In response, frontier AI system evaluations have been proposed as a way of assessing risks from the development and deployment of frontier AI systems. Yet, the budding AI risk evaluation ecosystem faces significant coordination challenges, such as a limited diversity of evaluators, suboptimal allocation of effort, and perverse incentives. This paper proposes a solution in the form of an international consortium for AI risk evaluations, comprising both AI developers and third-party AI risk evaluators. Such a consortium could play a critical role in international efforts to mitigate societal-scale risks from advanced AI, including in managing responsible scaling policies and coordinated evaluation-based risk response. In this paper, we discuss the current evaluation ecosystem and its shortcomings, propose an international consortium for advanced AI risk evaluations, discuss issues regarding its implementation, discuss lessons that can be learnt from previous international institutions and existing proposals for international AI governance institutions, and, finally, we recommend concrete steps to advance the establishment of the proposed consortium: (i) solicit feedback from stakeholders, (ii) conduct additional research, (iii) conduct a workshop(s) for stakeholders, (iv) analyze feedback and create final proposal, (v) solicit funding, and (vi) create a consortium.

翻译：鉴于高级AI的快速发展及前沿AI系统（即不断突破AI能力边界的高级AI系统）所带来的风险，AI治理与监管机制的创建和实施亟需优先考虑并投入大量资源。然而，现状难以为继，坦言之，危机四伏。监管真空使得AI实验室在研发、部署等活动中几乎不受约束。为此，前沿AI系统评估作为一种衡量前沿AI系统开发与部署风险的机制被提出。然而，新兴的AI风险评估体系面临显著的协调难题，例如评估主体多样性不足、资源配置效率低下以及激励扭曲等问题。本文提出解决方案：组建一个由AI开发方与第三方AI风险评估方共同参与的国际联合体。该联合体可在缓解高级AI社会层面风险的全球行动中发挥关键作用，包括实施负责任规模扩展策略与协调一致的评估风险响应。本文首先剖析现有评估体系及其缺陷，继而提出高级AI风险评估国际联合体方案，并探讨其落地实施的相关议题；随后回顾既往国际机构治理实践及现有AI国际治理提案的可借鉴经验；最终提出推动该联合体成立的具体行动建议：（一）征求利益相关方意见，（二）开展补充研究，（三）举办利益相关方研讨会，（四）分析反馈并形成最终方案，（五）筹措资金，（六）正式成立联合体。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日